Open
Description
With older CUDA versions (< 11.8), flashinfer fails to compile because we include the FP8 headers directly, without a guard. Older CUDA toolkits do not ship these headers, so the include itself breaks the build.
We should fix this by adding guard macros so that the FP8 headers are not included when FLASHINFER_ENABLE_FP8 is turned off.
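
A minimal sketch of the kind of guard proposed above, not the actual patch. It assumes the build system defines `FLASHINFER_ENABLE_FP8` only when FP8 support is requested; the namespace `fp8_guard_sketch`, the constant `kFp8Enabled`, and the alias `fp8_e4m3_t` are hypothetical names used for illustration.

```cpp
// Sketch only: assumes FLASHINFER_ENABLE_FP8 is defined by the build
// (e.g. -DFLASHINFER_ENABLE_FP8) only when FP8 support is turned on.
#ifdef FLASHINFER_ENABLE_FP8
#include <cuda_fp8.h>  // only shipped with CUDA 11.8 and newer
#endif

namespace fp8_guard_sketch {  // hypothetical namespace, not part of flashinfer

#ifdef FLASHINFER_ENABLE_FP8
// FP8 path: the header was included above, so its types are available.
constexpr bool kFp8Enabled = true;
using fp8_e4m3_t = __nv_fp8_e4m3;
#else
// Fallback path: no FP8 header and no FP8 types are referenced, so this
// translation unit still compiles against an older CUDA toolkit.
constexpr bool kFp8Enabled = false;
#endif

}  // namespace fp8_guard_sketch
```

Any code that uses FP8 types would need the same `#ifdef` around it, since guarding only the include would still leave references to undeclared types.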