Issues: flashinfer-ai/flashinfer
Issues list
C++ benchmarks CMake error caused by enable_fp16 option in generate.py
#734
opened Jan 13, 2025 by
rtxxxpro
[RFC]: Introducing ReproSpec for Strong Reproducibility in LLM Inference
#733
opened Jan 11, 2025 by
yzh119
Inconsistent results between different sequences with sequence lengths less than a single page size
#725
opened Jan 8, 2025 by
fergusfinn
RuntimeError: Qwen2-VL does not support _Backend.FLASHINFER backend now
#720
opened Jan 7, 2025 by
duzw9311
top_p_renorm_prob is not numerically stable for small probability values
#708
opened Dec 30, 2024 by
merrymercy
[Question] How to support custom stride of paged_kv for hopper prefill attention
#702
opened Dec 27, 2024 by
jianfei-wangg
Different sequence numbers calculate inconsistent results
#696
opened Dec 24, 2024 by
sitabulaixizawaluduo
Deprecation Notice: Python 3.8 Wheel Support to End in future releases
#682
opened Dec 18, 2024 by
yzh119
[Bug] FlashInfer latest main wheel issue
Labels: bug, priority: high
#669
opened Dec 16, 2024 by
zhyncs
[Question] Overflow risks when batch size and sequence length grows extremely large
#596
opened Nov 8, 2024 by
rchardx