Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Super tiny fix version
#2199 opened Dec 11, 2025 by fzyzcjy Loading…
5 tasks
Permute page table in benchmarking
#2194 opened Dec 10, 2025 by jhjpark Loading…
3 of 5 tasks
refactor: update fa3 codebase [part 2]
#2192 opened Dec 9, 2025 by yzh119 Loading…
4 of 5 tasks
Fix for moe on sm110
#2190 opened Dec 9, 2025 by jhalabi-nv Loading…
3 of 5 tasks
Add CUDA graph buffers for persistent attention
#2185 opened Dec 7, 2025 by Edenzzzz Loading…
5 tasks
Fix/moe_sm110 (to be tested)
#2183 opened Dec 6, 2025 by aleozlx Draft
5 tasks
Enable Hopper FA3 FP8 attention
#2148 opened Nov 28, 2025 by nvpohanh Draft
5 tasks
make DeepGEMM swapAB available for linear gemm SM90
#2131 opened Nov 22, 2025 by katec846 Loading…
3 of 5 tasks
perf: using multi-cta optimization for top-k/top-p
#2119 opened Nov 20, 2025 by yzh119 Loading…
4 of 5 tasks
Refactor trtllm_mnnvl_allreduce
#2118 opened Nov 20, 2025 by timlee0212 Loading…
5 tasks done
feat: support more head dim in RoPE kernel
#2109 opened Nov 19, 2025 by raayandhar Loading…
5 tasks done
Port TRT-LLM communication kernels to flashinfer
#2102 opened Nov 18, 2025 by djns99 Loading…
5 tasks done
make DeepGEMM swapAB available for linear gemm SM90
#2101 opened Nov 17, 2025 by xuanzic Loading…
5 tasks
feat: add sink to flashinfer decode
#2087 opened Nov 13, 2025 by djmmoss Loading…
feat: BF16 GEMM using CUTLASS backend for SM100
#2070 opened Nov 10, 2025 by raayandhar Loading…
5 tasks done
ProTip! Follow long discussions with comments:>50.