Pull requests: vllm-project/vllm

Pull requests list

[WIP][Bugfix] Fix MLA attention crash with AWQ/GPTQ quantized models
Labels: bug
#34695 opened Feb 17, 2026 by haosdent

[Bugfix] Fix mypy errors for StructuredOutputsParams by using stdlib dataclass
Labels: bug
#34693 opened Feb 17, 2026 by hyeongyun0916
3 of 5 tasks

[ROCm] Enable DeepEP ROCm as all2allbackend for AMD GPUs.
Labels: rocm
#34692 opened Feb 17, 2026 by lcskrishna (Draft)
5 tasks

[ROCm] Enable bitsandbytes quantization support on ROCm
Labels: ci/build, documentation, ready, rocm
#34688 opened Feb 17, 2026 by Abdennacer-Badaoui
1 task done

[Docs]Fix documentation formatting in architecture overview
Labels: documentation
#34679 opened Feb 17, 2026 by lichuang
5 tasks

[GGUF][Model] Add Qwen3-Coder-Next GGUF support
Labels: multi-modality, qwen
#34678 opened Feb 17, 2026 by rudybear
6 tasks done

[Bugfix][CPU] Fix basic unit tests failing in CPU platforms
Labels: bug, nvidia
#34677 opened Feb 17, 2026 by jasonyanwenl
3 of 5 tasks

Add VLLM_SKIP_MODEL_VALIDATION environment variable
Labels: frontend
#34676 opened Feb 17, 2026 by dsingal0
5 tasks

Adding Nemotron fp8 Triton MoE Config
#34674 opened Feb 17, 2026 by yugong333
5 tasks

Update max_num_tokens value when specdec is enabled
Labels: v1
#34671 opened Feb 17, 2026 by shaharmor98
5 tasks

[Core] Default to "mp" rather than "uni" distributed executor backend
Labels: ready, v1
#34670 opened Feb 17, 2026 by njhill

[Bugfix] Fix benchmark_fused_collective crash on CustomOp init
Labels: bug, performance
#34665 opened Feb 17, 2026 by mayank-ketkar-sf
3 tasks done

Add MXFP8 to Marlin dense kernel
#34664 opened Feb 17, 2026 by mgoin
5 tasks

Separate TRTLLM and Flashinfer backends
Labels: documentation, nvidia, v1
#34663 opened Feb 17, 2026 by pavanimajety (Draft)
5 tasks

[Kernel][Perf] Fuse gather_block_tables + compute_slot_mappings into single kernel
Labels: performance, v1
#34660 opened Feb 17, 2026 by mayank-ketkar-sf
3 of 5 tasks