Pull requests: sgl-project/sglang

[Feat] lora strength param
Labels: diffusion, documentation, lora, run-ci
#15691 opened Dec 23, 2025 by Prozac614
4 of 6 tasks
update benchmark README to use --fp8-gemm-backend instead of env var
Labels: deepseek, documentation, nvidia
#15689 opened Dec 23, 2025 by leejnau
6 tasks
[sgl-kernel] feat: simplify tree_speculative_sampling_target_only
Labels: documentation, sgl-kernel, speculative-decoding
#15687 opened Dec 23, 2025 by alphabetc1
6 tasks
[diffusion] Add kernel for svdquant
Labels: quant, sgl-kernel
#15681 opened Dec 23, 2025 by jianyingzhu Draft
2 of 6 tasks
[Model] Add Ernie4.5 VL model support
#15679 opened Dec 23, 2025 by CSWYF3634076
6 tasks
Add auto bind numa node
#15678 opened Dec 23, 2025 by QiuMike
6 tasks
[Disaggregation] Validate TP size compatibility for non-MLA models
#15675 opened Dec 23, 2025 by chi2liu
6 tasks done
[AMD] Fix Indexer fp8_index_kernel with ROCm tilelang backend
#15673 opened Dec 23, 2025 by wufann
6 tasks
[model-gateway] Optimize router selection with lock-free snapshots
Labels: dependencies, model-gateway, router-benchmark, run-ci
#15672 opened Dec 23, 2025 by ppraneth
6 tasks
Improve tp*pp error message
#15669 opened Dec 23, 2025 by Monokaix
6 tasks done
[diffusion] Use sage attn as default backend for RTX5090
Labels: diffusion
#15668 opened Dec 23, 2025 by ryang-max Draft
6 tasks
Add kv_transfer_total_mb to metrics
Labels: run-ci
#15667 opened Dec 23, 2025 by merrymercy
[Diffusion] Flux.1.dev support Tensor Parallel
Labels: diffusion, run-ci
#15666 opened Dec 23, 2025 by BBuf
[Overlap Spec V2 Eagle] Support Triton spec v2 top k >1 and pagesize > 1
#15664 opened Dec 23, 2025 by Terry-Uv
1 of 8 tasks
[AMD] WIP - To fix CI hf amd run-ci
#15663 opened Dec 23, 2025 by yctseng0211
6 tasks
Add logprob kit to test accuracy
#15661 opened Dec 23, 2025 by hnyls2002
feat(multimodal): add load_mm_data_async for parallel image prefetching
Labels: deepseek, Multi-modal
#15659 opened Dec 23, 2025 by xiaomin-D
2 of 6 tasks
use fp8 deepep dispatch for the ue8m0 quant mtp layer
#15658 opened Dec 23, 2025 by rainj-me
6 tasks done
Fix AttributeError in the trace_req_start function
#15657 opened Dec 23, 2025 by rexrock
6 tasks
[CI] Remove pcg-omni-ci
Labels: run-ci
#15656 opened Dec 23, 2025 by Oasis-Git
6 tasks