Skip to content

Pull requests: sgl-project/sglang

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Feature] Streaming API for Function Calling
#2576 opened Dec 25, 2024 by HaoyuWang4188 Draft
3 tasks
[unittest] add unit test to test quant args of srt engine
#2574 opened Dec 25, 2024 by JamesSand Loading…
3 tasks done
[Docs] add quantization docs
#2572 opened Dec 25, 2024 by JamesSand Loading…
3 tasks done
Refactor SchedulePolicy to improve code organization
#2571 opened Dec 25, 2024 by libratiger Loading…
3 tasks done
Fix duplicated handling of GetWeightsByNameReqInput
#2565 opened Dec 24, 2024 by fzyzcjy Loading…
3 tasks
Super tiny typo fix
#2564 opened Dec 24, 2024 by fzyzcjy Loading…
3 tasks
h100 tuning fused_moe_triton for qwen2 moe
#2560 opened Dec 23, 2024 by BBuf Loading…
3 tasks done
Error occurs when loading the gemma model in bitsandbytes format.
#2557 opened Dec 23, 2024 by upskyy Loading…
1 of 3 tasks
Fix cache hit rate when chunked prefill
#2555 opened Dec 23, 2024 by hnyls2002 Loading…
Fix packet loss when deploy little model
#2548 opened Dec 23, 2024 by sdli1995 Draft
3 tasks
feat:support 2 kenrels for mixed chunked prefill
#2546 opened Dec 22, 2024 by chosen-ox Loading…
2 tasks
[Feature] Function Tooling
#2544 opened Dec 22, 2024 by Tushar-ml Loading…
2 of 3 tasks
[Cache Offload] Remove device sync overhead
#2533 opened Dec 20, 2024 by Edenzzzz Loading…
3 tasks
[Feature] Support new parameter - EBNF in xgrammar
#2526 opened Dec 19, 2024 by adarshxs Loading…
2 of 3 tasks
fix: package data missing
#2521 opened Dec 19, 2024 by yudian0504 Loading…
Add generator-style run_batch function
#2513 opened Dec 18, 2024 by xingyaoww Loading…
adapt custom allreduce for tensorrt llm high priority
#2511 opened Dec 18, 2024 by yizhang2077 Loading…
3 tasks
torcho gemlite integration
#2498 opened Dec 17, 2024 by HDCharles Loading…
3 tasks
[Experimental] Add a gRPC server for completion request high priority
#2478 opened Dec 13, 2024 by MrAta Loading…
2 of 3 tasks
[FIX] Update EOS from config await-response
#2475 opened Dec 13, 2024 by zhengy001 Loading…
1 of 3 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.