-
Notifications
You must be signed in to change notification settings - Fork 605
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Feature] Streaming API for Function Calling
#2576
opened Dec 25, 2024 by
HaoyuWang4188
•
Draft
3 tasks
[unittest] add unit test to test quant args of srt engine
#2574
opened Dec 25, 2024 by
JamesSand
Loading…
3 tasks done
Refactor SchedulePolicy to improve code organization
#2571
opened Dec 25, 2024 by
libratiger
Loading…
3 tasks done
Fix duplicated handling of GetWeightsByNameReqInput
#2565
opened Dec 24, 2024 by
fzyzcjy
Loading…
3 tasks
Error occurs when loading the gemma model in bitsandbytes format.
#2557
opened Dec 23, 2024 by
upskyy
Loading…
1 of 3 tasks
feat:support 2 kenrels for mixed chunked prefill
#2546
opened Dec 22, 2024 by
chosen-ox
Loading…
2 tasks
Enable Nvidia's ModelOpt fp8 quantized models
await-response
#2535
opened Dec 21, 2024 by
Edwardf0t1
Loading…
3 tasks
[Feature] Support new parameter - EBNF in xgrammar
#2526
opened Dec 19, 2024 by
adarshxs
Loading…
2 of 3 tasks
adapt custom allreduce for tensorrt llm
high priority
#2511
opened Dec 18, 2024 by
yizhang2077
Loading…
3 tasks
improve performance by removing use_tensor_core dependency
await-response
#2496
opened Dec 17, 2024 by
bjmsong
Loading…
3 tasks
[Experimental] Add a gRPC server for completion request
high priority
#2478
opened Dec 13, 2024 by
MrAta
Loading…
2 of 3 tasks
[FIX] Update EOS from config
await-response
#2475
opened Dec 13, 2024 by
zhengy001
Loading…
1 of 3 tasks
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.