-
Notifications
You must be signed in to change notification settings - Fork 605
Issues: sgl-project/sglang
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Enable Nvidia's ModelOpt fp8 quantized models
await-response
#2535
opened Dec 21, 2024 by
Edwardf0t1
Loading…
3 tasks
improve performance by removing use_tensor_core dependency
await-response
#2496
opened Dec 17, 2024 by
bjmsong
Loading…
3 tasks
[FIX] Update EOS from config
await-response
#2475
opened Dec 13, 2024 by
zhengy001
Loading…
1 of 3 tasks
[Bug] tp == 2 model gibberish
await-response
unable-reproduce
#2354
opened Dec 4, 2024 by
chalo2000
5 tasks done
[Bug] amdgpu,tp-size=2,Detected errors during sampling! NaN in the logits.
amd
await-response
#1953
opened Nov 8, 2024 by
linqingxu
5 tasks done
Surpport kv cache int8/int4 for triton backend
await-response
#1644
opened Oct 12, 2024 by
yuguo-Jack
Loading…
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.