-
Notifications
You must be signed in to change notification settings - Fork 264
Comparing changes
Open a pull request
base repository: ModelTC/lightllm
base: v1.0.0
head repository: ModelTC/lightllm
compare: v1.0.1
- 14 commits
- 76 files changed
- 13 contributors
Commits on Feb 19, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 5c28b33 - Browse repository at this point
Copy the full SHA 5c28b33View commit details -
Co-authored-by: shihaobai <baishihao@sensetime.com>
Configuration menu - View commit details
-
Copy full SHA for 971936e - Browse repository at this point
Copy the full SHA 971936eView commit details
Commits on Feb 20, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 57bb6e1 - Browse repository at this point
Copy the full SHA 57bb6e1View commit details -
add RETURN_LIST for tgi_api (#742)
Co-authored-by: shihaobai <baishihao@sensetime.com>
Configuration menu - View commit details
-
Copy full SHA for 9313a08 - Browse repository at this point
Copy the full SHA 9313a08View commit details
Commits on Feb 21, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 00e4de7 - Browse repository at this point
Copy the full SHA 00e4de7View commit details
Commits on Feb 22, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 808d832 - Browse repository at this point
Copy the full SHA 808d832View commit details -
deepseekv3 bmm noquant and fix moe gemm bug. (#745)
Co-authored-by: shihaobai <baishihao@sensetime.com> Co-authored-by: shihaobai <42648726+shihaobai@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for c483b1e - Browse repository at this point
Copy the full SHA c483b1eView commit details
Commits on Feb 26, 2025
-
Change: - change launch argument from `--simple_constraint_mode` to `--output_constraint_mode`, now user can choose the constriant decode backend from ['outlines', 'xgrammar'] - add `XgrammarBackend` used for xgrammar constraint decode, maybe we should merge it with `SimpleConstraintBackend` later? - now we adopt the same request body, the same as vLLM with xgrammar(https://docs.vllm.ai/en/stable/serving/openai_compatible_server.html) - user can add `guided_grammar` to pass a EBNF grammar and execute the constraint decode - user can add `guided_json` to pass a standard json schema and do the constraint decode --------- Co-authored-by: hiworldwzj <30762946+hiworldwzj@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for c8c892a - Browse repository at this point
Copy the full SHA c8c892aView commit details -
fuse fp8 quant in kv copying and add flashinfer decode mla operator i…
…n the attention module (#737)
Configuration menu - View commit details
-
Copy full SHA for d7c0a4b - Browse repository at this point
Copy the full SHA d7c0a4bView commit details -
fix: add flashinfer-python in the requirements.txt (#749)
Co-authored-by: niushengxiao <niushengxiao@sensetime.com>
Configuration menu - View commit details
-
Copy full SHA for 39d9a59 - Browse repository at this point
Copy the full SHA 39d9a59View commit details -
Configuration menu - View commit details
-
Copy full SHA for f6f7a41 - Browse repository at this point
Copy the full SHA f6f7a41View commit details
Commits on Feb 27, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 4a974c2 - Browse repository at this point
Copy the full SHA 4a974c2View commit details
Commits on Mar 1, 2025
-
add support for multinode tp (#751)
Co-authored-by: wufeiyang <jayfeather9@qq.com> Co-authored-by: root <root@pt-290ac8041d114af0b1647509a5544872-master-0.pt-290ac8041d114af0b1647509a5544872.ns-devoversea-d41e68bd.svc.cluster.local> Co-authored-by: shihaobai <baishihao@sensetime.com> Co-authored-by: root <root@pt-511f450a52c24c2d9df9b20f0c8ebdb7-master-0.pt-511f450a52c24c2d9df9b20f0c8ebdb7.ns-devoversea-d41e68bd.svc.cluster.local> Co-authored-by: Feiyang Wu <wufeiyang@sensetime.com> Co-authored-by: wangzaijun <wzjhelloworld@qq.com> Co-authored-by: hiworldwzj <30762946+hiworldwzj@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 2dbd8e5 - Browse repository at this point
Copy the full SHA 2dbd8e5View commit details
Commits on Mar 3, 2025
-
fix /tokens docs && tag v1.0.1 (#753)
Co-authored-by: shihaobai <baishihao@sensetime.com>
Configuration menu - View commit details
-
Copy full SHA for b418bdf - Browse repository at this point
Copy the full SHA b418bdfView commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff v1.0.0...v1.0.1