Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Intern s2 preview lite awq fix bug
#4600 opened May 19, 2026 by 43758726 Collaborator Loading…
[WIP]: Support reuse routed experts on eviction
#4599 opened May 19, 2026 by RunningLeon Collaborator Loading…
Refactor proxy server improvement
#4596 opened May 18, 2026 by lvhan028 Collaborator Draft
update anthropic endpoint test
#4594 opened May 18, 2026 by littlegy Contributor Loading…
log reponse for debugging
#4592 opened May 18, 2026 by lvhan028 Collaborator Loading…
fix: enable FA3 for SM80+ GPUs and fix CUDA version comparison Bug:P1
#4591 opened May 18, 2026 by windreamer Collaborator Loading…
1 of 4 tasks
docs(advance): add Add a New Speculative Decoding Method guide documentation Improvements or additions to documentation
#4589 opened May 17, 2026 by SuperMarioYL Loading…
4 tasks done
refactor ascend multinode
#4588 opened May 15, 2026 by yao-fengchen Collaborator Draft
tool calling alignment with openai's spec improvement
#4585 opened May 13, 2026 by lvhan028 Collaborator Loading…
Add OpenAI Responses-compatible endpoint enhancement New feature or request
#4582 opened May 13, 2026 by CUHKSZzxy Collaborator Loading…
[security] fix(proxy): require auth for node management
#4579 opened May 11, 2026 by Hinotoi-agent Loading…
5 of 9 tasks
feat: configure cudagraph capture batch sizes
#4573 opened May 8, 2026 by CUHKSZzxy Collaborator Draft
Fix health latency under concurrent VL request preparation Bug:P0
#4570 opened May 7, 2026 by CUHKSZzxy Collaborator Loading…
LLM evaluation skill on text datasets
#4566 opened Apr 30, 2026 by lvhan028 Collaborator Loading…
FP8 kv cache quantization enhancement New feature or request
#4563 opened Apr 29, 2026 by CUHKSZzxy Collaborator Loading…
[Feature] Add guided decoding support for speculative decoding enhancement New feature or request
#4559 opened Apr 28, 2026 by windreamer Collaborator Loading…
4 tasks done
DeepSeek V4 support
#4554 opened Apr 24, 2026 by grimoire Collaborator Loading…
Test: update sleep/wakeup and abort scenarios
#4528 opened Apr 15, 2026 by littlegy Contributor Loading…
style: add autopep8 pre-commit hook and apply PEP 8 formatting fixes
#4524 opened Apr 14, 2026 by windreamer Collaborator Loading…
make fp8 model quantized by llm-compressor can be inferenced in turbomind enhancement New feature or request
#4509 opened Apr 8, 2026 by 43758726 Collaborator Loading…
Integrate deep-ep nccl backend enhancement New feature or request
#4477 opened Mar 27, 2026 by irexyc Collaborator Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.