Pull requests: jd-opensource/xllm
Pull requests list
bugfix: fix mrope calculation in the multimodal situation
#746
opened Jan 16, 2026 by
shan-chen-feng
bugfix: fix acl_graph_executor not handling q_cu_seq_lens parameter for deepseekv3.2.
#742
opened Jan 16, 2026 by
zhang-minchao
bugfix: enforce tool_choice parameter to control tool calling behavior
#737
opened Jan 15, 2026 by
QwertyJack
bugfix: fix incorrect async implementation in rerank interface.
#728
opened Jan 15, 2026 by
RobbieLeung
bugfix: return HTTP 400 instead of crashing when Content-Length header is missing
#727
opened Jan 14, 2026 by
QwertyJack
feat: add rec kernel and builder for pure device pipeline[2/3].
#723
opened Jan 14, 2026 by
LMX-xin
3 tasks
bugfix: fix streaming tool call missing function name for GLM-4.7
#722
opened Jan 14, 2026 by
QwertyJack
5 tasks done
feat: support deepseek mla fused_mla_q/fused_mla_kv on mlu device.
#714
opened Jan 14, 2026 by
a120092009
feat: add Cutlass support for Qwen3 W8A8 quantization.
#701
opened Jan 13, 2026 by
yingxudeng
Draft
feat: support audio modal input & refactor media decoder.
#682
opened Jan 8, 2026 by
xanecdotex
feat: auto rebase PR branch onto target branch before build.
#654
opened Jan 6, 2026 by
yingxudeng
Draft
feat: implement HCCL distributed communication for the DiT model.
#622
opened Dec 30, 2025 by
z-jun03
feat: introduce USE_NPU_TORCH flag for debugging and enhance NPU support for Qwen3-Dense[4/N].
torch_npu
#590
opened Dec 23, 2025 by
yingxudeng
Draft
feat: add xAttention support for Qwen3 generative recommendation.
#586
opened Dec 23, 2025 by
LMX-xin