Skip to content

Pull requests: lightseekorg/tokenspeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

deps: bump tokenspeed-trtllm-kernel to 1.3.0rc15.post20260522+full
#227 opened May 23, 2026 by aaronliuls Contributor Loading…
3 tasks
[WIP] perf(eagle3): skip dead-position compute in draft catch-up step
#217 opened May 22, 2026 by rjzhb Loading…
1 task done
fix(deepseek-v4): close MTP acceptance gap
#207 opened May 21, 2026 by Xiangyi1996 Loading…
ci: try new mi350 machines
#200 opened May 20, 2026 by antiagainst Member Loading…
fix(logits): avoid nan in fused softcap
#183 opened May 19, 2026 by elwhyjay Contributor Loading…
feat(trtllm-MHA): support mixed prefill/decode batches
#176 opened May 18, 2026 by rjzhb Loading…
4 tasks done
perf(moe): triton biased grouped topk for deepseek-v3 routing
#171 opened May 17, 2026 by roycho96 Contributor Loading…
wip: EAGLE post-norm
#170 opened May 17, 2026 by Dogacel Draft
feat(kvstore): support mamba l2 cache transfers high priority
#162 opened May 15, 2026 by XucSh Contributor Loading…
perf(sampling): opt-in fast verify path for topk=1 chain spec
#133 opened May 13, 2026 by cicirori Collaborator Draft
4 tasks done
[WIP] feat(lora): LoRA adapter serving
#83 opened May 11, 2026 by qywu Collaborator Draft
1 of 7 tasks
fix: retraction load back race condition.
#74 opened May 11, 2026 by LorrinWWW Contributor Loading…
fix: wait per-layer on drafter KV pool during cpu cache loadback
#6 opened May 6, 2026 by LorrinWWW Contributor Loading…
ProTip! Add no:assignee to see everything that’s not assigned.