Pull requests: vllm-project/tpu-inference
[Feature] [TPU host offload] sglang bench_serving tool example
#892
opened Oct 18, 2025 by
saikat-royc
[CI] remove lora_bias_stacked as it is deprecated in vllm
#835
opened Oct 11, 2025 by
bzgoogle
feat: Add a procedure to record the vllm and tpu_inference commit hashes in CI pipeline (WIP)
#795
opened Oct 7, 2025 by
dennisYehCienet
[do not review/merge] qwen 3 working microbenchmark
#632
opened Sep 3, 2025 by
mailvijayasingh
[Draft-WIP- Do Not Merge] RPA Kernel v3 MB for llama
#571
opened Aug 26, 2025 by
mailvijayasingh
Create TorchaxMergedColumnParallelLinearWithLoRA lora wrapper for single chip
#496
opened Aug 18, 2025 by
vanbasten23