-
Notifications
You must be signed in to change notification settings - Fork 10
Commits on Jun 8, 2024
-
Configuration menu - View commit details
-
Copy full SHA for e69d23b - Browse repository at this point
Copy the full SHA e69d23bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 81ec16b - Browse repository at this point
Copy the full SHA 81ec16bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5500975 - Browse repository at this point
Copy the full SHA 5500975View commit details -
Configuration menu - View commit details
-
Copy full SHA for b913d04 - Browse repository at this point
Copy the full SHA b913d04View commit details -
Configuration menu - View commit details
-
Copy full SHA for 683a30b - Browse repository at this point
Copy the full SHA 683a30bView commit details -
[Build/CI] Enabling AMD Entrypoints Test (vllm-project#4834)
Co-authored-by: Alexey Kondratiev <alexey.kondratiev@amd.com>
Configuration menu - View commit details
-
Copy full SHA for c8794c3 - Browse repository at this point
Copy the full SHA c8794c3View commit details -
[Bugfix] Fix dummy weight for fp8 (vllm-project#4916)
Allow dummy load format for fp8, torch.uniform_ doesn't support FP8 at the moment Co-authored-by: Mor Zusman <morz@ai21.com>
Configuration menu - View commit details
-
Copy full SHA for 5b6a7b5 - Browse repository at this point
Copy the full SHA 5b6a7b5View commit details -
Configuration menu - View commit details
-
Copy full SHA for a5e66c7 - Browse repository at this point
Copy the full SHA a5e66c7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8a78ed8 - Browse repository at this point
Copy the full SHA 8a78ed8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6b46dcf - Browse repository at this point
Copy the full SHA 6b46dcfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 907d48a - Browse repository at this point
Copy the full SHA 907d48aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 11d6f7e - Browse repository at this point
Copy the full SHA 11d6f7eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5d98989 - Browse repository at this point
Copy the full SHA 5d98989View commit details -
Configuration menu - View commit details
-
Copy full SHA for 58a235b - Browse repository at this point
Copy the full SHA 58a235bView commit details -
[Bugfix] Fix flag name for
max_seq_len_to_capture
(vllm-project#4935)Signed-off-by: kerthcet <kerthcet@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 253d8fb - Browse repository at this point
Copy the full SHA 253d8fbView commit details -
Configuration menu - View commit details
-
Copy full SHA for f744125 - Browse repository at this point
Copy the full SHA f744125View commit details -
Configuration menu - View commit details
-
Copy full SHA for c1672a9 - Browse repository at this point
Copy the full SHA c1672a9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4b6c961 - Browse repository at this point
Copy the full SHA 4b6c961View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4b74974 - Browse repository at this point
Copy the full SHA 4b74974View commit details -
[Kernel] Fixup for CUTLASS kernels in CUDA graphs (vllm-project#4954)
Pass the CUDA stream into the CUTLASS GEMMs, to avoid future issues with CUDA graphs
Configuration menu - View commit details
-
Copy full SHA for 39c15ee - Browse repository at this point
Copy the full SHA 39c15eeView commit details -
[Misc] Load FP8 kv-cache scaling factors from checkpoints (vllm-proje…
…ct#4893) The 2nd PR for vllm-project#4532. This PR supports loading FP8 kv-cache scaling factors from a FP8 checkpoint (with .kv_scale parameter).
Configuration menu - View commit details
-
Copy full SHA for 2835fc6 - Browse repository at this point
Copy the full SHA 2835fc6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3db99a6 - Browse repository at this point
Copy the full SHA 3db99a6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 39a0a40 - Browse repository at this point
Copy the full SHA 39a0a40View commit details -
Configuration menu - View commit details
-
Copy full SHA for 847ca88 - Browse repository at this point
Copy the full SHA 847ca88View commit details -
Configuration menu - View commit details
-
Copy full SHA for c60384c - Browse repository at this point
Copy the full SHA c60384cView commit details -
Configuration menu - View commit details
-
Copy full SHA for dae5aaf - Browse repository at this point
Copy the full SHA dae5aafView commit details -
[Bugfix] Update Dockerfile.cpu to fix NameError: name 'vllm_ops' is n…
…ot defined (vllm-project#5009)
Configuration menu - View commit details
-
Copy full SHA for 05a4f64 - Browse repository at this point
Copy the full SHA 05a4f64View commit details -
[Core][1/N] Support send/recv in PyNCCL Groups (vllm-project#4988)
Signed-off-by: Muralidhar Andoorveedu <muralidhar.andoorveedu@centml.ai>
Configuration menu - View commit details
-
Copy full SHA for bf4c411 - Browse repository at this point
Copy the full SHA bf4c411View commit details -
[Kernel] Initial Activation Quantization Support (vllm-project#4525)
Co-authored-by: Varun Sundar Rabindranath <varunsundar08@gmail.com> Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
Configuration menu - View commit details
-
Copy full SHA for c623663 - Browse repository at this point
Copy the full SHA c623663View commit details -
[Core]: Option To Use Prompt Token Ids Inside Logits Processor (vllm-…
…project#4985) Co-authored-by: Elisei Smirnov <el.smirnov@innopolis.university>
Configuration menu - View commit details
-
Copy full SHA for a9ca32d - Browse repository at this point
Copy the full SHA a9ca32dView commit details -
[Doc] add ccache guide in doc (vllm-project#5012)
Co-authored-by: Michael Goin <michael@neuralmagic.com>
Configuration menu - View commit details
-
Copy full SHA for 0eb33b1 - Browse repository at this point
Copy the full SHA 0eb33b1View commit details -
[Kernel] Initial Activation Quantization Support (vllm-project#4525)
Co-authored-by: Varun Sundar Rabindranath <varunsundar08@gmail.com> Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
Configuration menu - View commit details
-
Copy full SHA for acf362c - Browse repository at this point
Copy the full SHA acf362cView commit details -
[Core][Bugfix]: fix prefix caching for blockv2 (vllm-project#4764)
Co-authored-by: Lei Wen <wenlei03@qiyi.com>
Configuration menu - View commit details
-
Copy full SHA for 1226d5d - Browse repository at this point
Copy the full SHA 1226d5dView commit details -
[Kernel][Backend][Model] Blocksparse flash attention kernel and Phi-3…
…-Small model (vllm-project#4799) Co-authored-by: beagleski <yunanzhang@microsoft.com> Co-authored-by: bapatra <bapatra@microsoft.com> Co-authored-by: Barun Patra <codedecde@users.noreply.github.com> Co-authored-by: Michael Goin <michael@neuralmagic.com>
Configuration menu - View commit details
-
Copy full SHA for 29a2098 - Browse repository at this point
Copy the full SHA 29a2098View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3fe7e52 - Browse repository at this point
Copy the full SHA 3fe7e52View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8768b3f - Browse repository at this point
Copy the full SHA 8768b3fView commit details -
Configuration menu - View commit details
-
Copy full SHA for e7e376f - Browse repository at this point
Copy the full SHA e7e376fView commit details -
[Bugfix / Core] Prefix Caching Guards (merged with main) (vllm-projec…
…t#4846) Co-authored-by: rsnm2 <rshaw@neuralmagic.com> Co-authored-by: Robert Shaw <114415538+robertgshaw2-neuralmagic@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 67ce9ea - Browse repository at this point
Copy the full SHA 67ce9eaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2c59c91 - Browse repository at this point
Copy the full SHA 2c59c91View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9fb7b82 - Browse repository at this point
Copy the full SHA 9fb7b82View commit details -
[Core] Sliding window for block manager v2 (vllm-project#4545)
Co-authored-by: Ruth Evans <ruthevans@Ruths-MacBook-Pro.local>
Configuration menu - View commit details
-
Copy full SHA for 954c332 - Browse repository at this point
Copy the full SHA 954c332View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9929fb2 - Browse repository at this point
Copy the full SHA 9929fb2View commit details -
[Kernel][ROCm][AMD] Add fused_moe Triton configs for MI300X (vllm-pro…
…ject#4951) This PR adds Triton kernel configs for the MoE kernel for MI300X
Configuration menu - View commit details
-
Copy full SHA for b22d985 - Browse repository at this point
Copy the full SHA b22d985View commit details -
Configuration menu - View commit details
-
Copy full SHA for 54c17a9 - Browse repository at this point
Copy the full SHA 54c17a9View commit details -
[Core] Consolidate prompt arguments to LLM engines (vllm-project#4328)
Co-authored-by: Roger Wang <ywang@roblox.com>
Configuration menu - View commit details
-
Copy full SHA for 8c9aab4 - Browse repository at this point
Copy the full SHA 8c9aab4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 705789d - Browse repository at this point
Copy the full SHA 705789dView commit details -
[Misc] add gpu_memory_utilization arg (vllm-project#5079)
Signed-off-by: pandyamarut <pandyamarut@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 95c2a3d - Browse repository at this point
Copy the full SHA 95c2a3dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9175890 - Browse repository at this point
Copy the full SHA 9175890View commit details -
Configuration menu - View commit details
-
Copy full SHA for 420c4ff - Browse repository at this point
Copy the full SHA 420c4ffView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5bde5ba - Browse repository at this point
Copy the full SHA 5bde5baView commit details -
[Core] Cross-attention KV caching and memory-management (towards even…
…tual encoder/decoder model support) (vllm-project#4837)
Configuration menu - View commit details
-
Copy full SHA for b86aa89 - Browse repository at this point
Copy the full SHA b86aa89View commit details -
Configuration menu - View commit details
-
Copy full SHA for f63e8dd - Browse repository at this point
Copy the full SHA f63e8ddView commit details -
Configuration menu - View commit details
-
Copy full SHA for 62a4fcb - Browse repository at this point
Copy the full SHA 62a4fcbView commit details -
Configuration menu - View commit details
-
Copy full SHA for f900bcc - Browse repository at this point
Copy the full SHA f900bccView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6824b2f - Browse repository at this point
Copy the full SHA 6824b2fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 623275f - Browse repository at this point
Copy the full SHA 623275fView commit details -
[Bugfix / Core] Prefix Caching Guards (merged with main) (vllm-projec…
…t#4846) Co-authored-by: rsnm2 <rshaw@neuralmagic.com> Co-authored-by: Robert Shaw <114415538+robertgshaw2-neuralmagic@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 15dcd3e - Browse repository at this point
Copy the full SHA 15dcd3eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5763c73 - Browse repository at this point
Copy the full SHA 5763c73View commit details -
[CI/Build] Docker cleanup functionality for amd servers (vllm-project…
…#5112) Co-authored-by: Alexey Kondratiev <alexey.kondratiev@amd.com> Co-authored-by: Alexei-V-Ivanov-AMD <156011006+Alexei-V-Ivanov-AMD@users.noreply.github.com> Co-authored-by: Alexei V. Ivanov <alexei.ivanov@amd.com> Co-authored-by: omkarkakarparthi <okakarpa>
Configuration menu - View commit details
-
Copy full SHA for 3a8332c - Browse repository at this point
Copy the full SHA 3a8332cView commit details -
[BUGFIX] [FRONTEND] Correct chat logprobs (vllm-project#5029)
Co-authored-by: Breno Faria <breno.faria@intrafind.com>
Configuration menu - View commit details
-
Copy full SHA for 11a5a26 - Browse repository at this point
Copy the full SHA 11a5a26View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2827c68 - Browse repository at this point
Copy the full SHA 2827c68View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4ae80dd - Browse repository at this point
Copy the full SHA 4ae80ddView commit details -
Configuration menu - View commit details
-
Copy full SHA for 886ead6 - Browse repository at this point
Copy the full SHA 886ead6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 758b903 - Browse repository at this point
Copy the full SHA 758b903View commit details -
add doc about serving option on dstack (vllm-project#3074)
Co-authored-by: Roger Wang <ywang@roblox.com>
Configuration menu - View commit details
-
Copy full SHA for a190463 - Browse repository at this point
Copy the full SHA a190463View commit details -
Configuration menu - View commit details
-
Copy full SHA for 51cf757 - Browse repository at this point
Copy the full SHA 51cf757View commit details -
Configuration menu - View commit details
-
Copy full SHA for c72d890 - Browse repository at this point
Copy the full SHA c72d890View commit details -
Configuration menu - View commit details
-
Copy full SHA for cf0711b - Browse repository at this point
Copy the full SHA cf0711bView commit details -
[Kernel] Marlin_24: Ensure the mma.sp instruction is using the ::orde…
…red_metadata modifier (introduced with PTX 8.5) (vllm-project#5136)
Configuration menu - View commit details
-
Copy full SHA for dcaf819 - Browse repository at this point
Copy the full SHA dcaf819View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7da3c3f - Browse repository at this point
Copy the full SHA 7da3c3fView commit details -
[Model] Support MAP-NEO model (vllm-project#5081)
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 2c66f17 - Browse repository at this point
Copy the full SHA 2c66f17View commit details -
Revert "[Kernel] Marlin_24: Ensure the mma.sp instruction is using th…
…e ::ordered_metadata modifier (introduced with PTX 8.5)" (vllm-project#5149)
Configuration menu - View commit details
-
Copy full SHA for 5388c64 - Browse repository at this point
Copy the full SHA 5388c64View commit details -
[Misc]: optimize eager mode host time (vllm-project#4196)
Co-authored-by: xuhao <xuhao@cambricon.com>
Configuration menu - View commit details
-
Copy full SHA for 5e9f300 - Browse repository at this point
Copy the full SHA 5e9f300View commit details -
Configuration menu - View commit details
-
Copy full SHA for f329e2e - Browse repository at this point
Copy the full SHA f329e2eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 951e3d2 - Browse repository at this point
Copy the full SHA 951e3d2View commit details -
Configuration menu - View commit details
-
Copy full SHA for d349dbd - Browse repository at this point
Copy the full SHA d349dbdView commit details -
Configuration menu - View commit details
-
Copy full SHA for 031fd4e - Browse repository at this point
Copy the full SHA 031fd4eView commit details
Commits on Jun 9, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 9ed5f76 - Browse repository at this point
Copy the full SHA 9ed5f76View commit details -
Configuration menu - View commit details
-
Copy full SHA for ec71544 - Browse repository at this point
Copy the full SHA ec71544View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7381340 - Browse repository at this point
Copy the full SHA 7381340View commit details -
Configuration menu - View commit details
-
Copy full SHA for c23ca05 - Browse repository at this point
Copy the full SHA c23ca05View commit details -
Configuration menu - View commit details
-
Copy full SHA for 85512eb - Browse repository at this point
Copy the full SHA 85512ebView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0cea2c2 - Browse repository at this point
Copy the full SHA 0cea2c2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 31147df - Browse repository at this point
Copy the full SHA 31147dfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2256610 - Browse repository at this point
Copy the full SHA 2256610View commit details -
Configuration menu - View commit details
-
Copy full SHA for 01973f5 - Browse repository at this point
Copy the full SHA 01973f5View commit details -
Configuration menu - View commit details
-
Copy full SHA for a1a659d - Browse repository at this point
Copy the full SHA a1a659dView commit details -
Configuration menu - View commit details
-
Copy full SHA for c50784c - Browse repository at this point
Copy the full SHA c50784cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 99fa9f8 - Browse repository at this point
Copy the full SHA 99fa9f8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2ec6643 - Browse repository at this point
Copy the full SHA 2ec6643View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0bb099c - Browse repository at this point
Copy the full SHA 0bb099cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 198f364 - Browse repository at this point
Copy the full SHA 198f364View commit details -
Configuration menu - View commit details
-
Copy full SHA for ec0e89a - Browse repository at this point
Copy the full SHA ec0e89aView commit details -
Configuration menu - View commit details
-
Copy full SHA for e6f1cbd - Browse repository at this point
Copy the full SHA e6f1cbdView commit details -
Configuration menu - View commit details
-
Copy full SHA for ca8d74a - Browse repository at this point
Copy the full SHA ca8d74aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5335ad9 - Browse repository at this point
Copy the full SHA 5335ad9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 611cfed - Browse repository at this point
Copy the full SHA 611cfedView commit details -
Configuration menu - View commit details
-
Copy full SHA for 73132a5 - Browse repository at this point
Copy the full SHA 73132a5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7f5c715 - Browse repository at this point
Copy the full SHA 7f5c715View commit details
Commits on Jun 10, 2024
-
4
Configuration menu - View commit details
-
Copy full SHA for 437912e - Browse repository at this point
Copy the full SHA 437912eView commit details -
4
Configuration menu - View commit details
-
Copy full SHA for 950981c - Browse repository at this point
Copy the full SHA 950981cView commit details