Skip to content

Actions: deepinfra/vllm

clang-format

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
17 workflow runs
17 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

bugfix: Fix signature mismatch in benchmark's get_tokenizer functio…
clang-format #17: Commit c6db213 pushed by Pernekhan
January 13, 2025 16:58 18s main
January 13, 2025 16:58 18s
[Doc]Add documentation for using EAGLE in vLLM (#11417)
clang-format #16: Commit 973f5dc pushed by Pernekhan
January 7, 2025 23:55 17s main
January 7, 2025 23:55 17s
[Bugfix] Free cross attention block table for preempted-for-recompute…
clang-format #15: Commit 2f38518 pushed by Pernekhan
January 2, 2025 18:34 14s main
January 2, 2025 18:34 14s
[Model] Automatic conversion of classification and reward models (#11…
clang-format #14: Commit 3f3e92e pushed by NikolaBorisov
December 24, 2024 19:25 15s main
December 24, 2024 19:25 15s
[Misc]Reduce BNB static variable (#9987)
clang-format #13: Commit fb2716d pushed by Pernekhan
November 4, 2024 17:05 22s main
November 4, 2024 17:05 22s
[Doc] Include performance benchmark in README (#9135)
clang-format #12: Commit c0d9a98 pushed by Pernekhan
October 7, 2024 22:42 20s main
October 7, 2024 22:42 20s
[Doc] Update doc for Transformers 4.45 (#8817)
clang-format #11: Commit e2c6e0a pushed by Pernekhan
September 25, 2024 21:45 19s main
September 25, 2024 21:45 19s
[Misc] Support FP8 MoE for compressed-tensors (#8588)
clang-format #10: Commit 873edda pushed by Pernekhan
September 25, 2024 17:24 16s main
September 25, 2024 17:24 16s
[Bugfix] Fix 3.12 builds on main (#8510)
clang-format #9: Commit cca6164 pushed by Pernekhan
September 17, 2024 01:06 21s main
September 17, 2024 01:06 21s
[Frontend] Expose revision arg in OpenAI server (#8501)
clang-format #8: Commit 837c196 pushed by Pernekhan
September 16, 2024 16:29 20s main
September 16, 2024 16:29 20s
[Hotfix][VLM] Fixing max position embeddings for Pixtral (#8399)
clang-format #7: Commit 520ca38 pushed by Pernekhan
September 12, 2024 16:47 18s main
September 12, 2024 16:47 18s
[Misc] Fused MoE Marlin support for GPTQ (#8217)
clang-format #6: Commit 6cd5e5b pushed by Pernekhan
September 10, 2024 03:34 17s main
September 10, 2024 03:34 17s
[Misc] GPTQ Activation Ordering (#8135)
clang-format #5: Commit c7cb5c3 pushed by Pernekhan
September 10, 2024 01:15 20s main
September 10, 2024 01:15 20s
July 31, 2024 00:33 25s
Add FlashInfer to default Dockerfile (#6172)
clang-format #3: Commit 4f0e0ea pushed by Pernekhan
July 8, 2024 22:44 23s main
July 8, 2024 22:44 23s
July 8, 2024 17:46 16s
Support Deepseek-V2 (#4650)
clang-format #1: Commit be0b3af pushed by Pernekhan
June 28, 2024 20:45 21s main
June 28, 2024 20:45 21s