Actions: vllm-project/vllm

PR Reminder Comment Bot

1,997 workflow runs

Rahul quant merged
PR Reminder Comment Bot #1997: Pull request #10341 opened by robertgshaw2-neuralmagic
November 14, 2024 20:47 14s
[Kernel] Add CUTLASS sparse support, heuristics, and torch operators
PR Reminder Comment Bot #1996: Pull request #10340 opened by Faraz9877
November 14, 2024 20:41 12s
[Perf] Reduce peak memory usage of llama
PR Reminder Comment Bot #1995: Pull request #10339 opened by andoorve
November 14, 2024 18:38 11s
[Kernel] Add CUTLASS sparse support with argument sweep, heuristics, and torch operators
PR Reminder Comment Bot #1994: Pull request #10335 opened by Faraz9877
November 14, 2024 15:57 15s
[bugfix] Fix static asymmetric quantization case
PR Reminder Comment Bot #1993: Pull request #10334 opened by ProExpertProg
November 14, 2024 15:46 18s
[Tool parsing] Improve / correct mistral tool parsing
PR Reminder Comment Bot #1992: Pull request #10333 opened by patrickvonplaten
November 14, 2024 15:39 13s
Nir b2b latest
PR Reminder Comment Bot #1991: Pull request #10332 opened by nirda7
November 14, 2024 15:34 18s
[Docs] Publish meetup slides
PR Reminder Comment Bot #1990: Pull request #10331 opened by WoosukKwon
November 14, 2024 15:19 13s
[Feature] enable host memory for kv cache
PR Reminder Comment Bot #1989: Pull request #10330 opened by YZP17121579
November 14, 2024 15:04 15s
Rs 24 sparse
PR Reminder Comment Bot #1988: Pull request #10329 opened by robertgshaw2-neuralmagic
November 14, 2024 14:49 12s
[Misc] Add uninitialized params tracking for AutoWeightsLoader
PR Reminder Comment Bot #1987: Pull request #10327 opened by Isotr0py
November 14, 2024 13:43 16s
DistServe Prototype
PR Reminder Comment Bot #1986: Pull request #10321 opened by Jocn2020
November 14, 2024 10:10 17s
[draft] Fix spec model init
PR Reminder Comment Bot #1985: Pull request #10320 opened by khluu
November 14, 2024 09:54 14s
[ci][distributed] disable hanging tests
PR Reminder Comment Bot #1984: Pull request #10317 opened by youkaichao
November 14, 2024 07:40 14s
[Hardware][Cambricon MLU] Add Cambricon MLU inference backend (#9649)
PR Reminder Comment Bot #1983: Pull request #10315 opened by zonghuaxiansheng
November 14, 2024 06:45 16s
[CI/Build] Fix CPU CI online inference timeout
PR Reminder Comment Bot #1982: Pull request #10314 opened by Isotr0py
November 14, 2024 06:42 12s
[Bugfix] Fix unable to load some models
PR Reminder Comment Bot #1981: Pull request #10312 opened by DarkLight1337
November 14, 2024 03:19 10s
[Model] Support telechat2
PR Reminder Comment Bot #1980: Pull request #10311 opened by shunxing12345
November 14, 2024 02:53 15s
[Model] Add BNB quantization support for Idefics3
PR Reminder Comment Bot #1979: Pull request #10310 opened by B-201
November 14, 2024 02:10 14s
[Misc] Change RedundantReshapesPass and FusionPass logging from info to debug
PR Reminder Comment Bot #1978: Pull request #10308 opened by tlrmchlsmth
November 13, 2024 21:52 15s
[TPU] Implement prefix caching for TPUs
PR Reminder Comment Bot #1977: Pull request #10307 opened by WoosukKwon
November 13, 2024 21:40 14s
[Misc] format.sh: Simplify tool_version_check
PR Reminder Comment Bot #1976: Pull request #10305 opened by russellb
November 13, 2024 20:11 18s
[misc] error early for old-style class
PR Reminder Comment Bot #1975: Pull request #10304 opened by youkaichao
November 13, 2024 19:18 15s
[Feature] enable host memory for kv cache
PR Reminder Comment Bot #1974: Pull request #10302 opened by YZP17121579
November 13, 2024 17:43 12s
[Bugfix] Fix tensor parallel for qwen2 classification model
PR Reminder Comment Bot #1973: Pull request #10297 opened by Isotr0py
November 13, 2024 15:08 17s