Issues: vllm-project/vllm
[RFC]: Reimplement and separate beam search on top of vLLM core
#8306
opened Sep 9, 2024 by
youkaichao
Issues list
[Not to be Submitted] [WIP] Force Unit tests to run with BlockManager V2
ready
ONLY add when PR is ready to merge/full CI is needed
[Core] Factor out common code in SequenceData and Sequence
ready
#8675
opened Sep 20, 2024 by
DarkLight1337
[Core] Rename PromptInputs to PromptType, and inputs to prompt
ready
#8673
opened Sep 20, 2024 by
DarkLight1337
[Kernel] Split Marlin MoE kernels into multiple files
ready
#8661
opened Sep 20, 2024 by
ElizaWszola
[Kernel][Triton][AMD] Remove tl.atomic_add from awq_gemm_kernel, 2-5x speedup MI300, minor improvement for MI250
ready
#8646
opened Sep 20, 2024 by
rasmith
[Kernel][Bugfix] Delete some more useless code in marlin_moe_ops.cu
ready
#8643
opened Sep 19, 2024 by
tlrmchlsmth
[Misc] Support FP8 MoE for compressed-tensors
ready
#8588
opened Sep 19, 2024 by
mgoin
4 tasks done
[MISC] add support custom_op check
ready
#8557
opened Sep 18, 2024 by
jikunshang
[Bugfix] fix OpenAI API server startup with --disable-frontend-multiprocessing
ready
#8537
opened Sep 17, 2024 by
dtrifiro
[CI/Build][Misc] Comparing between block manager v1 and v2, under full prefix sharing and no prefix sharing case.
ready
#8528
opened Sep 16, 2024 by
KuntaiDu
[dbrx] refactor dbrx experts to extend FusedMoe class
ready
#8518
opened Sep 16, 2024 by
divakar-amd
[Core] Implementing disaggregated prefilling, and caching KV cache in CPU/disk/database.
ready
#8498
opened Sep 16, 2024 by
KuntaiDu
[Model] Refactor BLIP/BLIP-2 to support composite model loading
ready
#8407
opened Sep 12, 2024 by
DarkLight1337
[Core] Multi-Step + Single Step Prefills via Chunked Prefill code path
ready
#8378
opened Sep 11, 2024 by
varun-sundar-rabindranath
Add output streaming support to multi-step + async
ready
#8335
opened Sep 10, 2024 by
alexm-neuralmagic
[do-not-merge][CI/Build] Buildkite pipeline generator
ready
#8324
opened Sep 10, 2024 by
khluu
[Bugfix] Fix LongRoPE bug
ready
#8254
opened Sep 7, 2024 by
garg-amit
[BugFix] Propagate 'trust_remote_code' setting in internvl and minicpmv
ready
#8250
opened Sep 6, 2024 by
zifeitong
[Kernel] Build flash-attn from source
ready
#8245
opened Sep 6, 2024 by
ProExpertProg
[BugFix] Fix metrics error for --num-scheduler-steps > 1
ready
#8234
opened Sep 6, 2024 by
yuleil
[Misc] Upgrade vllm-flash-attn to v2.6.2
ready
#8211
opened Sep 5, 2024 by
WoosukKwon
[Model] Adding Granite MoE.
ready
#8206
opened Sep 5, 2024 by
shawntan
[Misc] add iteration_tokens metric
ready
#8140
opened Sep 4, 2024 by
LucasWilkinson
[Bugfix] Fix bug in detokenizer.py
ready
#8112
opened Sep 3, 2024 by
cafeii
[Core][Kernel][Misc] Support external swapper for vllm
ready
#8018
opened Aug 30, 2024 by
zeroorhero