Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Disable chunked prefill and/or prefix caching when MLA is enabled (vl…
…lm-project#12642) From @mgoin in vllm-project#12638 I cannot push to that branch, therefore a new PR to unblock release. --------- Signed-off-by: mgoin <michael@neuralmagic.com> Signed-off-by: simon-mo <simon.mo@hey.com> Co-authored-by: mgoin <michael@neuralmagic.com>
- Loading branch information