@kzawora-intel kzawora-intel commented Jul 17, 2025

Ported from HabanaAI/vllm-fork#1606. Fixes a bucketing anomaly where prefills with batch size 1 (bs=1) were padded to bs=2, which triggered a recompilation.

Signed-off-by: Konrad Zawora <kzawora@habana.ai>
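
For readers unfamiliar with bucketing, here is a minimal sketch of how padding a batch size up to the nearest bucket typically works. It is illustrative only and is not the code changed in this PR: the function name `pad_to_bucket` and both bucket lists are hypothetical, and the description above does not state the actual root cause. The sketch only shows one way a bs=1 request can end up padded to bs=2, namely when no bucket of size 1 is available to the lookup.

```python
from bisect import bisect_left


def pad_to_bucket(value, buckets):
    """Return the smallest bucket that is >= value.

    `buckets` must be sorted in ascending order. This helper is a
    hypothetical illustration, not a vLLM or HabanaAI API.
    """
    idx = bisect_left(buckets, value)
    if idx == len(buckets):
        raise ValueError(f"no bucket large enough for {value}")
    return buckets[idx]


# Hypothetical batch-size bucket lists (assumed values for illustration).
buckets_without_one = [2, 4, 8]
buckets_with_one = [1, 2, 4, 8]

# With no bucket of size 1, a bs=1 prefill is padded up to bs=2.
padded_anomalous = pad_to_bucket(1, buckets_without_one)

# With a bucket of size 1 present, bs=1 stays at bs=1.
padded_expected = pad_to_bucket(1, buckets_with_one)

print(padded_anomalous, padded_expected)
```

If the padded shape was not among the shapes compiled during warmup, running it would force a fresh compilation, which matches the symptom described above.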
@kzawora-intel kzawora-intel force-pushed the private/kzawora/prefill_bucketing branch from e5e2414 to c4b4364 July 17, 2025 08:24
@kzawora-intel kzawora-intel enabled auto-merge (squash) July 17, 2025 08:57
@kzawora-intel kzawora-intel disabled auto-merge July 17, 2025 08:58
@kzawora-intel kzawora-intel merged commit d1c0283 into main Jul 17, 2025
3 checks passed
@kzawora-intel kzawora-intel deleted the private/kzawora/prefill_bucketing branch July 28, 2025 10:15