Skip to content

Commit e5b2204

Browse files
varun-sundar-rabindranathVarun Sundar Rabindranath
authored andcommitted
[Bugfix] LoRA : Fix the order in which the kernels process LoRAs (vllm-project#16040)
Signed-off-by: Varun Sundar Rabindranath <varun@neuralmagic.com> Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com> Signed-off-by: Yang Wang <elainewy@meta.com>
1 parent 3de70f6 commit e5b2204

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/lora/ops/triton_ops/lora_kernel_metadata.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -111,7 +111,7 @@ def prepare_tensors(self, token_lora_mapping: torch.Tensor) -> None:
111111

112112
# active_lora_ids, num_tokens_per_lora
113113
lora_ids, num_tokens_per_lora = torch.unique(token_lora_mapping,
114-
sorted=False,
114+
sorted=True,
115115
return_counts=True)
116116
self.active_lora_ids[:lora_ids.size(0)].copy_(lora_ids,
117117
non_blocking=True)

0 commit comments

Comments
 (0)