
Conversation

@ahengljh (Contributor) commented Aug 25, 2025

Purpose

This PR removes the ability for each LoRA adapter to define its own additional
vocabulary, as proposed in issue #23474. All LoRA adapters for a given model now
share the same vocabulary as the base model.

Test Plan

1. Test basic LoRA initialization:

   ```python
   import vllm

   llm = vllm.LLM(model="Qwen3-8B", enable_lora=True)
   ```

2. Test LoRA adapter loading (with an existing adapter; a consolidated sketch of steps 1 and 2 follows this list):

   ```python
   from vllm.lora.request import LoRARequest

   lora_request = LoRARequest("adapter1", 1, "/path/to/lora/adapter")
   outputs = llm.generate(["Test prompt"], lora_request=lora_request)
   ```

3. Run the LoRA-related tests:

   ```bash
   pytest tests/lora/test_lora_manager.py -v
   pytest tests/lora/test_layers.py -v
   ```
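
For convenience, steps 1 and 2 can also be run as a single script. A minimal sketch, assuming a LoRA adapter directory exists at the placeholder path above:

```python
import vllm
from vllm.lora.request import LoRARequest

# Step 1: basic LoRA initialization should succeed without an AttributeError.
llm = vllm.LLM(model="Qwen3-8B", enable_lora=True)

# Step 2: generate with an existing LoRA adapter (placeholder name/path from the test plan).
lora_request = LoRARequest("adapter1", 1, "/path/to/lora/adapter")
outputs = llm.generate(["Test prompt"], lora_request=lora_request)
print(outputs[0].outputs[0].text)
```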

Test Result

  • ✅ Successfully initialized vLLM with enable_lora=True without AttributeError
  • ✅ LoRAConfig no longer has lora_extra_vocab_size attribute
  • ✅ All model files updated to remove additional vocabulary references

(Optional) Documentation Update

No documentation updates required as this removes an undocumented internal feature.

Essential Elements of an Effective PR Description Checklist

- [x] The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
- [x] The test plan, such as providing test commands.
- [x] The test results, such as pasting the results comparison before and after, or e2e results.
- [ ] (Optional) The necessary documentation update, such as updating `supported_models.md` and `examples` for a new model.
BEFORE SUBMITTING, PLEASE READ https://docs.vllm.ai/en/latest/contributing

@gemini-code-assist (bot) left a comment

Code Review

This pull request removes the functionality for LoRA adapters to have their own additional vocabulary, which simplifies the LoRA implementation across the codebase. The changes are extensive and touch many files, primarily removing lora_extra_vocab_size and related logic.

The changes are generally correct and align with the goal of the PR. I've identified a few areas where the code can be cleaned up for better maintainability, such as removing redundant code blocks and expressions. These patterns are repeated across several model definition files, and I've provided examples in the comments. Addressing them would improve the overall code quality.

Review comment (severity: high):

The expression `self.base_layer.org_vocab_size + 0` is redundant. It can be simplified to `self.base_layer.org_vocab_size` for better code clarity.

Suggested change:

```diff
-self.base_layer.org_vocab_size +
-lora_config.lora_extra_vocab_size,
-0,  # No extra vocab size
+self.base_layer.org_vocab_size,
```

Comment on lines 280 to 281:

```python
lora_vocab = ((0 *
               (lora_config.max_loras or 1)) if lora_config else 0)
```

Review comment (severity: high):

The `lora_vocab` variable is now always 0 and is unused. It can be removed for clarity. This pattern of an unused `lora_vocab` variable is present in several other model files and should be addressed there as well.
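
To make this concrete, here is a small self-contained sketch of the dead-code pattern and the suggested cleanup; `FakeLoRAConfig` is a stand-in invented for illustration, not vLLM's actual `LoRAConfig`:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FakeLoRAConfig:
    # Illustrative stand-in for vLLM's LoRAConfig; only the field used below.
    max_loras: int = 1


lora_config: Optional[FakeLoRAConfig] = FakeLoRAConfig()

# Current state in this PR: the expression always evaluates to 0 and nothing reads it.
lora_vocab = ((0 * (lora_config.max_loras or 1)) if lora_config else 0)
assert lora_vocab == 0

# Suggested cleanup: delete the lora_vocab assignment entirely.
```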

Comment on lines 538 to 540:

```diff
 if lora_config:
-    unpadded_vocab_size += lora_config.lora_extra_vocab_size
+    # No additional vocabulary for LoRA
```

Review comment (severity: high):

This `if lora_config:` block is now empty and can be removed to improve code clarity. This pattern is repeated in several other model files (e.g., `minicpm_eagle.py`, `phi4flash.py`, `phi4mm.py`, `step3_text.py`) and should be addressed there as well.

@ahengljh ahengljh force-pushed the remove-lora-additional-vocabulary branch 2 times, most recently from d0dd2eb to fbea0b6 Compare August 25, 2025 09:58
@jeejeelee (Collaborator)

As a start, maybe it would be better to just provide a deprecation warning for lora_extra_vocab_size
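
For context, a minimal sketch of what such a deprecation warning could look like, assuming a plain `warnings.warn` at config-validation time; the function name and call site are illustrative, not taken from vLLM:

```python
import warnings


def warn_if_lora_extra_vocab_size_set(lora_extra_vocab_size: int) -> None:
    # Illustrative helper: emit a deprecation warning when the soon-to-be-removed
    # field is set to anything other than its "disabled" value of 0.
    if lora_extra_vocab_size != 0:
        warnings.warn(
            "`lora_extra_vocab_size` is deprecated and will be removed in a future "
            "release; LoRA adapters will share the base model's vocabulary.",
            DeprecationWarning,
            stacklevel=2,
        )


warn_if_lora_extra_vocab_size_set(256)  # example value; triggers the warning
```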

@ahengljh (Contributor, Author)

> As a start, maybe it would be better to just provide a deprecation warning for lora_extra_vocab_size

Oh I see. So we should hold this PR and just add a warning for users for now, right?

@jeejeelee (Collaborator)

> As a start, maybe it would be better to just provide a deprecation warning for lora_extra_vocab_size
>
> Oh I see. So we should hold this PR and just add a warning for users for now, right?

I think so.

@ahengljh ahengljh force-pushed the remove-lora-additional-vocabulary branch from 2634859 to e0018fc Compare August 26, 2025 03:35
@ahengljh (Contributor, Author)

> As a start, maybe it would be better to just provide a deprecation warning for lora_extra_vocab_size
>
> Oh I see. So we should hold this PR and just add a warning for users for now, right?
>
> I think so.

I just made another one-line PR, #23635, to add the warning, and we can hold this PR until you believe it's time to remove this feature completely. What's your opinion on this?

@mergify mergify bot added the needs-rebase label Sep 18, 2025
@ahengljh ahengljh force-pushed the remove-lora-additional-vocabulary branch 2 times, most recently from e6239d0 to a4677dd Compare September 18, 2025 06:37
@mergify mergify bot removed the needs-rebase label Sep 18, 2025
@jeejeelee (Collaborator)

I have triggered the LoRA tests to verify these changes.

@jeejeelee (Collaborator)

All LoRA tests are failing. Could you take a closer look into this? @ahengljh

@ahengljh (Contributor, Author)

> All LoRA tests are failing. Could you take a closer look into this? @ahengljh

Yes, I noticed that and will try to fix them.

@mergify (bot) commented Sep 19, 2025

This pull request has merge conflicts that must be resolved before it can be merged. Please rebase the PR, @ahengljh.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify bot added the needs-rebase label Sep 19, 2025
@jeejeelee (Collaborator)

@ahengljh could we speed up a bit? Thank you.

@ahengljh ahengljh force-pushed the remove-lora-additional-vocabulary branch from f7ad2ef to 2f5ee25 Compare September 22, 2025 06:58
@mergify mergify bot removed the needs-rebase label Sep 22, 2025
@ahengljh (Contributor, Author)

> @ahengljh could we speed up a bit? Thank you.

I have been trying to run the LoRA tests on my local machine, but I found that even when I run them on the main branch they still fail, which makes this difficult for me to debug. I need to investigate further.

@mergify (bot) commented Sep 24, 2025

This pull request has merge conflicts that must be resolved before it can be merged. Please rebase the PR, @ahengljh.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify bot added the needs-rebase label Sep 24, 2025
@ahengljh ahengljh force-pushed the remove-lora-additional-vocabulary branch from 007c50e to 1606db4 Compare September 26, 2025 09:44
@mergify mergify bot removed the needs-rebase label Sep 26, 2025
@mergify (bot) commented Sep 29, 2025

This pull request has merge conflicts that must be resolved before it can be merged. Please rebase the PR, @ahengljh.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@jeejeelee (Collaborator)

@ahengljh Would you like to continue working on this PR? If so, could you resolve the conflicts first?

@ahengljh (Contributor, Author)

> @ahengljh Would you like to continue working on this PR? If so, could you resolve the conflicts first?

Yes, of course, and sorry for the delay. I can resolve the conflicts first, but there have been some problems with my local GPU environment when running the tests, so I may need a little help getting the tests to pass.

@jeejeelee (Collaborator)

Okay, please feel free to ping me on Slack.

@ahengljh ahengljh force-pushed the remove-lora-additional-vocabulary branch from 1606db4 to ee013d7 Compare October 15, 2025 14:37
@mergify mergify bot removed the needs-rebase label Oct 15, 2025
@mergify (bot) commented Oct 17, 2025

This pull request has merge conflicts that must be resolved before it can be merged. Please rebase the PR, @ahengljh.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify bot added the needs-rebase label Oct 17, 2025
@WoosukKwon (Collaborator)

@ahengljh @jeejeelee Do we have any updates? 😅

@jeejeelee (Collaborator)

Sorry for missing this PR. @ahengljh, would you like to continue working on it?


Labels

llama (Related to Llama models), needs-rebase, qwen (Related to Qwen models), speculative-decoding, tpu (Related to Google TPUs), v1
