Motivation.
Currently, vLLM allows each LoRA adapter to define its own additional vocabulary:
Lines 2456 to 2460 in 65197a5:

```python
lora_extra_vocab_size: int = 256
"""Maximum size of extra vocabulary that can be present in a LoRA adapter
(added to the base model vocabulary)."""
lora_vocab_padding_size: ClassVar[int] = current_platform\
    .get_lora_vocab_padding_size()
```
However, this introduces significant complexity because:
- We can no longer assume a single tokenizer per model (since each LoRA adapter can have its own tokenizer).
- The size of the unembedding (lm_head) layer becomes ambiguous, since it depends on each adapter's extra vocabulary (see the sketch below).
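
For illustration, here is a minimal sketch of the sizing issue. It is not vLLM's actual code; the base vocabulary size and the padding rule are assumptions, with the defaults taken from the snippet above:

```python
# Minimal sketch, NOT vLLM's implementation: with per-adapter extra vocab,
# the unembedding (lm_head) has to be allocated for the largest vocabulary
# any adapter may bring, then padded to the platform's granularity.

BASE_VOCAB_SIZE = 32_000        # assumed base model vocabulary
LORA_EXTRA_VOCAB_SIZE = 256     # default shown in the snippet above
LORA_VOCAB_PADDING_SIZE = 256   # assumed platform padding granularity


def padded_unembedding_rows(base_vocab: int, extra_vocab: int,
                            padding: int) -> int:
    """Rows the lm_head must hold so any adapter's extra tokens fit."""
    total = base_vocab + extra_vocab
    # Round up to a multiple of the padding granularity.
    return ((total + padding - 1) // padding) * padding


# Without extra vocab the layer is simply BASE_VOCAB_SIZE rows; with it,
# the size depends on lora_extra_vocab_size and the padding rule.
print(padded_unembedding_rows(BASE_VOCAB_SIZE, LORA_EXTRA_VOCAB_SIZE,
                              LORA_VOCAB_PADDING_SIZE))  # -> 32256
```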
Proposed Change.
Since this feature appears to be rarely used, I propose removing it. Going forward, vLLM will assume that all LoRA adapters for a given model share the same vocabulary.
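As a rough illustration of the post-change behavior (the function and error message below are hypothetical, not vLLM's actual API), an adapter that ships added tokens would simply be rejected rather than merged into an enlarged embedding/unembedding:

```python
# Hypothetical sketch only; names and messages are illustrative, not vLLM's API.
def validate_lora_vocab(base_vocab_size: int, adapter_vocab_size: int) -> None:
    """Reject adapters whose embedding rows exceed the base vocabulary."""
    if adapter_vocab_size > base_vocab_size:
        raise ValueError(
            "LoRA adapters with additional vocabulary are not supported; "
            "all adapters must share the base model's vocabulary "
            f"(adapter has {adapter_vocab_size} rows, base has {base_vocab_size})."
        )


# Example: an adapter that added 256 tokens on top of a 32000-token base
# model would now fail to load.
try:
    validate_lora_vocab(base_vocab_size=32_000, adapter_vocab_size=32_256)
except ValueError as e:
    print(e)
```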
Feedback Period.
1 week
CC List.
Any Other Things.
No response
Before submitting a new issue...
- Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.