Logit Processor additional data

Hi,

Currently, logit processors are only aware of the previous generated tokens and the logits: https://github.com/vllm-project/vllm/blob/2acd76f346efcdff4f6ca1d92fe1575c448e4b70/vllm/model_executor/layers/sampler.py#L210

For some applications (like watermarking), additional context can be useful, such as the index of the generation in the original batch. It would be nice if these could be passed as optional parameters to logit processors. Passing the `seq_id` variable would be neat for instance. 

Additionally, having a higher level interface (such as a LogitProcessor similar to the one in HuggingFace that handles batches instead of individual generations) could be useful (see the changes in https://github.com/vllm-project/vllm/compare/main...julien-piet:vllm:main).

Happy to help out with this!
Julien

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Logit Processor additional data #2142

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Logit Processor additional data #2142

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions