[WIP] Video support in vLLM backend #42919

zucchini-nlp · 2025-12-17T10:35:59Z

What does this PR do?

Enables vllm-project/vllm#30680 from transformers side. I had this in my local draft for a very long time and some models still don't fit neatly, such as InternVL uses image token-id per video-frame processing or SmolVLM adds timestamps between each frame

I think we will support models that are easy first, in progress for now.

fyi @hmellor

github-actions · 2025-12-17T10:37:06Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: blip_2, got_ocr2, instructblip, instructblipvideo, internvl, llava_next_video, llava_onevision, perception_lm, qwen2_5_omni, qwen2_5_vl, qwen2_vl, smolvlm, video_llava

github-actions · 2025-12-17T10:41:36Z

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=42919&sha=ce0dfb

HuggingFaceDocBuilderDev · 2025-12-17T10:45:12Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp added 2 commits December 16, 2025 15:38

just push what I have now

f9622fa

Merge remote-tracking branch 'upstream/main' into video-mm-tokens

ce0dfb5

zucchini-nlp changed the title ~~Video mm tokens~~ Video support in vLLM backend Dec 17, 2025

zucchini-nlp changed the title ~~Video support in vLLM backend~~ [WIP] Video support in vLLM backend Dec 17, 2025

hmellor mentioned this pull request Dec 18, 2025

[Model] Add video input support for transformers modeling backend vllm-project/vllm#30680

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] Video support in vLLM backend #42919

[WIP] Video support in vLLM backend #42919

zucchini-nlp commented Dec 17, 2025

Uh oh!

github-actions bot commented Dec 17, 2025

Uh oh!

github-actions bot commented Dec 17, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Dec 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[WIP] Video support in vLLM backend #42919

Are you sure you want to change the base?

[WIP] Video support in vLLM backend #42919

Conversation

zucchini-nlp commented Dec 17, 2025

What does this PR do?

Uh oh!

github-actions bot commented Dec 17, 2025

Uh oh!

github-actions bot commented Dec 17, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Dec 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants