You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
`limit_mm_per_prompt` also accepts configurable options per modality. In the configurable form, you still specify `count`, and you may optionally provide size hints that control how vLLM profiles and reserves memory for your multi‑modal inputs. This helps you tune memory for the actual media you expect, instead of the model’s absolute maxima.
Details could be found in [`ImageDummyOptions`][vllm.config.multimodal.ImageDummyOptions], [`VideoDummyOptions`][vllm.config.multimodal.VideoDummyOptions], and [`AudioDummyOptions`][vllm.config.multimodal.AudioDummyOptions].
136
+
137
+
Examples:
138
+
139
+
```python
140
+
from vllm importLLM
141
+
142
+
# Up to 5 images per prompt, profile with 512x512.
143
+
# Up to 1 video per prompt, profile with 32 frames at 640x640.
For backward compatibility, passing an integer works as before and is interpreted as `{"count": <int>}`. For example:
154
+
155
+
-`limit_mm_per_prompt={"image": 5}` is equivalent to `limit_mm_per_prompt={"image": {"count": 5}}`
156
+
- You can mix formats: `limit_mm_per_prompt={"image": 5, "video": {"count": 1, "num_frames": 32, "width": 640, "height": 640}}`
157
+
158
+
!!! note
159
+
- The size hints affect memory profiling only. They shape the dummy inputs used to compute reserved activation sizes. They do not change how inputs are actually processed at inference time.
160
+
- If a hint exceeds what the model can accept, vLLM clamps it to the model's effective maximum and may log a warning.
161
+
162
+
!!! warning
163
+
These size hints currently only affect activation memory profiling. Encoder cache size is determined by the actual inputs at runtime and is not limited by these hints.
164
+
125
165
## Multi-modal processor arguments
126
166
127
167
For certain models, you can adjust the multi-modal processor arguments to
0 commit comments