This repository was archived by the owner on Sep 4, 2025. It is now read-only.

Commit 204a1d4

tjohnson31415 authored and dtrifiro committed
feat: allow long max seq length
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
1 parent 9460cfb commit 204a1d4

1 file changed: +4 -0 lines changed

Dockerfile.ubi

Lines changed: 4 additions & 0 deletions
@@ -183,6 +183,10 @@ RUN --mount=type=cache,target=/root/.cache/pip \
 ENV HF_HUB_OFFLINE=1 \
     PORT=8000 \
     HOME=/home/vllm \
+    # Allow requested max length to exceed what is extracted from the
+    # config.json
+    # see: https://github.com/vllm-project/vllm/pull/7080
+    VLLM_ALLOW_LONG_MAX_MODEL_LEN=1 \
     VLLM_USAGE_SOURCE=production-docker-image \
     VLLM_WORKER_MULTIPROC_METHOD=fork
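
For reviewers unfamiliar with the flag, the behavior described in the comment above can be sketched roughly as follows. This is a minimal illustration, not vLLM's actual code: the helper name `resolve_max_model_len`, its parameters, and the exact set of accepted flag values are assumptions made for the example. Only the environment variable name comes from this commit.

```python
import os
from typing import Mapping, Optional


def resolve_max_model_len(
    requested: Optional[int],
    derived: int,
    env: Mapping[str, str] = os.environ,
) -> int:
    """Hypothetical helper: pick the effective max sequence length.

    `derived` stands for the length extracted from the model's config.json;
    `requested` stands for the length the user asked for (None if unset).
    """
    if requested is None or requested <= derived:
        # Nothing to override: use the request, or fall back to the derived value.
        return derived if requested is None else requested

    # Assumption: the flag is treated as enabled when set to "1" or "true".
    flag = env.get("VLLM_ALLOW_LONG_MAX_MODEL_LEN", "0").strip().lower()
    if flag in ("1", "true"):
        # Override allowed: accept the longer length at the caller's own risk.
        return requested

    raise ValueError(
        f"Requested max length {requested} exceeds derived max length {derived}; "
        "set VLLM_ALLOW_LONG_MAX_MODEL_LEN=1 to override."
    )
```

With the flag baked into the image as in this diff, a request longer than the config-derived length would be accepted in this sketch rather than rejected. Whether the model produces sensible output beyond its configured length is a separate question that the flag does not address.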
