
Upgrade vLLM to 0.17.0#61598

Open
jeffreywang-anyscale wants to merge 17 commits into master from vllm-0.17.0

Conversation

@jeffreywang-anyscale
Contributor

Description

Briefly describe what this PR accomplishes and why it's needed.

Related issues

Link related issues: "Fixes #1234", "Closes #1234", or "Related to #1234".

Additional information

Optional: Add implementation details, API changes, usage examples, screenshots, etc.

Contributor

@gemini-code-assist bot left a comment


Code Review

This pull request correctly upgrades vLLM to version 0.17.0 and updates its dependencies accordingly. The code changes are consistent with this upgrade. However, it seems a local configuration for a PyPI index has been accidentally included in many of the dependency lock files. This should be removed to avoid breaking builds.

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
@jeffreywang-anyscale added the "go (add ONLY when ready to merge, run all tests)" label on Mar 9, 2026
@ray-gardener bot added the "serve (Ray Serve Related Issue)" and "llm" labels on Mar 9, 2026
opentelemetry-proto==1.39.0 \
--hash=sha256:1e086552ac79acb501485ff0ce75533f70f3382d43d0a30728eeee594f7bf818 \
--hash=sha256:c1fa48678ad1a1624258698e59be73f990b7fc1f39e73e16a9d08eef65dd838c
opentelemetry-proto==1.34.1 \
Collaborator


why is this downgraded?

Contributor Author


I think this is compiled from https://github.com/ray-project/ray/blob/15a473454084a739264ce66290d7d4fc1b3926b4/python/requirements/serve/tracing-reqs.txt, and all opentelemetry libraries should have the same version.
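The "same version" expectation can be captured with a pip-style constraints file (a hypothetical sketch; the package list is illustrative, not the repo's actual tracing-reqs.txt):

```
# constraints.txt (illustrative): keep the opentelemetry family on one release line
opentelemetry-api==1.39.0
opentelemetry-sdk==1.39.0
opentelemetry-proto==1.39.0
opentelemetry-exporter-otlp==1.39.0
```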

Collaborator


Hmm... that does not really make sense to me.

  1. How does tracing-reqs.txt get pulled in by this change?
  2. Should that be upgraded to 1.39.0 for consistency?

I think some additional dependency of vLLM 0.17 is pulling the version down.

Collaborator


@elliot-barn could you help investigate? For example, what happens if we enforce opentelemetry-proto>=1.39.0 as a constraint?

Contributor


opentelemetry-proto 1.40.0 depends on protobuf>=5.0,<7.0
opentelemetry-proto 1.39.1 depends on protobuf>=5.0,<7.0
opentelemetry-proto 1.39.0 depends on protobuf>=5.0,<7.0

We are still on protobuf 4.25.8; the py313 dependency upgrade initiative will bring us to 5.29.6.
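The constraint arithmetic above can be sketched with a minimal check (not uv's actual resolver; versions taken from the comment):

```python
def parse(version):
    """Parse a dotted version string into a comparable tuple of ints."""
    return tuple(int(part) for part in version.split("."))

def satisfies_protobuf_range(version):
    """protobuf >=5.0,<7.0 -- the requirement shared by opentelemetry-proto 1.39.0+."""
    return parse("5.0") <= parse(version) < parse("7.0")

print(satisfies_protobuf_range("4.25.8"))  # False: the current pin predates the range
print(satisfies_protobuf_range("5.29.6"))  # True: the py313 upgrade target fits
```

This is why staying on protobuf 4.25.8 rules out opentelemetry-proto 1.39.0 and newer until the protobuf upgrade lands.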

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
@jeffreywang-anyscale requested a review from a team as a code owner on March 9, 2026 23:21
Collaborator

@aslonnie left a comment


I would like to understand more about why opentelemetry-proto needs to be downgraded.

--hash=sha256:c1fa48678ad1a1624258698e59be73f990b7fc1f39e73e16a9d08eef65dd838c
opentelemetry-proto==1.34.1 \
--hash=sha256:16286214e405c211fc774187f3e4bbb1351290b8dfb88e8948af209ce85b719e \
--hash=sha256:eb4bb5ac27f2562df2d6857fc557b3a481b5e298bc04f94cc68041f00cebcbd2


Unintended opentelemetry-proto downgrade across non-LLM lock files

Medium Severity

opentelemetry-proto is downgraded from 1.39.0 to 1.34.1 not only in LLM lock files but also in unrelated base, slim, and serve lock files that have no dependency on vLLM. As the PR reviewers noted, a vLLM upgrade should not pull down opentelemetry-proto in non-LLM environments. This broad, unexplained regression may indicate a dependency resolution issue — possibly a transitive constraint leaking through shared constraint files — and risks incompatibilities with components expecting the newer proto version.

Additional Locations (2)
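Cross-lockfile drift like this can be spotted mechanically (a hypothetical helper, not part of the PR; it assumes the standard pip-compile `name==version \` lock layout):

```python
import re

def pinned_version(lock_text, package):
    """Return the version pinned for `package` in a pip-compile style lock, or None."""
    match = re.search(rf"^{re.escape(package)}==([0-9A-Za-z.]+)",
                      lock_text, re.MULTILINE)
    return match.group(1) if match else None

lock = """\
opentelemetry-proto==1.34.1 \\
    --hash=sha256:16286214e405c211fc774187f3e4bbb1351290b8dfb88e8948af209ce85b719e
"""
print(pinned_version(lock, "opentelemetry-proto"))  # 1.34.1
print(pinned_version(lock, "protobuf"))             # None: not pinned in this lock
```

Running such a check across base, slim, and serve lock files would flag any lock where the pin regressed below the expected 1.39.0.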

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
- --python-version=${PYTHON_VERSION_STR}
- --unsafe-package ray
- --python-platform=x86_64-manylinux_2_31
- --index https://download.pytorch.org/whl/${CUDA_CODE}

PyTorch CUDA index removed from LLM build configs

High Severity

The --index https://download.pytorch.org/whl/${CUDA_CODE} flag was removed from LLM depset configs that build for CUDA targets (cu128, cu130_py312). Other GPU configs in the same repo (ray_base_extra_testdeps_gpu, ray_ml_base_extra_testdeps_cuda, release_multimodal_inference_benchmarks_tests) still use --index https://download.pytorch.org/whl/cu128. vLLM 0.17.0 docs still recommend --extra-index-url for the PyTorch CUDA wheel index. Without this index, the uv resolver may not find CUDA-specific PyTorch wheels for cu128/cu130 builds, potentially resolving CPU-only PyTorch when lock files are regenerated.

Additional Locations (2)

1. Restore `--index https://download.pytorch.org/whl/${CUDA_CODE}` in
   rayllm.depsets.yaml. This was accidentally dropped, causing cu128
   lockfiles to resolve torchaudio from PyPI instead of the CUDA index.

2. Add numexpr>=2.10 to llm-test-requirements.txt. The CI base Docker
   image has numexpr compiled against NumPy 1.x, but the lockfile
   installs NumPy 2.x, causing a binary incompatibility crash. Including
   numexpr in the lockfile ensures a compatible version overwrites the
   base image's broken one.

3. Regenerate all 16 LLM lockfiles.
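Step 1 amounts to restoring the flag in the depset's append_flags (fragment reconstructed from the diff quoted earlier in this thread; surrounding keys may differ in the actual rayllm.depsets.yaml):

```yaml
append_flags:
  - --python-version=${PYTHON_VERSION_STR}
  - --unsafe-package ray
  - --python-platform=x86_64-manylinux_2_31
  - --index https://download.pytorch.org/whl/${CUDA_CODE}
```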

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
Master also lacks numexpr in lockfiles with the same NumPy 2.2.6.
The CPU test numexpr failure is a base image issue, not caused by
this branch.

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

uv pip install --system --no-cache-dir --no-deps \
--index-strategy unsafe-best-match \
--index-strategy first-index \

Index strategy inconsistency may break Docker image builds

Medium Severity

The --index-strategy was changed from unsafe-best-match to first-index, inconsistent with every other Dockerfile in the project (base-deps, base-extra, base-slim), which all use unsafe-best-match with an explicit comment explaining its necessity. The lock file rayllm_py311_cu128.lock is compiled using the PyTorch index (via rayllm.depsets.yaml which still has --index https://download.pytorch.org/whl/${CUDA_CODE}), so hashes may originate from that index. With first-index, uv stops at PyPI for packages like torch==2.10.0 and torchvision==0.25.0 and never checks the PyTorch index — if any hashes don't match PyPI's wheels, the build will fail.

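The difference between the two strategies can be illustrated with a toy resolver (a simplified model of the documented uv semantics, not uv's actual code; index contents and versions are hypothetical):

```python
def resolve(package, indexes, strategy):
    """indexes: ordered list of (index_name, {package: version_tuple}) pairs.

    first-index: stop at the first index that carries the package at all.
    unsafe-best-match: consider candidates from every index, take the best version.
    """
    candidates = [(name, pkgs[package]) for name, pkgs in indexes if package in pkgs]
    if not candidates:
        return None
    if strategy == "first-index":
        return candidates[0]          # later indexes are never consulted
    return max(candidates, key=lambda c: c[1])

# PyPI is listed first; the PyTorch CUDA index carries a different build.
indexes = [
    ("pypi", {"torch": (2, 10, 0)}),
    ("pytorch-cu128", {"torch": (2, 10, 1)}),
]
print(resolve("torch", indexes, "first-index"))        # ('pypi', (2, 10, 0))
print(resolve("torch", indexes, "unsafe-best-match"))  # ('pytorch-cu128', (2, 10, 1))
```

Under first-index, artifacts (and therefore hashes) only ever come from the first index that lists the package, which is the mismatch risk described above when the lock was compiled against the PyTorch index.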

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
@jeffreywang-anyscale requested a review from a team as a code owner on March 14, 2026 00:03
- ray_img_depset_${PYTHON_SHORT}
output: python/deplocks/base_extra_testdeps/ray-llm-base_extra_testdeps_py${PYTHON_VERSION}.lock
append_flags:
- --index https://download.pytorch.org/whl/${CUDA_CODE}

PyTorch CUDA index removed inconsistently from CUDA depsets

Medium Severity

The --index https://download.pytorch.org/whl/${CUDA_CODE} flag was removed from the ray_base_extra_testdeps_llm_cuda depset in rayimg.depsets.yaml and from the common settings in llm_release_tests.depsets.yaml, but similar CUDA depsets like ray_base_extra_testdeps_gpu and ray_ml_base_extra_testdeps_cuda in the same file still retain it. The LLM CUDA depset expands dependencies that include sentence-transformers, which depends on PyTorch — without the CUDA wheel index, the resolver may pull CPU-only PyTorch wheels for what's explicitly a CUDA build.

Additional Locations (1)

Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>
Signed-off-by: Jeffrey Wang <jeffreywang@anyscale.com>

@cursor bot left a comment


Cursor Bugbot has reviewed your changes and found 1 potential issue.

There are 2 total unresolved issues (including 1 from previous review).


# exact versions, so integrity is maintained through version pinning.
uv pip install --system --no-cache-dir --no-deps \
--index-strategy unsafe-best-match \
--no-verify-hashes \

Disabling hash verification weakens supply chain security

Medium Severity

Adding --no-verify-hashes disables integrity checking for all packages installed from the lock file. The lock files still contain hashes, but they are completely ignored during installation. This means a compromised or tampered package on the CUDA index (or any alternate index used via unsafe-best-match) could be installed without detection. While version pinning provides some defense, hash verification is the primary protection against supply chain attacks where an index serves a modified binary for a pinned version. A more targeted fix — such as regenerating hashes from the actual CUDA index, or excluding only the mismatched packages — would preserve integrity checking for the majority of dependencies.

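What hash verification buys is easy to see in miniature (a minimal sketch of the check that --no-verify-hashes turns off; the artifact bytes and pin are illustrative):

```python
import hashlib

def verify(artifact: bytes, pinned_sha256: str) -> bool:
    """Accept an artifact only if its digest matches the lock file pin."""
    return hashlib.sha256(artifact).hexdigest() == pinned_sha256

wheel = b"example wheel contents"
pin = hashlib.sha256(wheel).hexdigest()   # what the lock file would record

print(verify(wheel, pin))         # True: untampered artifact
print(verify(wheel + b"!", pin))  # False: tampered artifact is rejected
```

With the flag set, version pins still constrain what is requested, but nothing checks that the bytes served for that version match what was locked.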



Development

Successfully merging this pull request may close these issues.

Ray fails to serialize self-reference objects

4 participants