
Conversation

@yeshsurya (Contributor) commented Jan 6, 2026

This pull request improves the vLLM-based rollout server infrastructure: better LoRA (Low-Rank Adaptation) support, compatibility with newer vLLM versions, more robust IPv6 handling, and improved modularity. The main updates introduce custom LoRA request and hijack logic, update dependencies, refactor the vLLM server class hierarchy, and ensure compatibility with vLLM 0.13.0+ and Ray 2.53.0.

Key changes:

LoRA (Low-Rank Adaptation) Support and vLLM Integration

  • Added a new utils.py module with a custom TensorLoRARequest and VLLMHijack class that allow loading LoRA adapters directly from tensors, working around vLLM's limitation of supporting only file-based LoRA adapters. This enables more flexible and efficient LoRA synchronization between actor models (see the sketches after this list).
  • Updated the vLLM HTTP server logic to inject LoRA requests once a LoRA adapter is loaded, and to set the LoRA-related engine arguments (enable_lora, max_loras, max_lora_rank) from the model configuration.
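A minimal sketch of the tensor-based request and hijack pattern, assuming vLLM's vllm.lora.request.LoRARequest and vllm.lora.worker_manager.WorkerLoRAManager; the field names (peft_config, lora_tensors) and the patched method are illustrative and may not match the PR's utils.py exactly:

```python
# Sketch only: field names and the patched method are assumptions,
# not necessarily what the PR's utils.py defines.
from typing import Any, Dict, Optional

import torch
from vllm.lora.request import LoRARequest


class TensorLoRARequest(LoRARequest):
    # Carry the adapter in memory instead of pointing at files on disk.
    peft_config: Optional[Dict[str, Any]] = None
    lora_tensors: Optional[Dict[str, torch.Tensor]] = None


class VLLMHijack:
    @staticmethod
    def hijack() -> None:
        """Monkey-patch vLLM's worker-side LoRA loading so a
        TensorLoRARequest is materialized from its in-memory tensors
        rather than read from lora_path on disk."""
        from vllm.lora.worker_manager import WorkerLoRAManager

        original_load = WorkerLoRAManager._load_adapter

        def _load_adapter(self, lora_request):
            if isinstance(lora_request, TensorLoRARequest):
                # The real patch would build the LoRA model from
                # lora_request.lora_tensors here (via vLLM's internal
                # from_lora_tensors-style constructor).
                raise NotImplementedError("sketch only")
            return original_load(self, lora_request)

        WorkerLoRAManager._load_adapter = _load_adapter
```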
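And a sketch of deriving the LoRA engine arguments from the model configuration. AsyncEngineArgs and its enable_lora / max_loras / max_lora_rank fields are real vLLM engine arguments; the lora_rank plumbing and the max_loras=1 choice are assumptions:

```python
from vllm.engine.arg_utils import AsyncEngineArgs


def build_engine_args(model_path: str, lora_rank: int) -> AsyncEngineArgs:
    """Enable LoRA in the engine only when the model config asks for it."""
    lora_kwargs = {}
    if lora_rank > 0:  # assumed signal that the rollout config enables LoRA
        lora_kwargs = {
            "enable_lora": True,
            "max_loras": 1,          # one adapter slot; illustrative choice
            "max_lora_rank": lora_rank,
        }
    return AsyncEngineArgs(model=model_path, **lora_kwargs)
```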

Compatibility and Dependency Updates

  • Upgraded Ray to version 2.53.0 in requirements.txt for improved compatibility and performance.
  • Updated the Dockerfile to install pinned versions of PyTorch, torchvision, and torchaudio, and reordered the flash-attn installation for CUDA 12.6 compatibility.
  • Switched from pickle to cloudpickle for serialization in vllm_async_server, improving compatibility with complex Python objects such as closures (a short example follows this list).
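A short illustration of why the swap matters: standard pickle stores functions by reference and cannot serialize closures or lambdas, which cloudpickle handles by value. The payload here is illustrative, not what vllm_async_server actually ships:

```python
import pickle

import cloudpickle


def make_postprocess(scale: float):
    return lambda logits: logits * scale  # closure over `scale`


fn = make_postprocess(0.5)

restored = cloudpickle.loads(cloudpickle.dumps(fn))  # round-trips by value
assert restored(10.0) == 5.0

try:
    pickle.dumps(fn)  # plain pickle references the function by module path
except (pickle.PicklingError, AttributeError) as exc:
    print(f"pickle rejects the closure: {exc}")
```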

vLLM Server Refactor and API Adjustments

  • Refactored the vLLM HTTP server into a base class (vLLMHttpServerBase) and a Ray-remote subclass (vLLMHttpServer), improving code modularity and clarity (see the sketch after this list).
  • Updated method signatures and internal logic for better type safety, reflecting that the vLLM HTTP server now accepts only RolloutConfig, not RewardModelConfig.
  • Updated imports for compatibility with vLLM 0.13.0+ (e.g., splitting the FlexibleArgumentParser and get_tcp_uri imports).
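A minimal sketch of the base-class / Ray-remote split; only the two class names come from the PR, while the constructor signature and method body are assumptions:

```python
import ray


class vLLMHttpServerBase:
    """Plain-Python server logic: engine setup, routes, LoRA injection.
    Keeping it Ray-free makes it importable and testable on its own."""

    def __init__(self, rollout_config):  # only RolloutConfig after this PR
        self.rollout_config = rollout_config

    async def launch(self) -> None:
        ...  # build engine args and start serving


@ray.remote
class vLLMHttpServer(vLLMHttpServerBase):
    """Thin Ray actor wrapper; all behavior lives in the base class."""
```

With this split, callers create the actor via vLLMHttpServer.remote(config), while tests can instantiate vLLMHttpServerBase directly without a Ray cluster.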

IPv6 and Networking Improvements

  • Improved ZeroMQ socket handling to support IPv6 addresses and updated the address formatting logic throughout the server and replica classes.
  • Added logic to detect whether an address is IPv6 and bracket it accordingly when setting up server addresses (see the sketch after this list).
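A standard-library sketch of the detection and formatting logic; the helper names are illustrative:

```python
import ipaddress

import zmq


def is_ipv6(host: str) -> bool:
    """True if host parses as an IPv6 literal (e.g. '::1')."""
    try:
        return isinstance(ipaddress.ip_address(host), ipaddress.IPv6Address)
    except ValueError:
        return False  # hostnames and malformed strings treated as non-IPv6


def format_tcp_address(host: str, port: int) -> str:
    """IPv6 literals must be bracketed inside a tcp:// URI."""
    return f"tcp://[{host}]:{port}" if is_ipv6(host) else f"tcp://{host}:{port}"


ctx = zmq.Context.instance()
sock = ctx.socket(zmq.PULL)
sock.setsockopt(zmq.IPV6, 1)  # ZeroMQ rejects IPv6 endpoints unless enabled
sock.bind(format_tcp_address("::1", 5555))
```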

Miscellaneous Enhancements

  • Ensured that results of non-blocking calls to the distributed executor are wrapped in Future objects, as required by vLLM 0.13.0+ (see the sketch after this list).
  • Added Prometheus model-name extraction logic for improved observability.
  • Various minor improvements and bug fixes, including logging enhancements and argument handling.
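A sketch of wrapping a synchronously computed result in a Future so callers expecting the non-blocking executor interface of vLLM 0.13.0+ can consume it uniformly; the method and helper names here are illustrative:

```python
from concurrent.futures import Future
from typing import Any


def as_future(result: Any) -> Future:
    """Wrap an already-computed value in a completed Future."""
    fut: Future = Future()
    fut.set_result(result)
    return fut


def collective_rpc(method: str, non_block: bool = False):
    result = f"ran {method}"  # stand-in for the real executor call
    # Newer vLLM expects a Future from non-blocking calls, even when the
    # underlying execution completed synchronously.
    return as_future(result) if non_block else result


print(collective_rpc("sleep", non_block=True).result())  # -> "ran sleep"
```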

These changes collectively improve the flexibility, maintainability, and compatibility of the rollout server infrastructure with the latest vLLM and Ray versions, while enabling advanced LoRA workflows.

@github-actions (bot) commented Jan 6, 2026

Test Results for assets-test

0 tests   0 ✅  0s ⏱️
0 suites  0 💤
0 files    0 ❌

Results for commit 58fcdf3.

♻️ This comment has been updated with latest results.

@yeshsurya force-pushed the yeshwanth/torch_and_dep_upgrade_in_environment branch from a52f490 to 2aba2e3 on January 7, 2026 at 04:11