[ray.serve.llm] Fix setting up AutoProcessor #50715

GeneDer · 2025-02-19T05:11:42Z

Why are these changes needed?

While running an end-to-end test, I found that the AutoProcessor set when the vLLM engine starts was different from the one used in the vLLM deployment. This PR removes the deep copy of llm_config during construction of the vLLM engine and moves the set_processor call into apply_checkpoint_info. It also starts using the real vLLM engine in testing_model, which is used by test_openai_compatibility.py, to catch these types of issues.
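To illustrate the failure mode described above, here is a minimal sketch of how a deep copy can leave two config objects out of sync. `ToyConfig`, `setup_engine_with_copy`, and `setup_engine_shared` are hypothetical names invented for this example; they are not the actual ray.serve.llm classes or functions, and the processor is modeled as a plain string rather than a real AutoProcessor.

```python
import copy
from dataclasses import dataclass
from typing import Optional


@dataclass
class ToyConfig:
    """Hypothetical stand-in for an LLM config (not the real ray.serve.llm class)."""

    model_id: str
    processor: Optional[str] = None


def setup_engine_with_copy(config: ToyConfig) -> ToyConfig:
    # Deep-copying first means the processor is only set on a private copy;
    # the caller's config never sees it.
    engine_config = copy.deepcopy(config)
    engine_config.processor = f"processor-for-{engine_config.model_id}"
    return engine_config


def setup_engine_shared(config: ToyConfig) -> ToyConfig:
    # Setting the processor on the shared object keeps the engine and the
    # caller looking at the same state.
    config.processor = f"processor-for-{config.model_id}"
    return config


# With the deep copy, the caller's config and the engine's config diverge.
caller_config = ToyConfig(model_id="toy-model")
engine_config = setup_engine_with_copy(caller_config)
copy_diverged = caller_config.processor != engine_config.processor

# Without the copy, both sides reference the same object.
shared_config = ToyConfig(model_id="toy-model")
engine_shared = setup_engine_shared(shared_config)
shared_in_sync = engine_shared is shared_config and shared_config.processor is not None
```

In this toy version, removing the copy is enough to keep both sides consistent, which mirrors the intent of the change described in this PR.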

Related issue number

Checks

- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
  - I've added any new APIs to the API Reference. For example, if I added a
    method in Tune, I've added it in doc/source/tune/api/ under the
    corresponding .rst file.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

Signed-off-by: Gene Su <e870252314@gmail.com>

GeneDer · 2025-02-19T05:15:08Z

I'm seeing this work end-to-end in a workspace.

kouroshHakha

LGTM

GeneDer added 2 commits February 18, 2025 20:14

[LLM] fix setting hf processor

eb4cac0

Signed-off-by: Gene Su <e870252314@gmail.com>

use the same llm config object in vllm engine setup and use real vllm engine in the test

9f8ff04

Signed-off-by: Gene Su <e870252314@gmail.com>

GeneDer marked this pull request as ready for review February 19, 2025 05:11

GeneDer requested a review from a team as a code owner February 19, 2025 05:11

GeneDer added the "go" label (add ONLY when ready to merge, run all tests) Feb 19, 2025

GeneDer requested a review from kouroshHakha February 19, 2025 05:15

GeneDer assigned kouroshHakha Feb 19, 2025

pass trust_remote_code into apply_checkpoint_info()

57833d6

Signed-off-by: Gene Su <e870252314@gmail.com>

kouroshHakha approved these changes Feb 19, 2025

View reviewed changes

kouroshHakha changed the title ~~[LLM] Fix setting up AutoProcessor~~ [ray.serve.llm] Fix setting up AutoProcessor Feb 19, 2025

kouroshHakha enabled auto-merge (squash) February 19, 2025 05:36

revert back to use mocked vllm engine in test, will follow up on real e2e release test

052c32e

Signed-off-by: Gene Su <e870252314@gmail.com>

github-actions bot disabled auto-merge February 19, 2025 07:52

kouroshHakha enabled auto-merge (squash) February 19, 2025 08:04

kouroshHakha merged commit 179af97 into ray-project:master Feb 19, 2025
6 checks passed

GeneDer deleted the fix-setting-processor branch February 20, 2025 00:50

israbbani pushed a commit that referenced this pull request Feb 25, 2025

[ray.serve.llm] Fix setting up AutoProcessor (#50715)

11fadfc

Signed-off-by: Gene Su <e870252314@gmail.com>
