Skip to content

[Misc]: running multiple vLLM instances on a single ray cluster #14277

Open
@gitlawr

Description

@gitlawr

Anything you want to discuss about vllm.

When running the vLLM OpenAI server on a Ray cluster(with nodes A/B/C/D), I want to specify particular nodes(e.g., node A and B) for deployment, enabling better control over multiple vLLM instances within a single Ray cluster. Currently, it seems that Ray integration appears limited to specifying tp and pp.

Is supporting custom placement group a feasible option?

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

Metadata

Metadata

Assignees

No one assigned

    Labels

    miscrayanything related with raystaleOver 90 days of inactivity

    Type

    No type

    Projects

    Status

    Non-Bugs

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions