Skip to content

[llm] Roadmap for Data and Serve LLM APIs #51313

@kouroshHakha

Description

@kouroshHakha

This document includes a list of issues / feature requests that we have collected across the oss and other channels. We’ll update this list with relevant info from issues, etc as we go. If there are any features that are not prioritized here, please feel free to open an RFC or feature request or post on the slack community channel. Follow this form for joining slack: https://www.ray.io/join-slack

Core features

Serve

Data

CI/CD and release pipeline

  • [P0] Release tests for structured output (llm.data) @lk-chen
  • [P0] For Serve release tests use gen-config on the critical path

Docs and community support

  • [P0] Cover gen-config in serve docs
  • [P0] Run doc-test on examples @lk-chen
  • [P0] Update vllm docs with ray cluster setup guide and serve and data code examples
  • [P1] Example of running deepseek R1 (huge model with ray serve multi node)

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions