Closed as not planned
Labels
enhancement (New feature or request)
Description
Feature request
The core (backend) of the `KvRouter` is the `KvIndexer`, which is written in a way that is incredibly general. In particular, the `WorkerId` type need not be restricted to `i64`, and in fact can be abstracted as a generic trait with minimal code change.
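To illustrate the kind of abstraction meant here, the following is a minimal sketch, not the actual `KvIndexer` code. It assumes that the indexer only needs worker identifiers to be copyable, hashable, and comparable. The names `WorkerKey`, `DpWorkerId`, and `ToyIndexer` are hypothetical and exist only for this example.

```rust
use std::collections::{HashMap, HashSet};
use std::fmt::Debug;
use std::hash::Hash;

/// Hypothetical bound: any cheap-to-copy, hashable, comparable type
/// can serve as a worker identifier.
pub trait WorkerKey: Copy + Eq + Hash + Debug {}
impl<T: Copy + Eq + Hash + Debug> WorkerKey for T {}

/// Hypothetical dp-aware identifier: a worker plus its data-parallel rank.
#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
pub struct DpWorkerId {
    pub worker: i64,
    pub dp_rank: u32,
}

/// Toy stand-in for an indexer, generic over the identifier type.
/// It maps each block hash to the set of workers that cache that block.
pub struct ToyIndexer<W: WorkerKey> {
    blocks: HashMap<u64, HashSet<W>>,
}

impl<W: WorkerKey> ToyIndexer<W> {
    pub fn new() -> Self {
        Self { blocks: HashMap::new() }
    }

    /// Record that `worker` holds every block in `block_hashes`.
    pub fn store(&mut self, worker: W, block_hashes: &[u64]) {
        for &h in block_hashes {
            self.blocks.entry(h).or_default().insert(worker);
        }
    }

    /// For each worker, count how many leading blocks of `request`
    /// it already holds (a contiguous prefix match).
    pub fn overlap_scores(&self, request: &[u64]) -> HashMap<W, usize> {
        let mut scores = HashMap::new();
        let mut alive: Option<HashSet<W>> = None;
        for h in request {
            let holders = match self.blocks.get(h) {
                Some(set) => set,
                None => break,
            };
            // Keep only workers that matched every block so far.
            let next: HashSet<W> = match &alive {
                None => holders.clone(),
                Some(prev) => prev.intersection(holders).copied().collect(),
            };
            if next.is_empty() {
                break;
            }
            for w in &next {
                *scores.entry(*w).or_insert(0) += 1;
            }
            alive = Some(next);
        }
        scores
    }
}
```

With this shape, `ToyIndexer<i64>` behaves like an indexer keyed by a plain integer id, while `ToyIndexer<DpWorkerId>` tracks each dp rank separately without any change to the indexing logic.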
The messy bits are in the frontend components and the LLM-engine-facing parts, including: per-dp-worker metric publishing (vllm-project/vllm#18925), dp-aware metric subscription and aggregation (#1181), and handing those metrics off to the core router components (#1285).
Describe the problem you're encountering
N/A
Describe alternatives you've tried
No response