Feat: Router observability (Current QPS, router-side queueing delay, etc) #78

@sitloboi2012

Description

This issue is dedicated to discussing the following feature:

(P1) Router observability (current QPS, router-side queueing delay, number of pending / prefilling / decoding requests, average prefill / decoding length, etc.)
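
To make the discussion concrete, here is a minimal sketch of how a few of these metrics (current QPS, router-side queueing delay, number of pending requests) might be tracked over a sliding window. Everything in it is an assumption for illustration: the `RouterStats` class, its method names, the `window_s` parameter, and the idea that the router calls `on_arrival` / `on_dispatch` per request are all hypothetical and not taken from the router's actual code.

```python
import time
from collections import deque


class RouterStats:
    """Hypothetical in-memory tracker for a few router-side metrics."""

    def __init__(self, window_s=10.0, clock=time.monotonic):
        self.window_s = window_s     # sliding-window length in seconds (assumed)
        self.clock = clock           # injectable clock, handy for testing
        self.arrivals = deque()      # arrival timestamps inside the window
        self.enqueued_at = {}        # request id -> arrival time while pending
        self.queue_delays = deque()  # (dispatch time, delay) pairs inside the window

    def _trim(self, now):
        # Drop samples that have fallen out of the sliding window.
        cutoff = now - self.window_s
        while self.arrivals and self.arrivals[0] < cutoff:
            self.arrivals.popleft()
        while self.queue_delays and self.queue_delays[0][0] < cutoff:
            self.queue_delays.popleft()

    def on_arrival(self, request_id):
        # Called when a request reaches the router.
        now = self.clock()
        self.arrivals.append(now)
        self.enqueued_at[request_id] = now

    def on_dispatch(self, request_id):
        # Called when the router forwards the request to a backend.
        now = self.clock()
        delay = now - self.enqueued_at.pop(request_id)
        self.queue_delays.append((now, delay))

    def snapshot(self):
        # Current values of the tracked metrics.
        now = self.clock()
        self._trim(now)
        delays = [d for _, d in self.queue_delays]
        return {
            "qps": len(self.arrivals) / self.window_s,
            "pending": len(self.enqueued_at),
            "avg_queue_delay_s": sum(delays) / len(delays) if delays else 0.0,
        }
```

The prefilling / decoding counts and average lengths would presumably need information reported back by the serving engines, so they are left out of this sketch.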
