Skip to content

Commit 28f7964

Browse files
committed
[V0][Metrics] Remove unimplemented vllm:tokens_total
It looks like vllm-project#4464 intended to add this alongside the vllm:iteration_tokens_total histogram, but didn't actully hook it up and would never have appeard in /metrics. Since it's clearly not critical to anyone, let's just remove it until we hear of a need for it. Signed-off-by: Mark McLoughlin <markmc@redhat.com>
1 parent e584b85 commit 28f7964

File tree

1 file changed

+0
-4
lines changed

1 file changed

+0
-4
lines changed

vllm/engine/metrics.py

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -115,10 +115,6 @@ def __init__(self, labelnames: List[str], vllm_config: VllmConfig):
115115
name="vllm:generation_tokens_total",
116116
documentation="Number of generation tokens processed.",
117117
labelnames=labelnames)
118-
self.counter_tokens = self._counter_cls(
119-
name="vllm:tokens_total",
120-
documentation="Number of prefill plus generation tokens processed.",
121-
labelnames=labelnames)
122118
buckets = [1, 8, 16, 32, 64, 128, 256, 512, 1024, 2048, 4096, 8096]
123119
if not vllm_config.model_config.enforce_eager:
124120
buckets = vllm_config.compilation_config.\

0 commit comments

Comments
 (0)