[Bug]: high cpu utilization when there is no inference task

### Your current environment

I deploy models with the `vllm/vllm-openai:v0.9.0.1` docker image.
The command line arguments: `--tensor-parallel-size 2` and `--enforce-eager`.




### 🐛 Describe the bug

The CPU utlization is high even if no tasks are running.

<img width="588" alt="Image" src="https://github.com/user-attachments/assets/1fe2034e-5b2f-46a5-a665-f9fa407386f1" />

This does not depend on models. I encountered the issue with both qwen3 and devstral models.

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bug]: high cpu utilization when there is no inference task #19243

Your current environment

🐛 Describe the bug

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Bug]: high cpu utilization when there is no inference task #19243

Description

Your current environment

🐛 Describe the bug

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions