[Misc]: Question: Where can I find getting computed kv cache code on v0.

Hi, I'm trying understanding the work flow of vLLM and I'm intersting in Prefix Caching.

So, I want to know the conditions of prefix caching and who (Scheduler, Executor, Worker, Runner etc. ) get kv cache.

In v1 code, I found it.

https://github.com/vllm-project/vllm/blob/067fa2255b6687ccaa79391dc9d1a08c7632f605/vllm/v1/core/scheduler.py#L215-L217

But, v0, I failed to find it. 

Does any one help me? 

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Misc]: Question: Where can I find getting computed kv cache code on v0. #13327

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

	# Get already-cached tokens.
	computed_blocks, num_computed_tokens = \
	self.kv_cache_manager.get_computed_blocks(request)

Uh oh!

[Misc]: Question: Where can I find getting computed kv cache code on v0. #13327

Description

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions