Is there a way to terminate vllm.LLM and release the GPU memory #1908

@sfc-gh-zhwang

Description

After running the code below, is there an API (maybe something like llm.terminate) to shut down the LLM and release the GPU memory?

from vllm import LLM, SamplingParams

prompts = [
    "Hello, my name is",
    "The president of the United States is",
    "The capital of France is",
    "The future of AI is",
]
sampling_params = SamplingParams(temperature=0.8, top_p=0.95)

# The original snippet never constructs `llm`; any model works, e.g. the
# vLLM quickstart default:
llm = LLM(model="facebook/opt-125m")
outputs = llm.generate(prompts, sampling_params)
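
There is no documented llm.terminate() in vLLM; the usual workaround is to drop every Python reference to the LLM object and then flush PyTorch's CUDA caching allocator. A minimal sketch, assuming the llm variable above holds the only live reference to the engine:

import gc

import torch

# Assumes `llm` (created above) is the only remaining reference to the engine.
del llm
gc.collect()               # reclaim the engine objects and their CUDA tensors
torch.cuda.empty_cache()   # return PyTorch's cached GPU blocks to the driver

Note that in tensor-parallel setups vLLM also holds distributed process-group state; some releases expose a destroy_model_parallel() helper for tearing that down, but its module path has moved between versions, so treat it as version-specific.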
