-
I often play games while a1111 is still running in the background. It eats ~6 GB of my VRAM while loaded with whichever checkpoint. So is there a way to eject the loaded model somehow, without killing the process?
-
Settings > Actions > Unload SD checkpoint to RAM
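If you'd rather not click through the UI every time, recent versions of the webui also expose unload/reload endpoints over the API (when launched with `--api`). A minimal sketch, assuming the default local address and that your webui version ships these endpoints:

```python
import requests

BASE_URL = "http://127.0.0.1:7860"  # default local webui address; adjust if yours differs

# Ask the webui to unload the currently loaded SD checkpoint from VRAM.
resp = requests.post(f"{BASE_URL}/sdapi/v1/unload-checkpoint", timeout=60)
resp.raise_for_status()
print("Checkpoint unloaded")

# Before generating again, reload it:
# requests.post(f"{BASE_URL}/sdapi/v1/reload-checkpoint", timeout=60).raise_for_status()
```

This can be wired into a hotkey or a small script you run before launching a game.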
-
Unloading the checkpoint to RAM still eats up RAM if we change models frequently. How can we unload it completely, from both RAM and VRAM?
-
The Ollama framework has a really handy environment variable and API-accessible parameter: OLLAMA_KEEP_ALIVE=[# of seconds] | [xM] | 0. I think it's mostly used by people who want the last loaded chat model to stay loaded longer, but I set it to zero to keep the GPU VRAM as empty as possible, as soon as possible. This is because I have many users who mostly use the GPU for chat and occasionally for text-to-speech and SD image creation, which loads up the GPU VRAM. Unfortunately SDWeb keeps its last model loaded indefinitely. It would be great if SDWeb had a similar keep-alive option to let us decide how long to keep the last model loaded.
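For reference, the per-request keep_alive parameter in Ollama overrides the environment default on each call. A rough sketch against Ollama's /api/generate endpoint (the model name here is just an example):

```python
import requests

OLLAMA_URL = "http://127.0.0.1:11434"  # default Ollama address

# keep_alive=0 tells Ollama to drop the model from VRAM as soon as this
# request finishes, instead of keeping it resident (default is a few minutes).
resp = requests.post(
    f"{OLLAMA_URL}/api/generate",
    json={
        "model": "llama3",   # example model name
        "prompt": "Hello!",
        "stream": False,
        "keep_alive": 0,     # unload immediately after this response
    },
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["response"])
```

Until SDWeb grows a comparable option, calling its unload endpoint (as sketched earlier in this thread) from a wrapper script or a timer is one way to approximate the same behavior.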