Commit 86386e2

Rework working buffer allocation, which noticeably reduces VRAM use
Clean up CPU assist code, replacing it with the ggml-backend offload function
1 parent f315402 commit 86386e2

4 files changed: +89 -309 lines changed
