Closed
Labels
Stale, awaiting response (waiting for the author of the issue to provide more information or answer a question)
Description
Right now, running Ollama via Docker on macOS doesn’t use the GPU because Docker can’t access Apple Silicon GPUs. That’s a pretty big performance hit, especially when working with larger models.
This blog post lays it out clearly: https://chariotsolutions.com/blog/post/apple-silicon-gpus-docker-and-ollama-pick-two/
When you run Ollama natively, it detects and uses the Metal backend:
level=INFO msg="Dynamic LLM libraries [metal]"
But in Docker:
level=INFO msg="no GPU detected"
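For anyone who wants to check which case they are in, here is a rough sketch that scans captured Ollama log output for the two messages quoted above. The function name `gpu_status` and the return labels are made up for illustration, and the patterns assume the log wording is exactly as shown here; other Ollama versions may phrase these messages differently.

```python
import re

# Patterns copied from the two log lines quoted in this issue.
# Assumption: the wording matches your Ollama version.
LIBS_RE = re.compile(r'msg="Dynamic LLM libraries \[([^\]]*)\]"')
NO_GPU_RE = re.compile(r'msg="no GPU detected"')


def gpu_status(log_text: str) -> str:
    """Classify a chunk of Ollama log output (hypothetical helper).

    Returns 'metal' if the Metal backend is listed, 'no-gpu' if the
    "no GPU detected" message appears first, and 'unknown' otherwise.
    """
    for line in log_text.splitlines():
        match = LIBS_RE.search(line)
        if match and "metal" in match.group(1).split():
            return "metal"
        if NO_GPU_RE.search(line):
            return "no-gpu"
    return "unknown"


if __name__ == "__main__":
    import sys

    # Read log text from stdin and print the classification.
    print(gpu_status(sys.stdin.read()))
```

Saved as a script, it could be fed the saved log output from a native run or from a container to confirm which backend was picked up.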
It’d be great if Ollama could run natively on macOS without Docker, so we can take advantage of Apple’s GPU acceleration. Performance and dev experience would be much better.