add support for Ollama num_gpu #4353


Merged: 5 commits, Apr 29, 2025

Conversation

@Fmstrat (Contributor) commented Feb 25, 2025

Description

Added support for Ollama's num_gpu option. This allows forcing models onto the GPU in memory-limited situations. Modeled after existing Ollama-specific options such as keepAlive.
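For context, num_gpu is one of Ollama's documented model options (it controls how many layers are offloaded to the GPU) and can be passed in the options object of an /api/generate request. A minimal sketch of building such a request body, where buildRequestBody is a hypothetical helper written for illustration:

```typescript
// Sketch: build an Ollama /api/generate request body that sets num_gpu.
// "num_gpu" and "num_ctx" are real Ollama option names; the helper and
// interface below are illustrative assumptions, not Continue's code.
interface OllamaOptions {
  num_ctx?: number; // context window size
  num_gpu?: number; // number of layers to offload to the GPU
}

function buildRequestBody(
  model: string,
  prompt: string,
  options: OllamaOptions,
): string {
  return JSON.stringify({ model, prompt, options });
}

const body = buildRequestBody("qwen2.5-coder:14b", "Hello", { num_gpu: 1000 });
console.log(body);
```

A value larger than the model's layer count (like 1000 here) asks Ollama to put every layer it can on the GPU.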

Checklist

  • The relevant docs, if any, have been updated or created
  • The relevant tests, if any, have been updated or created

Testing instructions

Set up your config with numGpu:

    {
      "title": "Qwen2.5 Coder 14B (Ollama)",
      "provider": "ollama",
      "model": "qwen2.5-coder:14b",
      "apiBase": "http://localhost:11434",
      "contextLength": 2048,
      "completionOptions": {
          "maxTokens": 1024,
          "numGpu": 1000
      },
      "keepAlive": 0
    },

Watch it pass through to Ollama via the Ollama logs (--n-gpu-layers 1000):

level=INFO source=server.go:376 msg="starting llama server" cmd="/usr/lib/ollama/runners/cuda_v12_avx/ollama_llama_server runner --model /root/.ollama/models/blobs/sha256-ac9bc7a69dab38da1c790838955f1293420b55ab555ef6b4615efa1c1507b1ed --ctx-size 2048 --batch-size 512 --n-gpu-layers 1000 --threads 8 --no-mmap --parallel 1 --tensor-split 35,13 --port 33783"
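The pass-through above amounts to translating Continue's camelCase completionOptions into Ollama's snake_case options object. A simplified sketch of that mapping, assuming a reduced CompletionOptions shape (this is illustrative, not Continue's actual implementation; the Ollama-side names num_predict, num_gpu, and num_ctx are real Ollama options):

```typescript
// Simplified sketch: map Continue-style completionOptions to Ollama options.
// The CompletionOptions interface here is an assumption for illustration.
interface CompletionOptions {
  maxTokens?: number; // maps to Ollama's num_predict
  numGpu?: number; // maps to Ollama's num_gpu
  contextLength?: number; // maps to Ollama's num_ctx
}

function toOllamaOptions(opts: CompletionOptions): Record<string, number> {
  const out: Record<string, number> = {};
  if (opts.maxTokens !== undefined) out.num_predict = opts.maxTokens;
  if (opts.numGpu !== undefined) out.num_gpu = opts.numGpu;
  if (opts.contextLength !== undefined) out.num_ctx = opts.contextLength;
  return out;
}

console.log(toOllamaOptions({ maxTokens: 1024, numGpu: 1000 }));
```

With the config from the testing instructions, num_gpu=1000 reaches the llama server as --n-gpu-layers 1000, which is the line to look for in the log output.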

netlify bot commented Feb 25, 2025

Deploy Preview for continuedev ready!

🔨 Latest commit: abfd726
🔍 Latest deploy log: https://app.netlify.com/sites/continuedev/deploys/68112810d5363400088fe068
😎 Deploy Preview: https://deploy-preview-4353--continuedev.netlify.app

@Fmstrat (Contributor, Author) commented Feb 25, 2025

I think this is failing due to an unrelated timeout?

@warcow105 commented
This is a welcome addition. It will allow the use of larger models or deeper context without forcing those changes on all of the models.

sestinj previously approved these changes Apr 29, 2025
@sestinj requested a review from a team as a code owner April 29, 2025 19:27
@sestinj requested review from tomasz-stefaniak and removed the request for a team April 29, 2025 19:27
@sestinj merged commit ffd0e9c into continuedev:main Apr 29, 2025
32 checks passed
@sestinj (Contributor) commented May 15, 2025

Hi @Fmstrat, yesterday we shared some updates with our contributors about how we're aiming to improve the contribution process. Part of this included the addition of a Contributor License Agreement (CLA) to protect both contributors and the project. We're reaching out to ask that previous contributors sign it.

Could you please take a moment to sign, or send me a message if you have any questions? (either here or nate@continue.dev works)

To do so, you just need to post a comment below with the following text:

I have read the CLA Document and I hereby sign the CLA

❤️ Thank you for the work you've done on Continue, and let me know if you have any suggestions on how we can make the project even better!

Thank you for your submission, we really appreciate it. Like many open-source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution. You can sign the CLA by posting a Pull Request comment in the format below.


I have read the CLA Document and I hereby sign the CLA


1 out of 2 committers have signed the CLA.
✅ [sestinj](https://github.com/sestinj)
@Fmstrat
You can retrigger this bot by commenting recheck in this Pull Request. Posted by the CLA Assistant Lite bot.
