docker: add support for CUDA in docker #1461

canardleteer · 2023-05-15T00:37:51Z

Assuming one has the nvidia-container-toolkit installed on Linux, or is using a GPU enabled cloud, cuBLAS should be accessible inside the container.

I'm not very familiar with GitHub Actions, nor with the execution environments available on GitHub. I wouldn't suggest pre-building these and putting them in the registry, unless there's a CI path for them.

I'm not too sure what the right path for putting these into a registry is. But I did want to contribute so people could try it locally!

canardleteer · 2023-05-15T00:40:31Z

I should add a note, I haven't been able to get the:

-p "Building a website can be done in 10 simple steps:"

...part of the examples in README.md to work for me on Linux because of string escaping (this affects both the original and these new examples). I have tested it with --random-prompt, however, and it works fine for me, with BLAS = 1 and nvidia-smi showing memory usage by llama.cpp.

I just wanted to keep it in line with the other Docker examples.
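For anyone hitting the same escaping problem, here is a minimal sketch of one way to keep the prompt intact: build the command as an argument list and let Python's shlex module do the quoting. The image tag, the model path, and the layout of arguments after the image name are placeholders assumed for illustration; the thread does not specify them. Only --gpus (a standard docker run option that relies on nvidia-container-toolkit) and the -p prompt come from the discussion above.

```python
import shlex
from typing import List

# Hypothetical placeholders: the thread does not name the image tag or model path.
IMAGE = "local/llama.cpp:full-cuda"
MODEL = "/models/model.bin"
PROMPT = "Building a website can be done in 10 simple steps:"


def docker_argv(prompt: str) -> List[str]:
    """Build an argument vector for a GPU-enabled container run.

    The arguments after IMAGE are an assumed layout, not the image's
    documented interface.
    """
    return [
        "docker", "run",
        "--gpus", "all",   # expose host GPUs; needs nvidia-container-toolkit
        IMAGE,
        "-m", MODEL,
        "-p", prompt,      # a single list element, so it stays one argument
    ]


def shell_line(argv: List[str]) -> str:
    """Render argv as a POSIX shell command line with safe quoting."""
    return shlex.join(argv)


if __name__ == "__main__":
    # Print a line that can be pasted into a POSIX shell.
    print(shell_line(docker_argv(PROMPT)))
```

Passing the list directly to subprocess.run (without shell=True) avoids the shell altogether, which sidesteps the quoting question entirely.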

JohannesGaessler

I was thinking something like this would be nice to have but ultimately prioritized fixing the current issues like memory management first. Thanks.

JohannesGaessler

I'm not particularly knowledgeable about docker but to me it looks like everything is in order now.

canardleteer · 2023-05-15T20:58:32Z

Fixed a trailing white space error from the Makefile (and installed editorconfig for the future).

canardleteer · 2023-05-20T02:41:26Z

Could I get another run of CI on this?

dkarlovi

Small changes to make it slightly less confusing, otherwise LGTM, it works.

deep-pipeline · 2023-06-06T15:45:45Z

This looks really useful for integration/deployment of llama.cpp into docker hosted services on cloud!

Is there anything (apart from the vast amount of other activity ;-) ) holding up this being merged now?

On which note, I see that the total number of open pull requests (not issues!) has risen from, iirc, 53 to 65 since I last checked in, which is kinda great but also kinda scary because of the implied stress on supervision bandwidth.

canardleteer · 2023-06-06T18:29:49Z

The upstream Makefile now has a conflict that I will need to resolve, and will do so when I get a chance.

JohannesGaessler · 2023-06-06T18:46:27Z

I didn't merge this PR because I wanted someone else to check it as well; as I said, I'm not very knowledgeable about Docker.

canardleteer · 2023-06-06T18:51:43Z

I have rebased on the latest changes in master. A second set of eyes on my changes + a pipeline run would alleviate my concerns about those changes :)

ggerganov

Will wait for an extra approval before merging, as I'm unfamiliar with Docker.

deep-pipeline · 2023-07-05T22:55:16Z

Hi folks, just a very gentle nudge again on this front. Last time I checked, the total open pull requests on this repo were about 65; a month later they are over 80. As before, I'm worried that more project approval/supervision bandwidth needs to be allocated to clear the backlog a bit, so that older 'finished' PRs like this don't age stuck in limbo, suffer new integration issues, and need revisiting.

It would be a shame if good work gets buried/lost/stuck with PRs not folded into main (and if sprawling PR numbers mean stale PRs are not being explicitly closed or duplicates being coalesced).

The docker support stuff ought to be relatively independent of other changes - maybe worth a final CI check and folding in this one now - any problems for nvidia-docker use can be sorted via bug-reports by people trying it (who currently don't have the easy option).

Best, M.

canardleteer force-pushed the feat/docker-cuda branch 3 times, most recently from f4ac04e to fdad997 Compare May 15, 2023 01:25

JohannesGaessler reviewed May 15, 2023

JohannesGaessler requested changes May 15, 2023

canardleteer force-pushed the feat/docker-cuda branch from aeb43a1 to 9d121b7 Compare May 15, 2023 16:38

JohannesGaessler approved these changes May 15, 2023

dkarlovi suggested changes Jun 2, 2023

sime2408 mentioned this pull request Jun 2, 2023

VERY BIG performance improvement and beautiful features zylon-ai/private-gpt#521

Closed

deep-pipeline mentioned this pull request Jun 6, 2023

main: add the possibility to open the prompt cache read-only #1640

Merged

docker: add support for CUDA in docker

8c6c334

canardleteer force-pushed the feat/docker-cuda branch from 0484ffb to 8c6c334 Compare June 6, 2023 18:50

ggerganov requested a review from prusnak June 10, 2023 08:13

ggerganov approved these changes Jun 10, 2023

Merge branch 'master' into feat/docker-cuda

5d0e752

ggerganov merged commit 84525e7 into ggerganov:master Jul 7, 2023

canardleteer mentioned this pull request Sep 9, 2023

feat: docker gpu image CI builds #3103

Merged
