# rllama docker on nvidia

## Getting OpenCL to work inside docker.

Please note that this also requires installing some packages and making changes on your host system so that containers can use NVIDIA GPU features such as compute.

For each of the described distros / distro families (Fedora / Fedora-based, Debian / Debian-based / Ubuntu / Ubuntu-based, Arch / Arch-based), follow the instructions at the given links below.

Note: You also need an up-to-date version of docker/docker-ce, so be sure to follow the instructions on the Docker website to install Docker for your distro.
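A quick version check can catch an outdated install early, since the `--gpus` flag used in the run command was added in Docker 19.03. The following is a minimal sketch and not part of rllama: the helper name `version_ge` is hypothetical, and it assumes GNU `sort` with `-V` (version sort) is available.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of rllama): succeeds if version $1 >= version $2.
# Assumes GNU sort with -V (version sort).
version_ge() {
    [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n1)" = "$2" ]
}

# The --gpus flag was added in Docker 19.03.
required="19.03"

if command -v docker >/dev/null 2>&1; then
    # Ask the docker CLI for its own version; empty if the call fails.
    installed="$(docker version --format '{{.Client.Version}}' 2>/dev/null)"
    if version_ge "${installed:-0}" "$required"; then
        echo "docker $installed supports --gpus"
    else
        echo "docker ${installed:-unknown} is older than $required; please upgrade" >&2
    fi
else
    echo "docker not found; install it from the Docker website" >&2
fi
```

This only checks the Docker client version. It does not verify that the NVIDIA-specific host setup is in place.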

Note 2: I have personally tested the instructions only on Fedora/Nobara, and hence cannot guarantee their accuracy for other distros.

Feel free to contribute/improve the instructions for existing and other distros.

## Usage

```shell
docker build -f ./.docker/nvidia.dockerfile -t rllama:nvidia .
```

```shell
docker run --rm --gpus all --privileged -v /models/LLaMA:/models:z -it rllama:nvidia \
    rllama --model-path /models/7B \
           --param-path /models/7B/params.json \
           --tokenizer-path /models/tokenizer.model \
           --prompt "hi I like cheese"
```

Replace `/models/LLaMA` with the directory you've downloaded your models to. The `:z` suffix in the `-v` flag tells Docker to apply a shared SELinux label to the mounted directory; it may or may not be needed depending on your distribution (I needed it on Fedora Linux).
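If you are unsure whether you need `:z`, one option is to derive it from the host's SELinux mode. This is a rough sketch under stated assumptions, not part of rllama: `mount_suffix` and `MODEL_DIR` are hypothetical names, and it assumes that `:z` only matters when `getenforce` reports `Enforcing`.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of rllama): print ":z" when the given
# SELinux mode is "Enforcing", otherwise print nothing.
mount_suffix() {
    case "$1" in
        Enforcing) printf ':z' ;;
        *)         printf '' ;;
    esac
}

# MODEL_DIR is an assumed variable name; point it at your downloaded models.
MODEL_DIR="${MODEL_DIR:-/models/LLaMA}"

# getenforce ships with the SELinux tooling; treat its absence as "Disabled".
selinux_mode="$(getenforce 2>/dev/null || echo Disabled)"

# Print the volume argument so it can be reviewed and pasted into docker run.
echo "-v ${MODEL_DIR}:/models$(mount_suffix "$selinux_mode")"
```

The script only prints the `-v` argument rather than running the container, so you can inspect the result before using it in the `docker run` command.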
