Open
Labels
hardware, help wanted, research 🔬, 🦙.
Description
Now that distributed inference is supported, thanks to the work of @evanmiller in #2099, it would be fun to try to use it for something cool. One such idea is to connect a bunch of Raspberry Pis in a local network and run the inference using MPI:
# sample cluster of 8 devices (replace with actual IP addresses of the devices)
$ cat ./hostfile
192.168.0.1:1
192.168.0.2:1
192.168.0.3:1
192.168.0.4:1
192.168.0.5:1
192.168.0.6:1
192.168.0.7:1
192.168.0.8:1
# build with MPI support
$ make CC=mpicc CXX=mpicxx LLAMA_MPI=1 -j
# run distributed inference over 8 nodes
$ mpirun -hostfile ./hostfile -n 8 ./main -m /mnt/models/65B/ggml-model-q4_0.bin -p "I believe the meaning of life is" -n 64
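For larger or differently numbered clusters, the hostfile does not have to be typed by hand. The following is a minimal sketch, assuming the devices have consecutive addresses under a common prefix as in the sample above; `make_hostfile` is a hypothetical helper name, not something shipped with the project.

```shell
# Hypothetical helper: print one "host:slots" line per device.
# Arguments: network prefix, number of devices, slots per device (default 1).
# Assumes the devices are numbered consecutively starting at .1.
make_hostfile() {
    prefix="$1"
    count="$2"
    slots="${3:-1}"
    i=1
    while [ "$i" -le "$count" ]; do
        printf '%s.%s:%s\n' "$prefix" "$i" "$slots"
        i=$((i + 1))
    done
}

# Print the same 8 entries as the sample hostfile
make_hostfile 192.168.0 8
```

Redirecting the output (`make_hostfile 192.168.0 8 > ./hostfile`) would produce a file in the same format as the sample.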
The mpirun command above assumes that the 65B model data is located on a network share under /mnt and that mmap works over a network share. I'm not sure that is the case. If it isn't, this experiment will be more difficult to perform.
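A first sanity check on each node might be to confirm which filesystem actually backs the model directory. This is only a sketch: it assumes GNU coreutils `stat` (the options differ on BSD/macOS), `fs_type` is a hypothetical helper name, and knowing the filesystem type does not by itself prove that mmap behaves well on it.

```shell
# Hypothetical helper: print the filesystem type backing a path.
# Relies on GNU coreutils `stat`; -f queries the filesystem, -c %T prints its type.
fs_type() {
    stat -f -c %T "$1"
}

# Directory taken from the example command; fall back to / if it is not mounted here.
model_dir=/mnt/models/65B
[ -d "$model_dir" ] || model_dir=/
echo "$model_dir is on: $(fs_type "$model_dir")"
```

On a node where the share is mounted, a network filesystem would typically be reported as something like `nfs`, whereas a local disk would show a local type such as `ext2/ext3`.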
Looking for people with access to the necessary hardware to perform this experiment.