docker-compose up -d
ssh -i id_rsa pi@localhost
to start an SSH session in the master node.ssh-keyscan slave > .ssh/known_hosts
to allow future SSH connections from themaster
container to theslave
container without a (yes/no) promptmpirun --host master,slave hostname
to check if the connection is set up correctly (this should printmaster
andslave
)mpirun --host master,slave python3 src/example_ddp.py
(the final parameters of the model should be the same for all ranks, which indicates that the distributed training was successful)
-
Notifications
You must be signed in to change notification settings - Fork 1
License
jkhlr/pytorch-distributed-rnn
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published