RaNNC is automatic parallelization middleware used to train very large-scale neural networks. Since modern networks often have billions of parameters, they do not fit in the memory of a single GPU. RaNNC automatically partitions such a huge network using model parallelism and computes it with multiple GPUs.
Unlike existing frameworks such as Megatron-LM and Mesh-TensorFlow, which require users to implement the partitioning of a given network themselves, RaNNC automatically partitions a PyTorch network without any modification to its description. In addition, RaNNC places essentially no limitation on the network architecture, whereas the existing frameworks are applicable only to Transformer-based networks.
The code below shows a simple usage of RaNNC. Following the style of PyTorch's data parallelism, RaNNC expects the training script to be launched with MPI so that the number of processes matches the number of available GPUs.
import torch
import torch.optim as optim
import pyrannc

model = Net() # Define a network
model.to(torch.device("cuda")) # Move parameters to a cuda device
optimizer = optim.Adam(model.parameters(), lr=0.01) # Define an optimizer
model = pyrannc.RaNNCModule(model, optimizer) ##### Wrap by RaNNCModule #####
loss = model(input) # Run a forward pass
loss.backward() # Run a backward pass
optimizer.step() # Update parameters
You only need to insert the line marked with ##### above. RaNNC profiles the computation time and memory usage of each component of the network and determines a partitioning such that each fragment fits in GPU memory and the training throughput is optimized.
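To put the snippet in context, here is a minimal end-to-end sketch of a training loop built around RaNNCModule. The Net definition, layer sizes, batch shapes, loop, zero_grad call, and the mpirun launch line in the comment are illustrative assumptions rather than part of RaNNC's documented example; only the wrapping pattern itself comes from the snippet above.

# Assumed launch (one process per available GPU), e.g.: mpirun -np 4 python train.py
import torch
import torch.nn as nn
import torch.optim as optim
import pyrannc

class Net(nn.Module):
    # Toy model whose forward returns the loss, matching the snippet above (hypothetical layer sizes)
    def __init__(self):
        super().__init__()
        self.layers = nn.Sequential(
            nn.Linear(1024, 4096), nn.ReLU(), nn.Linear(4096, 10))
        self.criterion = nn.CrossEntropyLoss()

    def forward(self, x, target):
        return self.criterion(self.layers(x), target)

model = Net()
model.to(torch.device("cuda"))
optimizer = optim.Adam(model.parameters(), lr=0.01)
model = pyrannc.RaNNCModule(model, optimizer)  ##### Wrap by RaNNCModule #####

for step in range(100):
    # Dummy batch; a real script would draw these from a DataLoader
    x = torch.randn(64, 1024, device="cuda")
    target = torch.randint(0, 10, (64,), device="cuda")
    optimizer.zero_grad()
    loss = model(x, target)  # Forward pass through the automatically partitioned model
    loss.backward()          # Backward pass
    optimizer.step()         # Parameter update

Note that the training loop itself is ordinary PyTorch; the only RaNNC-specific step remains wrapping the model and optimizer with RaNNCModule.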