papers Notes and summaries of papers I've read Notes and Summaries Distributed Training Large Scale Distributed Deep Networks [notes][paper] Deep Gradient Compression: Reducing the communication bandwidth for distributed training [notes][paper] License MIT © Manraj Singh