A compressed adaptive optimizer for training large-scale deep learning models using PyTorch
hashing deep-learning neural-network pytorch transformer imagenet count-min-sketch language-model adagrad adam-optimizer sgd-optimizer count-sketch sgd-momentum
-
Updated
Nov 26, 2019 - Python