Open
Description
When I ran cifar10_alexnet
on cifar10 dataset, I used terngrad and set FLAGS.floating_grad_epoch=0
(leave other arguments default), achieved top_1=86.04
after 300000 steps.
And then I want to see original accuracy without terngrad and set FLAGS.floating_grad_epoch=1
,but I got top_1=85.64
after 300000 steps, which is not consistent with results in paper. I want to make sure that I used in a correct way.
By the way, if I want to test it on other models like densenet, what arguments should I modified? (weight decay? base lr?)
Metadata
Metadata
Assignees
Labels
No labels
Activity