Conversation
|
With the torch version I recover the SOTA results indicated on papers with code, which are:
|
|
Currently needs this version of tf2cv : https://github.com/zaccharieramzi/imgclsmob/tree/config-wrn, for the TF WRN. This is because the serialization that we need (in order to apply coupled weight decay), does not work with the pypi version Also, it does not work for tf<=2.8 EDITNow my custom version of tf2cv is not needed anymore for the TF WRN to work in this benchmark. |
|
For now the TF version does not work: it's not learning despite using the exact same parameters than PyTorch on CIFAR-10. |
|
The problem is the following: EDITActually, we can work something out, and it would be helpful to handle models that are not exactly "flat". I think this makes the application of coupled weight decay more robust. |
…dels with complicated implementations
|
TF version of WRN now working (96.08 on CIFAR-10) -> this is ready to merge. |
Wide ResNet seem to be a very popular variant of ResNets, achieving sota in many benchmarks such as CIFAR-100, and being used in recent research papers in big institutions (like DeepMind, Google), sometimes with variants on init, regularization and data augmentation.