2 files changed, +2 -0
lines changed Original file line number Diff line number Diff line change @@ -484,5 +484,6 @@ T2T](https://research.googleblog.com/2017/06/accelerating-deep-learning-research
484
484
* [ Adafactor: Adaptive Learning Rates with Sublinear Memory Cost] ( https://arxiv.org/abs/1804.04235 )
485
485
* [ Universal Transformers] ( https://arxiv.org/abs/1807.03819 )
486
486
* [ Attending to Mathematical Language with Transformers] ( https://arxiv.org/abs/1812.02825 )
487
+ * [ The Evolved Transformer] ( https://arxiv.org/abs/1901.11117 )
487
488
488
489
* Note: This is not an official Google product.*
@@ -484,5 +484,6 @@ T2T](https://research.googleblog.com/2017/06/accelerating-deep-learning-research
 * [Adafactor: Adaptive Learning Rates with Sublinear Memory Cost](https://arxiv.org/abs/1804.04235)
 * [Universal Transformers](https://arxiv.org/abs/1807.03819)
 * [Attending to Mathematical Language with Transformers](https://arxiv.org/abs/1812.02825)
+* [The Evolved Transformer](https://arxiv.org/abs/1901.11117)
 
 *Note: This is not an official Google product.*