warmup_steps: dataset: WMT16 | warmup steps | passed steps before NAN | | - | - | | 1 | 2 | | 10 | 7 | | 100 | 494 | | 1000 | 751 | | 10000 | - |