Rework the AMP for TF XLNet #10274

jplu · 2021-02-19T10:47:20Z

What does this PR do?

This PR reworks the AMP of XLNet to remove some useless casts for better and less confusing AMP compliancy.

sgugger

Wondering since there are bfloat16 tests here. Does the mixed precision handle bfloat16 on TPUs? If not aren't we removing this functionality here?

jplu · 2021-02-19T14:21:12Z

Yes, bfloat16 is only for TPU. Hence, we cannot really test it elsewhere than inside a TPU context. I have added the bfloat16 condition only if XLNet is run on TPU because we were handling a specific case when the model is run under AMP.

Rework casts

83c00c7

jplu requested review from patrickvonplaten, LysandreJik and sgugger February 19, 2021 11:00

sgugger reviewed Feb 19, 2021

View reviewed changes

patrickvonplaten approved these changes Feb 24, 2021

View reviewed changes

LysandreJik approved these changes Feb 24, 2021

View reviewed changes

LysandreJik merged commit cdcdd5f into huggingface:master Feb 24, 2021

jplu deleted the tf-xlnet-amp branch February 25, 2021 10:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rework the AMP for TF XLNet #10274

Rework the AMP for TF XLNet #10274

jplu commented Feb 19, 2021

sgugger left a comment

jplu commented Feb 19, 2021

Rework the AMP for TF XLNet #10274

Rework the AMP for TF XLNet #10274

Conversation

jplu commented Feb 19, 2021

What does this PR do?

sgugger left a comment

Choose a reason for hiding this comment

jplu commented Feb 19, 2021