@wzzju wzzju commented Dec 7, 2020

PR types

Performance optimization

PR changes

OPs

Describe

Remove tensor copy in the update_loss_scaling op.

Batch Size = 128: [image]

Batch Size = 208: [image]
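For readers unfamiliar with the op, here is a minimal Python sketch of the dynamic loss scaling rule that an op like `update_loss_scaling` typically applies. It is not the code from this PR: the `LossScaleState` class, the default values, the floor of 1.0, and the finiteness check are assumptions for illustration, and the parameter names are chosen to resemble common AMP settings. The state is updated in place, which mirrors the general idea of avoiding extra copies, although the PR itself does not say which tensor copy was removed.

```python
import math
from dataclasses import dataclass


@dataclass
class LossScaleState:
    """Hypothetical container for the values a dynamic loss scaler tracks."""
    loss_scaling: float
    good_steps: int = 0  # consecutive steps with finite gradients
    bad_steps: int = 0   # consecutive steps with NaN/Inf gradients


def update_loss_scaling(state, found_inf,
                        incr_every_n_steps=1000,
                        decr_every_n_nan_or_inf=2,
                        incr_ratio=2.0,
                        decr_ratio=0.5):
    """Update `state` in place after one training step (illustrative only)."""
    if found_inf:
        # Overflow seen: reset the good-step streak and count the bad step.
        state.good_steps = 0
        state.bad_steps += 1
        if state.bad_steps == decr_every_n_nan_or_inf:
            # Shrink the scale, but never below 1.0 (assumed floor).
            state.loss_scaling = max(state.loss_scaling * decr_ratio, 1.0)
            state.bad_steps = 0
    else:
        # Finite gradients: reset the bad-step streak and count the good step.
        state.bad_steps = 0
        state.good_steps += 1
        if state.good_steps == incr_every_n_steps:
            new_scale = state.loss_scaling * incr_ratio
            # Keep the old scale if growing it would overflow to inf.
            if math.isfinite(new_scale):
                state.loss_scaling = new_scale
            state.good_steps = 0
    return state
```

Calling `update_loss_scaling(state, found_inf)` once per step keeps the scale and both counters current without allocating a new state object.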

paddle-bot-old bot commented Dec 7, 2020

Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

phlrain previously approved these changes Dec 7, 2020
@phlrain phlrain merged commit 5ac71b3 into PaddlePaddle:develop Dec 10, 2020
wzzju added a commit to wzzju/Paddle that referenced this pull request Jan 11, 2021
* remove tensor copy in the update_loss_scaling op

* not use thrust.

* fix some cuda memory access error.
lanxianghit pushed a commit that referenced this pull request Jan 11, 2021
* Support pure fp16 training for AMP API. (#29544)

* add cast ops before and after unsupported fp16 ops.

* Keep partial net in FP32 pattern.

* Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.

* Add fp16 support for adam op.

* add multi precision attr for adam.

* Fix the bug of test_multi_precision_fp16_train UT.

* Code format for CI.

* Fix the redefinition error for MPTypeTrait on Windows.

* fix bugs of the _create_accumulators func in Momentum.

* fix bug when inserting post cast op.

* Add the update_loss_scaling op in allow_set of UnusedVarCheck.

* Update for ci coverage.

* Add some doc for OptimizerWithMixedPrecision.

* Fix the code style.

* Improve the doc of `amp_init`.

* Change for fp16 testing if users have the infer program defined in a separate way.

* Remove tensor copy in the update_loss_scaling op. (#29426)

* remove tensor copy in the update_loss_scaling op

* not use thrust.

* fix some cuda memory access error.
