
Conversation


@tjruwase tjruwase commented Aug 9, 2021

API for obtaining the global unclipped gradient norm across all parameter groups. Based off #1286.
Optimizers are solely responsible for computing gradient norms. Gradient norms are computed (or refreshed) in optimizer.step().

@stas00 FYI
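
A minimal usage sketch of the intended workflow (not taken from this PR's diff): it assumes the DeepSpeed engine exposes a `get_global_grad_norm()`-style accessor that is populated during `optimizer.step()`, as described above; the accessor name and config values are illustrative assumptions.

```python
# Sketch only: assumes an engine-level accessor for the global unclipped
# gradient norm, refreshed inside optimizer.step(). Names are assumptions.
import torch
import deepspeed

model = torch.nn.Linear(10, 10)
engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config={"train_batch_size": 8, "fp16": {"enabled": True}},
)

inputs = torch.randn(8, 10, device=engine.device, dtype=torch.half)
loss = engine(inputs).sum()
engine.backward(loss)
engine.step()  # gradient norms are computed (or refreshed) here

# Query the global, unclipped gradient norm across all parameter groups.
grad_norm = engine.get_global_grad_norm()  # assumed accessor name
print(f"global grad norm: {grad_norm}")
```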


stas00 commented Aug 9, 2021

Thank you for working on this, @tjruwase

and then we will need Shaden's deepspeedai/Megatron-DeepSpeed#8 on the Megatron side once this is merged.


@ShadenSmith ShadenSmith left a comment


@samyam may also want to take a look

@tjruwase tjruwase merged commit cce85b8 into big-science Aug 9, 2021
jeffra pushed a commit that referenced this pull request Sep 8, 2021
* FP16 fused and unfused grad norm query.

* API for obtaining global unclipped gradient norm across parameter groups

* Use global norm not group norms

Co-authored-by: Shaden Smith <shaden.smith@microsoft.com>
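
The last bullet refers to clipping against one norm computed over all parameter groups rather than clipping each group against its own norm. A rough illustrative sketch of that aggregation follows; it is not the PR's actual implementation, and the helper names are made up for illustration.

```python
# Illustrative sketch: combine per-parameter-group L2 norms into a single
# global norm, then scale all gradients by the same factor.
import math

def global_grad_norm(param_groups):
    # ||g||_2 over all groups = sqrt(sum of squared per-parameter norms)
    total_sq = 0.0
    for group in param_groups:
        for p in group["params"]:
            if p.grad is not None:
                total_sq += p.grad.float().norm(2).item() ** 2
    return math.sqrt(total_sq)

def clip_by_global_norm(param_groups, max_norm):
    norm = global_grad_norm(param_groups)
    scale = max_norm / (norm + 1e-6)
    if scale < 1.0:
        for group in param_groups:
            for p in group["params"]:
                if p.grad is not None:
                    p.grad.mul_(scale)
    return norm  # return the unclipped global norm
```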
@mrwyattii mrwyattii deleted the olruwase/global_gradient_norm branch July 7, 2023 02:40