-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Avoid unused parameters assert by default #1039
Conversation
…oft/DeepSpeed into olruwase/assert_unused_parameters
Changes behavior of #945. |
@tjruwase Thanks for notification. I noticed you changed two things, I not very sure why that is better, can you explain?
|
Besides, I found you forgot to change the doc: https://github.com/microsoft/DeepSpeed/blob/master/docs/_pages/config-json.md#zero-optimizations-for-fp16-training. |
* Unused parameters assert should be disabled by default * Fix message * Invert assert logic in unit test * Change option for ignoring unused parameters Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Enabling this by default will break dynamic networks such as g-shard and MoE, which train a subset of parameters in different iterations.
I think these two are doing slightly different things. |
@tjruwase Thanks for the detailed explanation, especially the difference of |
* Unused parameters assert should be disabled by default * Fix message * Invert assert logic in unit test * Change option for ignoring unused parameters Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Unused parameters assert should be disabled by default