Fix the FSDP extension to make load_state_dict work for 2D. #570

Merged
merged 1 commit into from
Oct 25, 2022

@fegin fegin commented Oct 22, 2022

FSDP should see only the local (inner) ShardedTensor, not the global ShardedTensor. This PR fixes the issue and makes checkpointing with 2D parallelism work.
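
To illustrate the distinction between the global and the inner tensor, here is a toy sketch in plain Python. It is not the extension code from this PR and uses no PyTorch APIs; the names `Shard`, `chunk`, `shard_2d`, and `fsdp_view` are hypothetical, and the layout (tensor parallelism splits first, FSDP splits the result) is an assumption made for the example.

```python
from dataclasses import dataclass


@dataclass
class Shard:
    offset: int  # start offset, relative to the tensor that was chunked
    data: list


def chunk(values, num_chunks, index):
    """Return the index-th of num_chunks equal contiguous chunks."""
    size = len(values) // num_chunks
    start = index * size
    return Shard(offset=start, data=values[start:start + size])


def shard_2d(global_values, tp_size, dp_size, tp_rank, dp_rank):
    """Assumed 2D layout: split across TP ranks, then across FSDP ranks."""
    # Tensor parallelism owns the split of the global tensor.
    tp_local = chunk(global_values, tp_size, tp_rank)
    # FSDP splits only the TP-local ("inner") piece.
    fsdp_local = chunk(tp_local.data, dp_size, dp_rank)
    return tp_local, fsdp_local


def fsdp_view(tensor_seen_by_fsdp, dp_size, dp_rank):
    """The shard FSDP would load, given whichever tensor it is handed."""
    return chunk(tensor_seen_by_fsdp, dp_size, dp_rank)


global_values = list(range(16))
tp_local, fsdp_local = shard_2d(
    global_values, tp_size=2, dp_size=2, tp_rank=1, dp_rank=0
)

# Handing FSDP the inner tensor reproduces the shard it actually owns.
correct = fsdp_view(tp_local.data, dp_size=2, dp_rank=0)
# Handing FSDP the global tensor selects data owned by another TP rank.
wrong = fsdp_view(global_values, dp_size=2, dp_rank=0)
```

In this toy layout, `correct` matches `fsdp_local`, while `wrong` covers elements that belong to TP rank 0, which is the kind of mismatch that can arise when the FSDP layer is shown the global tensor during loading.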

@wanchaol wanchaol left a comment

lgtm, thanks for the quick fix!

@fduwjj fduwjj self-requested a review October 22, 2022 03:52
@fduwjj fduwjj left a comment

Thanks for the fix. Does this mean the 2D test now passes?

@wz337 wz337 left a comment

Thanks for the quick fix. LGTM!

@wz337 wz337 mentioned this pull request Oct 24, 2022
@fegin fegin merged commit 0064095 into main Oct 25, 2022
@wanchaol wanchaol deleted the fsdp_extension_fix branch October 26, 2022 06:46