[WIP] [deepspeed] from_pretrained deal with ignore_mismatched_sizes #17373

Draft
wants to merge 1 commit into
base: main
from


stas00
Contributor

@stas00 stas00 commented May 20, 2022

An attempt to fix the issue reported in #17336.

Fixes: #17336

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@stas00
Contributor Author

stas00 commented May 25, 2022

After writing a test, I discovered that this approach breaks on tied variables, since `model.named_parameters` ignores them. So it's back to the drawing board.
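
For readers unfamiliar with the tied-variable behaviour, here is a minimal sketch in plain PyTorch. It is not the code from this PR, and `TinyTiedModel` is a hypothetical toy module invented for illustration. It assumes the default behaviour of `nn.Module.named_parameters()`, which yields each shared tensor only once, while `state_dict()` still lists every name that points at it.

```python
import torch
from torch import nn


class TinyTiedModel(nn.Module):
    """Hypothetical toy model whose output projection shares the embedding weight."""

    def __init__(self, vocab_size=10, hidden=4):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.lm_head = nn.Linear(hidden, vocab_size, bias=False)
        # Tie the weights: both modules now hold the very same Parameter object.
        self.lm_head.weight = self.embed.weight


model = TinyTiedModel()

# named_parameters() deduplicates shared tensors by default,
# so the tied name does not show up here.
param_names = [name for name, _ in model.named_parameters()]

# state_dict() keeps one entry per name, including the tied one.
state_keys = list(model.state_dict().keys())

# Names a loop over named_parameters() would never visit.
missing = sorted(set(state_keys) - set(param_names))

print("named_parameters:", param_names)
print("state_dict keys: ", state_keys)
print("not visited:     ", missing)
```

Any size check that walks `named_parameters()` would therefore skip `lm_head.weight` in this toy case, which is presumably the kind of gap the test exposed.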

@huggingface huggingface deleted a comment from github-actions bot Jun 20, 2022
@huggingface huggingface deleted a comment from github-actions bot Jul 15, 2022
@LysandreJik LysandreJik added the WIP Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress label Aug 10, 2022
@huggingface huggingface deleted a comment from github-actions bot Aug 10, 2022
Labels
WIP Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress
Projects
None yet
Development

Successfully merging this pull request may close these issues.

issue with loading pretrained model using DeepSpeed Zero Stage 3
3 participants