[Auto Parallel] fix data stream bug of dist.to_static #9077

Merged
merged 1 commit into from
Sep 5, 2024

Conversation

zhangyuqin1998
Contributor

PR types

Bug fixes

PR changes

Others

Description

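The to_static method calls next on the dataloader passed to it in order to obtain the spec information of the inputs. However, calling next changes the state of the sampler inside the dataloader. In some scenarios the sampler's state cannot be restored, which leaves the dataloader's data stream out of order. This PR therefore builds a temporary dataloader for that probe, so the dataloader actually used for training is not affected by the side effect.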
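The failure mode described above can be reproduced with a toy example in plain Python. This is only an illustrative sketch, not the code changed in this PR: `StatefulSampler`, `ToyLoader`, `probe_directly`, and `probe_with_temp_loader` are hypothetical names, and copying the sampler with `copy.deepcopy` is just one assumed way to obtain a throwaway loader.

```python
import copy


class StatefulSampler:
    """Toy sampler (hypothetical) that remembers how many indices it has handed out."""

    def __init__(self, num_samples):
        self.num_samples = num_samples
        self.consumed = 0  # persists across iterators, like a resumable sampler

    def __iter__(self):
        while self.consumed < self.num_samples:
            idx = self.consumed
            self.consumed += 1
            yield idx


class ToyLoader:
    """Toy stand-in for a dataloader: yields dataset items in sampler order."""

    def __init__(self, dataset, sampler):
        self.dataset = dataset
        self.sampler = sampler

    def __iter__(self):
        for idx in self.sampler:
            yield self.dataset[idx]


def probe_directly(loader):
    # Peeks at one sample to inspect its shape/dtype, but advances the real sampler.
    return next(iter(loader))


def probe_with_temp_loader(loader):
    # Peeks through a temporary loader with its own sampler copy,
    # so the training loader's sampler state is left untouched.
    temp = ToyLoader(loader.dataset, copy.deepcopy(loader.sampler))
    return next(iter(temp))


dataset = [10, 20, 30, 40]

loader_a = ToyLoader(dataset, StatefulSampler(len(dataset)))
probe_directly(loader_a)
seen_after_direct_probe = list(loader_a)  # first sample is lost to the probe

loader_b = ToyLoader(dataset, StatefulSampler(len(dataset)))
probe_with_temp_loader(loader_b)
seen_after_temp_probe = list(loader_b)  # training still sees every sample
```

In this sketch the direct probe silently consumes the first sample, while the temporary loader leaves the training data stream intact, which is the behavior the description aims for.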

paddle-bot bot commented Sep 4, 2024

Thanks for your contribution!

Collaborator

@From00 From00 left a comment

LGTM

@wawltor wawltor merged commit 8ee99a4 into PaddlePaddle:develop Sep 5, 2024
9 of 12 checks passed
ckl117 pushed a commit to ckl117/PaddleNLP that referenced this pull request Sep 9, 2024
Mangodadada pushed a commit to Mangodadada/PaddleNLP that referenced this pull request Sep 10, 2024

3 participants