fix flow.load in multi nodes and add test #8314

Merged: 7 commits, May 27, 2022

daquexian
Contributor

Fix the error raised by flow.load when running on multiple machines.

Signed-off-by: daquexian <daquexian566@gmail.com>
@@ -295,10 +295,10 @@ def load(path: str, global_src_rank: Optional[int] = None,) -> Any:
The loaded object
"""
path: Path = Path(path)
assert path.is_dir(), "Directory {} doesn't exist!".format(path)
pickle_path = path / PICKLE_FILENAME
Contributor

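Previously, a machine that did not have the dir would raise an error here, right? And after the error it would not hang, but would end up bringing down all ranks?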

Contributor Author

Yes, that's right.
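
For readers following the thread above: the concern is that the `path.is_dir()` assertion ran on every rank, so a machine that did not hold the checkpoint directory failed. Below is a minimal sketch of one way to guard that check, assuming only the rank named by `global_src_rank` needs the directory on disk. The helper `check_checkpoint_dir` and its `rank` parameter are hypothetical names for illustration, not part of OneFlow's API, and this is not the actual patch from this PR.

```python
from pathlib import Path
from typing import Optional


def check_checkpoint_dir(
    path: str, rank: int, global_src_rank: Optional[int] = None
) -> bool:
    """Hypothetical helper: validate the checkpoint directory only where needed.

    Returns True if this rank validated the directory, False if it skipped
    the check because another rank is the designated source.
    """
    # Assumption: when global_src_rank is None, every rank loads from its
    # own local copy, so every rank must have the directory.
    if global_src_rank is not None and rank != global_src_rank:
        # Non-source ranks are assumed to receive the data from the source
        # rank, so a missing local directory is not an error for them.
        return False

    p = Path(path)
    # Same assertion as in the diff context, now reached only by ranks
    # that actually read from disk.
    assert p.is_dir(), "Directory {} doesn't exist!".format(p)
    return True
```

With this guard, a call such as `check_checkpoint_dir("/no/such/dir", rank=1, global_src_rank=0)` returns `False` instead of raising, while rank 0 still fails fast if the directory is missing.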

@daquexian daquexian requested a review from oneflow-ci-bot May 27, 2022 01:22
@daquexian daquexian enabled auto-merge (squash) May 27, 2022 01:22
@github-actions
Contributor

View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/8314/

@daquexian daquexian merged commit ede579f into master May 27, 2022
@daquexian daquexian deleted the fix_load_in_multi_nodes branch May 27, 2022 05:26
@CPFLAME CPFLAME mentioned this pull request May 27, 2022
2 tasks
3 participants