Add flexibility of pipeline parallel module and engine #1399
Conversation
Heh, @thomasw21 was just dealing with it: bigscience-workshop/Megatron-DeepSpeed#107. You can see from that PR that @ShadenSmith is looking at fixing bool tensor support and removing this hack altogether.
Hey, thanks @stas00. Yeah, I've been working to fix some issues I've been having with the current pipeline parallelism implementation. I've just opened a draft PR if you want to check it out: #1400. Note that it doesn't work yet, and I'm in the process of debugging a deadlock issue.
@hyunwoongko, thanks for your hard work on this PR. I will take a look today so we can wrap it up soon. Thanks for your patience.
@hyunwoongko, I left some comments. Thanks.
I have split the changes into two separate PRs.
What for?
Add flexibility of pipeline parallel module and engine (#1347)
Request review
@ShadenSmith @jeffra @sdtblck