[RL] fast weight update with zmq + ipc handles #24295
Code Review
This pull request introduces a new mechanism for fast, in-place model weight updates using ZMQ for inter-process communication, which is a good improvement for RLHF-style workloads. The implementation on the worker side is mostly solid, but it relies on a brittle internal detail of PyTorch. Additionally, the example script demonstrating this feature contains a performance bottleneck that undermines the goal of "fast" updates. My review provides suggestions to address these issues.
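As a rough illustration of the handle-passing idea behind this kind of update (not the code in this PR), the sketch below publishes weights into a shared buffer and sends only a small handle, which the receiver uses to overwrite its existing buffer in place. It deliberately swaps in Python's standard-library `multiprocessing.shared_memory` for both ZMQ and CUDA IPC handles, and runs in a single process for brevity. The names `publish_weights`, `apply_update`, and the handle layout are hypothetical.

```python
from array import array
from multiprocessing import shared_memory


def publish_weights(values):
    """Trainer side (hypothetical): copy weights into shared memory.

    Returns the shared-memory object, which the caller must keep alive and
    later unlink, plus a small handle that is cheap to send over any
    message channel.
    """
    data = array("f", values)
    nbytes = len(data) * data.itemsize
    shm = shared_memory.SharedMemory(create=True, size=nbytes)
    shm.buf[:nbytes] = data.tobytes()
    handle = {"name": shm.name, "count": len(data), "typecode": "f"}
    return shm, handle


def apply_update(handle, target):
    """Worker side (hypothetical): attach via the handle, update in place."""
    shm = shared_memory.SharedMemory(name=handle["name"])
    try:
        incoming = array(handle["typecode"])
        # The OS may round the segment up to a page, so slice to the exact size.
        nbytes = handle["count"] * incoming.itemsize
        incoming.frombytes(bytes(shm.buf[:nbytes]))
        if len(incoming) != len(target):
            raise ValueError("weight count does not match the target buffer")
        # Slice assignment keeps the same array object, mirroring an
        # in-place weight update rather than a reallocation.
        target[:] = incoming
    finally:
        shm.close()


if __name__ == "__main__":
    weights = array("f", [0.0, 0.0, 0.0])  # buffer the worker already owns
    shm, handle = publish_weights([1.0, 2.0, 3.0])
    try:
        apply_update(handle, weights)
    finally:
        shm.close()
        shm.unlink()
    print(list(weights))
```

In a real colocated setup the handle would travel over a socket and refer to GPU memory; the point here is only that the message stays small while the payload is read from shared storage.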
Signed-off-by: huangweixiao <huangweixiao@msh.team>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Great work!
Signed-off-by: huangweixiao <huangweixiao@msh.team> Signed-off-by: youkaichao <youkaichao@gmail.com> Co-authored-by: youkaichao <youkaichao@gmail.com>
Signed-off-by: huangweixiao <huangweixiao@msh.team> Signed-off-by: youkaichao <youkaichao@gmail.com> Co-authored-by: youkaichao <youkaichao@gmail.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>
Purpose
Resolve #24163
Test Plan
Test Result
rlhf_colocate.py is executed successfully.