-
Notifications
You must be signed in to change notification settings - Fork 825
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix scatter ops eager global bug and add test #7807
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
甚好
from oneflow.test_utils.automated_test_util import * | ||
|
||
|
||
@autotest(n=10, auto_backward=True, check_graph=False) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
说到这个次数,现在CI用的都是10次吗?
我记得上次晟航说需要加速CI,是不是说1次或者3次就够了呀? @hjchen2
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
因为里面有多个 random_sbp,1 次或 3 次覆盖不够,这个算子很快,10 次应该正好能测的全且速度不慢
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
他这里可以循环10次,因为调用这个函数的地方只对placement进行了迭代,迭代次数是比较少的。
CI failed when running job: cuda-module-distributed-rank-1. PR label automerge has been removed |
CI failed when running job: cuda-module-distributed-rank-1. PR label automerge has been removed |
View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/7807/ |
CI failed when running job: cuda-module. PR label automerge has been removed |
View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/7807/ |
Speed stats:
|
修复 ScatterAdd、ScatterUpdate、ScatterScalarUpdate sbp 推导的 bug,添加相关测试。