source op support s and fixed generator bug #7571
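Running ./test_consistent_arange.py fails with the error: Running ./test_consistent_randperm.py fails with the error: For reference, the torch.arange documentation: https://pytorch.org/docs/stable/generated/torch.arange.html |
After merging the fix-all_Sbp4ArgNameAndIndex_bug branch, running test_consistent_randperm.py still fails with the error: Running test_consistent_arange.py fails with the error: |
The second failure looks like it is caused by the tensor not being splittable. The first seems to come from another call to the SbpParallel4ArgNameAndIndex interface somewhere; once that call is located, it should not be hard to fix. |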
View latest API docs preview at: https://staging.oneflow.info/docs/Oneflow-Inc/oneflow/pr/7571/ |
CI failed when running job: cpu-module. PR label automerge has been removed |
CI failed when running job: cuda-benchmark. PR label automerge has been removed |
CI failed when running job: cuda-module. PR label automerge has been removed |
FAILED python/oneflow/test/modules/test_one_embedding_ftrl.py::TestOptimizers::test_ftrl |
It runs without problems locally. |
Purpose of this PR
Make random ops support global tensor consistency.
The approach above was worked out in a meeting with xiaoyu and yinggang.
fixed: https://github.com/Oneflow-Inc/OneTeam/issues/1167
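
As a rough illustration of what "consistency" means for a random op on a global tensor, here is a minimal pure-Python sketch, not OneFlow's implementation. It assumes every rank shares the same seed and that the tensor is split evenly along dimension 0. The names `global_randperm` and `local_shard` are hypothetical and exist only for this example.

```python
import random


def global_randperm(n, seed):
    """Return a permutation of range(n) that depends only on (n, seed).

    Hypothetical helper: every rank calling this with the same arguments
    gets the same list, which is the property a consistent random op needs.
    """
    rng = random.Random(seed)  # private generator, no shared global state
    perm = list(range(n))
    rng.shuffle(perm)
    return perm


def local_shard(n, seed, rank, world_size):
    """Return the slice of the global permutation owned by `rank`.

    Assumption for this sketch: an even split along dimension 0, so n must
    be divisible by world_size.
    """
    if n % world_size != 0:
        raise ValueError("this sketch only handles evenly divisible sizes")
    chunk = n // world_size
    perm = global_randperm(n, seed)
    return perm[rank * chunk:(rank + 1) * chunk]


if __name__ == "__main__":
    n, seed, world_size = 8, 1234, 4
    shards = [local_shard(n, seed, r, world_size) for r in range(world_size)]
    # Concatenating the per-rank shards reproduces the global permutation.
    flat = [x for shard in shards for x in shard]
    print(flat == global_randperm(n, seed))
```

If each rank instead drew from its own differently seeded generator, the concatenated shards would generally not form a valid permutation, which is the kind of inconsistency a test such as test_consistent_randperm.py is presumably meant to catch.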