Description
Command: python -m oneflow.distributed.launch --nproc_per_node 2 train.py --batch 64 --data data/mydata.yaml --device 0,1
Environment: Python 3.9 + cu117 + OneFlow 0.9.0, 2080 Ti
Traceback (most recent call last):
  File "/home/liaozhongke/deeplearning/one-yolov5/train.py", line 672, in <module>
    main(opt)
  File "/home/liaozhongke/deeplearning/one-yolov5/train.py", line 568, in main
    train(opt.hyp, opt, device, callbacks)
  File "/home/liaozhongke/deeplearning/one-yolov5/train.py", line 145, in train
    with torch_distributed_zero_first(LOCAL_RANK):
  File "/home/liaozhongke/miniconda3/envs/oneflow/lib/python3.9/contextlib.py", line 119, in __enter__
    return next(self.gen)
  File "/home/liaozhongke/deeplearning/one-yolov5/utils/torch_utils.py", line 87, in torch_distributed_zero_first
    dist.barrier()
AttributeError: module 'oneflow.distributed' has no attribute 'barrier'
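
For reference, the failure pattern can be reproduced without OneFlow. The sketch below is not the one-yolov5 implementation. It is a hypothetical zero_first context manager, modeled on what the traceback suggests torch_distributed_zero_first does, that takes the distributed module as an argument and checks for barrier up front. The names zero_first and dist_module are assumptions made for illustration, and the sketch does not say which OneFlow call, if any, should replace dist.barrier().

```python
from contextlib import contextmanager
from types import SimpleNamespace


@contextmanager
def zero_first(local_rank, dist_module):
    """Let local rank 0 run the body first while the other ranks wait.

    Hypothetical sketch: dist_module stands in for a module such as
    oneflow.distributed; only a zero-argument barrier() is assumed.
    """
    barrier = getattr(dist_module, "barrier", None)
    # Fail early with a readable message instead of an AttributeError
    # raised from inside the context manager.
    if local_rank != -1 and not callable(barrier):
        raise RuntimeError(
            f"{getattr(dist_module, '__name__', dist_module)!r} provides no "
            "barrier(); distributed synchronization is unavailable"
        )
    if local_rank not in (-1, 0):
        barrier()  # non-zero ranks wait here until rank 0 finishes the body
    yield
    if local_rank == 0:
        barrier()  # rank 0 releases the waiting ranks


if __name__ == "__main__":
    # A stand-in module without barrier, mimicking the reported situation.
    fake_dist = SimpleNamespace(__name__="fake_dist")
    try:
        with zero_first(1, fake_dist):
            pass
    except RuntimeError as err:
        print(err)
```

With local_rank set to -1 (single-process training) the check is skipped, so the missing attribute only matters when the script is started through a distributed launcher, as in the command above.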