disable cuda_h2d stream #6020

lixinqi · 2021-08-24T04:34:51Z

背景介绍：https://github.com/Oneflow-Inc/OneTeam/issues/557

chengtbf · 2021-08-25T05:21:25Z

oneflow/core/framework/device.cpp

@@ -84,7 +84,7 @@ Maybe<const std::string&> Device::of_type() const {
 Maybe<const std::string&> GetLocalCallInstructionName(const std::string& type) {
  static const HashMap<std::string, std::string> type2instr_name{
      {"cpu", "cpu.LocalCallOpKernel"},           {"cuda", "gpu.LocalCallOpKernel"},
-      {"gpu", "gpu.LocalCallOpKernel"},           {"cuda_h2d", "cuda_h2d.LocalCallOpKernel"},
+      {"gpu", "gpu.LocalCallOpKernel"},           {"cuda_h2d", "gpu.LocalCallOpKernel"},


需要加一行注释么

…low into disable_cuda_h2d_stream

github-actions · 2021-08-25T12:23:59Z

Speed stats:

GPU Name: GeForce GTX 1080 

PyTorch resnet50 time: 140.6ms (= 7028.8ms / 50, input_shape=[16, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 128.3ms (= 6415.0ms / 50, input_shape=[16, 3, 224, 224], backward is enabled)
Relative speed: 1.10 (= 140.6ms / 128.3ms)

PyTorch resnet50 time: 83.7ms (= 4185.2ms / 50, input_shape=[8, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 74.7ms (= 3733.7ms / 50, input_shape=[8, 3, 224, 224], backward is enabled)
Relative speed: 1.12 (= 83.7ms / 74.7ms)

PyTorch resnet50 time: 56.4ms (= 2820.5ms / 50, input_shape=[4, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 47.8ms (= 2391.7ms / 50, input_shape=[4, 3, 224, 224], backward is enabled)
Relative speed: 1.18 (= 56.4ms / 47.8ms)

PyTorch resnet50 time: 47.0ms (= 2350.9ms / 50, input_shape=[2, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 38.3ms (= 1917.1ms / 50, input_shape=[2, 3, 224, 224], backward is enabled)
Relative speed: 1.23 (= 47.0ms / 38.3ms)

PyTorch resnet50 time: 42.9ms (= 2142.7ms / 50, input_shape=[1, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 42.2ms (= 2108.6ms / 50, input_shape=[1, 3, 224, 224], backward is enabled)
Relative speed: 1.02 (= 42.9ms / 42.2ms)

This reverts commit d847e11.

* async launched allreduce * ReleaseTensor instruction per stream * Revert "disable cuda_h2d stream (#6020)" This reverts commit d847e11. * restore "/ world_size" Signed-off-by: daquexian <daquexian566@gmail.com> * add soft sync before release tensor and local call Signed-off-by: daquexian <daquexian566@gmail.com> * add need_soft_sync_stream, only get producer value when last_used_device is not none Signed-off-by: daquexian <daquexian566@gmail.com> * refine Signed-off-by: daquexian <daquexian566@gmail.com> * fix bug Signed-off-by: daquexian <daquexian566@gmail.com> * remove need_soft_sync_stream table Signed-off-by: daquexian <daquexian566@gmail.com> * fix comments Signed-off-by: daquexian <daquexian566@gmail.com> * auto format by CI * update ddp speed test threshold Signed-off-by: daquexian <daquexian566@gmail.com> Co-authored-by: lixinqi <lixinqi0703106@163.com> Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com> Co-authored-by: oneflow-ci-bot <ci-bot@oneflow.org>

disable cuda_h2d stream

84baf2d

lixinqi requested review from chengtbf, daquexian and oneflow-ci-bot August 24, 2021 06:21

lixinqi added automerge bug eager enhancement system labels Aug 24, 2021

Merge branch 'master' into disable_cuda_h2d_stream

e542324

daquexian approved these changes Aug 24, 2021

View reviewed changes

oneflow-ci-bot removed their request for review August 24, 2021 06:58

Merge branch 'master' into disable_cuda_h2d_stream

8309356

oneflow-ci-bot requested review from oneflow-ci-bot and removed request for oneflow-ci-bot August 24, 2021 06:58

Merge branch 'master' into disable_cuda_h2d_stream

334b323

oneflow-ci-bot self-requested a review August 24, 2021 10:20

Merge branch 'master' into disable_cuda_h2d_stream

ef92a7e

oneflow-ci-bot requested review from oneflow-ci-bot and removed request for oneflow-ci-bot August 24, 2021 11:29

Merge branch 'master' into disable_cuda_h2d_stream

664f8fe

oneflow-ci-bot requested review from oneflow-ci-bot and removed request for oneflow-ci-bot August 24, 2021 15:43

Merge branch 'master' into disable_cuda_h2d_stream

09fcba9

oneflow-ci-bot self-requested a review August 24, 2021 17:23

Merge branch 'master' into disable_cuda_h2d_stream

79425f4

oneflow-ci-bot requested review from oneflow-ci-bot and removed request for oneflow-ci-bot August 24, 2021 18:23

Merge branch 'master' into disable_cuda_h2d_stream

26698f5

Merge branch 'master' into disable_cuda_h2d_stream

cede374

oneflow-ci-bot self-requested a review August 24, 2021 23:53

Merge branch 'master' into disable_cuda_h2d_stream

0c5f1fc

oneflow-ci-bot requested review from oneflow-ci-bot and removed request for oneflow-ci-bot August 25, 2021 01:32

Merge branch 'master' into disable_cuda_h2d_stream

b291ca8

oneflow-ci-bot self-requested a review August 25, 2021 05:20

chengtbf reviewed Aug 25, 2021

View reviewed changes

Merge branch 'master' into disable_cuda_h2d_stream

e94d64d

oneflow-ci-bot requested review from oneflow-ci-bot and removed request for oneflow-ci-bot August 25, 2021 06:30

lixinqi and others added 4 commits August 25, 2021 14:54

Merge branch 'master' into disable_cuda_h2d_stream

1d84ad1

gpu.LocalCallOpKernel is shared between device and device

ec2ee4a

Merge branch 'disable_cuda_h2d_stream' of github.com:Oneflow-Inc/onef…

dd226f4

…low into disable_cuda_h2d_stream

Merge branch 'master' into disable_cuda_h2d_stream

c7ad40e

oneflow-ci-bot requested review from oneflow-ci-bot and removed request for oneflow-ci-bot August 25, 2021 08:00

Merge branch 'master' into disable_cuda_h2d_stream

f4e39d0

oneflow-ci-bot requested review from oneflow-ci-bot and removed request for oneflow-ci-bot August 25, 2021 09:39

Merge branch 'master' into disable_cuda_h2d_stream

9eff31e

oneflow-ci-bot self-requested a review August 25, 2021 11:25

oneflow-ci-bot merged commit d847e11 into master Aug 25, 2021

oneflow-ci-bot deleted the disable_cuda_h2d_stream branch August 25, 2021 12:32

oneflow-ci-bot removed their request for review August 25, 2021 12:32

daquexian added a commit that referenced this pull request Sep 13, 2021

Revert "disable cuda_h2d stream (#6020)"

fbed6a5

This reverts commit d847e11.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

disable cuda_h2d stream #6020

disable cuda_h2d stream #6020

lixinqi commented Aug 24, 2021 •

edited

Loading

chengtbf Aug 25, 2021

lixinqi Aug 25, 2021

github-actions bot commented Aug 25, 2021

disable cuda_h2d stream #6020

disable cuda_h2d stream #6020

Conversation

lixinqi commented Aug 24, 2021 • edited Loading

chengtbf Aug 25, 2021

Choose a reason for hiding this comment

lixinqi Aug 25, 2021

Choose a reason for hiding this comment

github-actions bot commented Aug 25, 2021

lixinqi commented Aug 24, 2021 •

edited

Loading