
Fea/nn graph/forward graph #5516

Merged
merged 42 commits into master from fea/nn_graph/forward_graph on Jul 21, 2021

Conversation

@strint (Contributor) commented Jul 15, 2021

Forward job/graph (a toy sketch of these steps follows the list):

  • add input
  • add variable/buffer
  • add output
  • add user op
  • split graph/block/optimizer into 3 files
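
To make the four steps concrete, here is a toy, self-contained sketch of recording input/variable/user/output ops in the order they appear in the job proto below. This is not OneFlow's implementation; every name in the snippet is made up for illustration.

# Toy sketch only: not OneFlow internals; all names are hypothetical.
from dataclasses import dataclass, field
from typing import List

@dataclass
class OpRecord:
    name: str
    kind: str                       # "input" | "variable" | "user" | "output"
    inputs: List[str] = field(default_factory=list)

class ToyForwardGraph:
    def __init__(self):
        self.ops: List[OpRecord] = []

    def add_input(self, name):
        self.ops.append(OpRecord(name, "input"))
        return f"{name}/out"                    # logical blob name

    def add_variable(self, name):               # covers Parameter and Buffer
        self.ops.append(OpRecord(name, "variable"))
        return f"{name}/out"

    def add_user_op(self, name, op_type, *in_lbns):
        self.ops.append(OpRecord(name, "user:" + op_type, list(in_lbns)))
        return f"{name}/out_0"

    def add_output(self, name, in_lbn):
        self.ops.append(OpRecord(name, "output", [in_lbn]))

# Replay the first few ops of the proto below:
g = ToyForwardGraph()
x = g.add_input("input_0")
w = g.add_variable("m.layer.weight")
h = g.add_user_op("m.layer-matmul_0", "matmul", x, w)
h = g.add_user_op("m.layer.relu-relu_1", "relu", h)
g.add_output("output_0", h)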

Test example (a hypothetical sketch of CustomModule/CustomGraph follows this list):

  • multiple inputs and multiple outputs
  • with Parameter and Buffer
  • with three kinds of user op: relu/matmul/flatten
  • both the Tensor and the Module moved to cuda via to()
  • nested Modules
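
CustomModule and CustomGraph come from the PR's test and are not shown in this description. A hypothetical reconstruction, inferred from the repr and proto output below (and assuming the nn.Graph base class with a build method), might look like:

import oneflow as flow
import oneflow.nn as nn

# Hypothetical reconstruction, inferred from the repr/proto below;
# the PR's actual test code may differ.
class SubModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.weight = nn.Parameter(flow.Tensor(6, 6))  # m.layer.weight
        self.relu = nn.ReLU()                          # m.layer.relu

    def forward(self, x, y):
        x = flow.matmul(x, self.weight)   # m.layer-matmul_0
        x = self.relu(x)                  # m.layer.relu-relu_1
        y = self.relu(y)                  # m.layer.relu-relu_2
        return x, y

class CustomModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.layer = SubModule()
        self.register_buffer("dummy_buff", flow.Tensor(6, 8))  # m.dummy_buff

    def forward(self, x, y):
        x, y = self.layer(x, y)
        x = flow.flatten(x, start_dim=1)     # m-flatten_3
        x = flow.matmul(x, self.dummy_buff)  # m-matmul_4
        return x, y

class CustomGraph(nn.Graph):
    def __init__(self, module):
        super().__init__()
        self.m = module

    def build(self, x, y):
        return self.m(x, y)
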
m = CustomModule()
m.to("cuda")
g = CustomGraph(m)

x = flow.Tensor(6, 6)
flow.nn.init.uniform_(x, a=-1.0, b=1.0)
x = x.to("cuda")

y = flow.Tensor(10, 10)
flow.nn.init.uniform_(y, a=-1.0, b=1.0)
y = y.to("cuda")

z, a = g._compile(x, y)  # internal compile entry used by this PR's test; builds the graph and returns the outputs

print("graph repr: ", repr(g))
print("graph proto: ", g._graph_proto)

Output:
graph repr:

(CustomGraph_0:CustomGraph:GRAPH): (
  (m:CustomModule:MODULE): (
    (m.layer:SubModule:MODULE): (
      (m.layer.relu:ReLU:MODULE): ()
      (m.layer.weight:Parameter:PARAMETER): ()
    )
    (m.dummy_buff:Tensor:BUFFER): ()
  )
)

graph proto:

net {
  op {
    name: "input_0"
    device_tag: "gpu"
    scope_symbol_id: 4611686018427461631
    input_conf {
      out: "out"
      blob_conf {
        shape {
          dim: 6
          dim: 6
        }
        data_type: kFloat
        is_dynamic: true
        parallel_distribution {
          sbp_parallel {
            broadcast_parallel {
            }
          }
        }
      }
    }
  }
  op {
    name: "input_1"
    device_tag: "gpu"
    scope_symbol_id: 4611686018427461631
    input_conf {
      out: "out"
      blob_conf {
        shape {
          dim: 10
          dim: 10
        }
        data_type: kFloat
        is_dynamic: true
        parallel_distribution {
          sbp_parallel {
            broadcast_parallel {
            }
          }
        }
      }
    }
  }
  op {
    name: "m.layer.weight"
    device_tag: "gpu"
    scope_symbol_id: 4611686018427473919
    variable_conf {
      out: "out"
      shape {
        dim: 6
        dim: 6
      }
      data_type: kFloat
      initializer {
        empty_conf {
        }
      }
    }
  }
  op {
    name: "m.layer-matmul_0"
    device_tag: "gpu"
    scope_symbol_id: 4611686018427469823
    user_conf {
      op_type_name: "matmul"
      input {
        key: "a"
        value {
          s: "input_0/out"
        }
      }
      input {
        key: "b"
        value {
          s: "m.layer.weight/out"
        }
      }
      output {
        key: "out"
        value {
          s: "m.layer-matmul_0/out_0"
        }
      }
      attr {
        key: "alpha"
        value {
          at_double: 1.0
        }
      }
      attr {
        key: "transpose_a"
        value {
          at_bool: false
        }
      }
      attr {
        key: "transpose_b"
        value {
          at_bool: false
        }
      }
    }
  }
  op {
    name: "m.layer.relu-relu_1"
    device_tag: "gpu"
    scope_symbol_id: 4611686018427478015
    user_conf {
      op_type_name: "relu"
      input {
        key: "in"
        value {
          s: "m.layer-matmul_0/out_0"
        }
      }
      output {
        key: "out"
        value {
          s: "m.layer.relu-relu_1/out_0"
        }
      }
    }
  }
  op {
    name: "m.layer.relu-relu_2"
    device_tag: "gpu"
    scope_symbol_id: 4611686018427478015
    user_conf {
      op_type_name: "relu"
      input {
        key: "in"
        value {
          s: "input_1/out"
        }
      }
      output {
        key: "out"
        value {
          s: "m.layer.relu-relu_2/out_0"
        }
      }
    }
  }
  op {
    name: "m-flatten_3"
    device_tag: "gpu"
    scope_symbol_id: 4611686018427465727
    user_conf {
      op_type_name: "flatten"
      input {
        key: "in"
        value {
          s: "m.layer.relu-relu_1/out_0"
        }
      }
      output {
        key: "out"
        value {
          s: "m-flatten_3/out_0"
        }
      }
      attr {
        key: "end_dim"
        value {
          at_int32: -1
        }
      }
      attr {
        key: "start_dim"
        value {
          at_int32: 1
        }
      }
    }
  }
  op {
    name: "m.dummy_buff"
    device_tag: "gpu"
    scope_symbol_id: 4611686018427482111
    variable_conf {
      out: "out"
      shape {
        dim: 6
        dim: 8
      }
      data_type: kFloat
      initializer {
        empty_conf {
        }
      }
      trainable: false
    }
  }
  op {
    name: "m-matmul_4"
    device_tag: "gpu"
    scope_symbol_id: 4611686018427465727
    user_conf {
      op_type_name: "matmul"
      input {
        key: "a"
        value {
          s: "m-flatten_3/out_0"
        }
      }
      input {
        key: "b"
        value {
          s: "m.dummy_buff/out"
        }
      }
      output {
        key: "out"
        value {
          s: "m-matmul_4/out_0"
        }
      }
      attr {
        key: "alpha"
        value {
          at_double: 1.0
        }
      }
      attr {
        key: "transpose_a"
        value {
          at_bool: false
        }
      }
      attr {
        key: "transpose_b"
        value {
          at_bool: false
        }
      }
    }
  }
  op {
    name: "output_0"
    device_tag: "gpu"
    scope_symbol_id: 4611686018427461631
    output_conf {
      in: "m-matmul_4/out_0"
      out: "out"
      blob_conf {
        shape {
          dim: 6
          dim: 8
        }
        data_type: kFloat
        is_dynamic: false
        parallel_distribution {
          sbp_parallel {
            broadcast_parallel {
            }
          }
        }
      }
    }
  }
  op {
    name: "output_1"
    device_tag: "gpu"
    scope_symbol_id: 4611686018427461631
    output_conf {
      in: "m.layer.relu-relu_2/out_0"
      out: "out"
      blob_conf {
        shape {
          dim: 10
          dim: 10
        }
        data_type: kFloat
        is_dynamic: false
        parallel_distribution {
          sbp_parallel {
            broadcast_parallel {
            }
          }
        }
      }
    }
  }
}
placement {
  placement_group {
    op_set {
      op_name: "input_0"
      op_name: "input_1"
      op_name: "m.layer.weight"
      op_name: "m.layer-matmul_0"
      op_name: "m.layer.relu-relu_1"
      op_name: "m.layer.relu-relu_2"
      op_name: "m-flatten_3"
      op_name: "m.dummy_buff"
      op_name: "m-matmul_4"
      op_name: "output_0"
      op_name: "output_1"
    }
    parallel_conf {
      device_name: "0:0-3"
      device_tag: "gpu"
      hierarchy {
        dim: 4
      }
    }
  }
  blob_placement_group {
    lbi {
      op_name: "input_0"
      blob_name: "out"
    }
    lbi {
      op_name: "input_1"
      blob_name: "out"
    }
    lbi {
      op_name: "m.layer.weight"
      blob_name: "out"
    }
    lbi {
      op_name: "m.layer-matmul_0"
      blob_name: "out_0"
    }
    lbi {
      op_name: "m.layer.relu-relu_1"
      blob_name: "out_0"
    }
    lbi {
      op_name: "m.layer.relu-relu_2"
      blob_name: "out_0"
    }
    lbi {
      op_name: "m-flatten_3"
      blob_name: "out_0"
    }
    lbi {
      op_name: "m.dummy_buff"
      blob_name: "out"
    }
    lbi {
      op_name: "m-matmul_4"
      blob_name: "out_0"
    }
    lbi {
      op_name: "output_0"
      blob_name: "out"
    }
    lbi {
      op_name: "output_1"
      blob_name: "out"
    }
    parallel_conf {
      device_name: "0:0-3"
      device_tag: "gpu"
      hierarchy {
        dim: 4
      }
    }
  }
}
job_conf {
  job_name: "CustomGraph_0"
  predict_conf {
  }
}
job_parallel_view_conf {
}

@strint marked this pull request as ready for review July 16, 2021 08:26
@strint requested review from chengtbf and leaves-zwx July 16, 2021 08:26
@strint requested a review from oneflow-ci-bot July 21, 2021 09:44
@chengtbf mentioned this pull request Jul 21, 2021
@strint requested review from oneflow-ci-bot and removed request for oneflow-ci-bot July 21, 2021 11:10
@github-actions

CI failed, removing label automerge

@oneflow-ci-bot removed their request for review July 21, 2021 12:17
@github-actions

Speed stats:
GPU Name: GeForce GTX 1080 

PyTorch resnet50 time: 140.4ms (= 4212.8ms / 30, input_shape=[16, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 126.1ms (= 3784.3ms / 30, input_shape=[16, 3, 224, 224], backward is enabled)
Relative speed: 1.11 (= 140.4ms / 126.1ms)

PyTorch resnet50 time: 85.1ms (= 2554.0ms / 30, input_shape=[8, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 73.6ms (= 2207.0ms / 30, input_shape=[8, 3, 224, 224], backward is enabled)
Relative speed: 1.16 (= 85.1ms / 73.6ms)

PyTorch resnet50 time: 62.4ms (= 1870.7ms / 30, input_shape=[4, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 48.9ms (= 1467.6ms / 30, input_shape=[4, 3, 224, 224], backward is enabled)
Relative speed: 1.27 (= 62.4ms / 48.9ms)

PyTorch resnet50 time: 49.1ms (= 1473.9ms / 30, input_shape=[2, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 46.2ms (= 1386.8ms / 30, input_shape=[2, 3, 224, 224], backward is enabled)
Relative speed: 1.06 (= 49.1ms / 46.2ms)

PyTorch resnet50 time: 43.3ms (= 1299.6ms / 30, input_shape=[1, 3, 224, 224], backward is enabled)
OneFlow resnet50 time: 47.3ms (= 1419.5ms / 30, input_shape=[1, 3, 224, 224], backward is enabled)
Relative speed: 0.92 (= 43.3ms / 47.3ms)

@oneflow-ci-bot removed their request for review July 21, 2021 15:11
@oneflow-ci-bot merged commit ab8aab8 into master Jul 21, 2021
@oneflow-ci-bot deleted the fea/nn_graph/forward_graph branch July 21, 2021 15:12