
Feature/transpiler split tensor to multiple pservers #7249


Merged
16 commits merged into PaddlePaddle:develop from transpiler_split_tensor on Jan 15, 2018

Conversation

typhoonzero (Contributor) commented on Jan 5, 2018

Fix #7179

TODO:

  • Support SelectedRows; this will be done in the next PR.
  • Add a unit test for this transpiler.

This PR uses a transpiler to split a variable before sending it to multiple parameter servers, like:

split - - send - -  concat
      | - send - |   
      | - send - |
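The intended behavior is roughly the following round trip (a minimal numpy sketch; the helper name split_dense_tensor and the even-row split policy are assumptions for illustration, not the transpiler's actual API):

```python
# Minimal sketch of the split/send/concat round trip described above.
# The helper name and the even-split policy are illustrative assumptions,
# not the transpiler's actual API.
import numpy as np

def split_dense_tensor(tensor, num_pservers):
    """Split a tensor into num_pservers blocks along dim[0].

    Rows are distributed as evenly as possible; the first
    (rows % num_pservers) blocks receive one extra row.
    """
    rows = tensor.shape[0]
    base, extra = divmod(rows, num_pservers)
    blocks, offset = [], 0
    for i in range(num_pservers):
        size = base + (1 if i < extra else 0)
        blocks.append(tensor[offset:offset + size])
        offset += size
    return blocks

# Round trip: split on the trainer, "send" each block to a pserver,
# then concat the (updated) blocks back into the full parameter.
param = np.arange(20, dtype=np.float32).reshape(10, 2)
blocks = split_dense_tensor(param, 3)      # block shapes: (4, 2), (3, 2), (3, 2)
restored = np.concatenate(blocks, axis=0)  # same shape and values as param
assert np.array_equal(param, restored)
```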

@typhoonzero typhoonzero changed the title [] transpiler split tensor to multiple pservers [WIP] transpiler split tensor to multiple pservers Jan 5, 2018
@typhoonzero typhoonzero closed this Jan 5, 2018
@typhoonzero typhoonzero reopened this Jan 9, 2018
@typhoonzero typhoonzero changed the title [WIP] transpiler split tensor to multiple pservers [WIP] Feature/transpiler split tensor to multiple pservers Jan 11, 2018
@typhoonzero typhoonzero changed the title [WIP] Feature/transpiler split tensor to multiple pservers Feature/transpiler split tensor to multiple pservers Jan 12, 2018
return param_grad_map


class DistributeTranspiler:
Contributor commented:

I'm confused: why do we need two DistributeTranspiler classes?

Contributor commented:

Rename this class to SimpleDistributeTranspiler and add it to https://github.com/PaddlePaddle/Paddle/blob/develop/python/paddle/v2/fluid/__init__.py#L25

Yancey0623 (Contributor) left a review:

LGTM!!

@typhoonzero typhoonzero merged commit 8d253e4 into PaddlePaddle:develop Jan 15, 2018
@typhoonzero typhoonzero deleted the transpiler_split_tensor branch January 15, 2018 07:03
"""
We may need to split dense tensor to one or several blocks and put
them equally onto parameter server. One block is a sub-tensor
aligned by dim[0] of the tensor.
Contributor commented:

Is it necessary to align by dim[0]? Sharding seems to have nothing to do with the shape of the LoD tensor. (Is this actually for SelectedRows tensors?)

typhoonzero (Contributor, Author) replied:

Indeed, that's true; we only need to know the original shape when concatenating the updated parameters.
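To illustrate that point: if the original shape is recorded, the blocks need not be aligned to dim[0] at all. A minimal numpy sketch (the flatten/split/reshape scheme here is an assumption for illustration, not what the transpiler implements):

```python
# If the original shape is recorded, blocks can be cut anywhere in the
# flattened tensor; only the final reshape needs the shape. This scheme
# is an illustrative assumption, not the transpiler's implementation.
import numpy as np

param = np.arange(12, dtype=np.float32).reshape(3, 4)
orig_shape = param.shape               # recorded so concat can restore it

flat = param.ravel()
blocks = np.array_split(flat, 5)       # arbitrary block sizes, no dim[0] alignment

updated = np.concatenate(blocks).reshape(orig_shape)
assert np.array_equal(param, updated)
```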
