Closed
Description
- Implement various optimizer operators.
- SGD (done) and others listed in https://github.com/PaddlePaddle/Paddle/projects/22
- Add optimizer operators into BlockDesc.
- Python module
In the new framework with operators, we will use operators to build optimizer, and operators will be add to some block. There are some works to do:
- write a python wrapper of optimizer and provide proper interface for user to use.
- optimizer of multi GPU and multi machine is different then single device.
- because it will be a seperate stage to update the parameters.
- it will communicate with parameter server
there are three situation to be considered:
- single machine single device.
- single machine multiple devices.
- multiple machine.
Plan:
- 8/24/2017 do some survey and discussion about tensorflow, caffe2 and pytorch on the python side. [done] Optimizer Survey #3672
- 8/25/2017 do some survey and discussion about distribute optimizers.
- 8/25/2017 design doc
distribute related issue: #3656
Metadata
Metadata
Assignees
Labels
No labels