Skip to content

Optimizer Design and related Operator #3655

Closed
@jacquesqiao

Description

@jacquesqiao
  1. Implement various optimizer operators.
    1. SGD (done) and others listed in https://github.com/PaddlePaddle/Paddle/projects/22
  2. Add optimizer operators into BlockDesc.
  3. Python module

In the new framework with operators, we will use operators to build optimizer, and operators will be add to some block. There are some works to do:

  • write a python wrapper of optimizer and provide proper interface for user to use.
  • optimizer of multi GPU and multi machine is different then single device.
    • because it will be a seperate stage to update the parameters.
    • it will communicate with parameter server

there are three situation to be considered:

  1. single machine single device.
  2. single machine multiple devices.
  3. multiple machine.

Plan:

  • 8/24/2017 do some survey and discussion about tensorflow, caffe2 and pytorch on the python side. [done] Optimizer Survey #3672
  • 8/25/2017 do some survey and discussion about distribute optimizers.
  • 8/25/2017 design doc

distribute related issue: #3656

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions