Fluid new API scaffolding #10313
Conversation
    if path:
        self._load(path)

    def _load(self, path):
Why is _load defined as a private method while save is public?
You are right, this is not optimal. We will remove this class: #10313 (comment)
Removed params.py in #10354
class Params(object):
    def __init__(self, path=None):
        self.scope = core.Scope()
What is the relationship between a Params instance and this scope instance?
Params is a wrapper on top of scope, and provides additional functionality to save and load parameters.
We think it's not necessary, I will send a PR to remove it.
The new interface of the Trainer constructor will be:
class Trainer(object):
    def __init__(self, network_func, optimizer, param_path=None, scope=None, place=None):
The parameters of the trainer can be optionally initialized from the saved parameters in param_path, or taken from scope. If neither is provided, the parameters will be randomly initialized. The scope argument is necessary because we want to share the scope between the inferencer and the trainer; one use case is GAN.
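A minimal sketch of that initialization precedence (param_path first, then scope, else random init). The helper names `_load_params` and `_random_init` are hypothetical, and a plain dict stands in for core.Scope:

```python
# Sketch of the Trainer constructor's parameter-initialization order.
# Helper names are hypothetical; a plain dict stands in for core.Scope.
class Trainer(object):
    def __init__(self, network_func, optimizer, param_path=None,
                 scope=None, place=None):
        self.network_func = network_func
        self.optimizer = optimizer
        self.place = place
        # Reuse the caller's scope so it can be shared with an inferencer
        # (e.g. the GAN use case); otherwise create a fresh one.
        self.scope = scope if scope is not None else {}
        if param_path is not None:
            self._load_params(param_path)  # hypothetical helper
        elif scope is None:
            self._random_init()            # hypothetical helper

    def _load_params(self, path):
        # Placeholder for loading saved parameters from param_path.
        self.scope["init"] = "loaded:" + path

    def _random_init(self):
        # Placeholder for random parameter initialization.
        self.scope["init"] = "random"
```

If a scope is passed in, the trainer neither loads nor re-initializes, so parameters already living in the shared scope are kept.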
Removed params.py in #10354
    # reference: save_persistables in io.py
    pass

    def add_params(self, scope):
I don't quite understand here -- the method name is add_params, but the method comment says that it takes keys from the scope. Does it really add something?
We plan to remove this class, and add a merge method to core.Scope. It takes another scope and sets all the vars from that scope on itself. If vars with the same name are present, its own vars will be overridden.
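A toy model of those merge semantics, with plain dicts standing in for core.Scope (an assumption for illustration): every var from the other scope is set on the receiving one, and the other scope wins on a name clash.

```python
# Toy model of the proposed Scope.merge semantics, using dicts as a
# stand-in for core.Scope: vars from `other` are set on `target`;
# on a name clash, `other` overrides `target`'s existing var.
def merge(target, other):
    for name, var in other.items():
        target[name] = var
```

With these semantics the operation is equivalent to `target.update(other)` on a dict.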
For the Scope class, can we expose vars and make it public? That might be necessary to implement the merge function.
We could use LocalVarNames to do the iteration as well, so if we do want to hide vars_, we can still do that.
@jetfuel maybe we can implement merge in C++? What do you think?
Removed params.py in #10354
@helinwang Yes, it should be in C++. My approach is to use LocalVarNames and check the names one by one.
void Scope::merge(Scope* scope) {
  std::vector<std::string> names = scope->LocalVarNames();
  for (auto& name : names) {
    auto it = vars_.find(name);
    if (it == vars_.end()) {
      vars_[name] = scope->FindVar(name);
    }
  }
}
@jetfuel Thanks! Every var in the scope argument will be put into the receiving scope; if the receiving scope already has a var with the same name, it will be replaced. So I guess we can just do:
void Scope::merge(Scope* scope) {
  for (auto& name : scope->LocalVarNames()) {
    vars_[name] = scope->FindVar(name);
  }
}
Sorry the spec was different from what we discussed yesterday (yesterday's was to not replace if already exists).
class Event(object):
    BEGIN_EPOCH = 0
    END_EPOCH = 1
    BEGIN_STEP = 2
Is the "step" here "iteration"?
I think "step" is more commonly used in deep learning, while "iteration" is more often used in programming languages. Since this event is about training, I think the user would be more familiar with "step".
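A hypothetical sketch of how a user-supplied event_handler might dispatch on the Event constants from the scaffolding (the event class shapes here are an assumption, not the final API):

```python
# Hypothetical sketch of an event_handler dispatching on the Event
# constants from the scaffolding (BEGIN_EPOCH, END_EPOCH, BEGIN_STEP).
class Event(object):
    BEGIN_EPOCH = 0
    END_EPOCH = 1
    BEGIN_STEP = 2

class BeginEpochEvent(Event):
    def __init__(self):
        self.type = Event.BEGIN_EPOCH

def event_handler(event):
    # The trainer would invoke this callback at each training event.
    if event.type == Event.BEGIN_EPOCH:
        return "begin epoch"
    elif event.type == Event.END_EPOCH:
        return "end epoch"
    elif event.type == Event.BEGIN_STEP:
        return "begin step"
```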
        self.place = place
        # TODO(helin): support distributed training

    def train(self, reader, num_epochs, event_handler):
I don't quite remember how we define a pass (epoch) with our current reader design. If a reader reads gRPC requests, it would never end and would have no pass separations, right?
A call to the reader returns an iterator that contains a single pass of data. And yes, if a reader reads gRPC requests, it would never end and would have no pass separations.
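That contract can be sketched as follows (names are hypothetical): each call to the reader yields one pass over the data, so the training loop makes one call per epoch.

```python
# Sketch of the reader contract: calling the reader returns an iterator
# over a single pass of data, so num_epochs passes means num_epochs calls.
def make_reader(data):
    def reader():
        for sample in data:
            yield sample
    return reader

def train_loop(reader, num_epochs):
    seen = []
    for epoch in range(num_epochs):
        for sample in reader():  # one call == one full pass
            seen.append(sample)
    return seen
```

An endless gRPC-backed reader would simply never exhaust its iterator, which is why it has no natural pass boundary.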
        self.type = Event.BEGIN_EPOCH


class Trainer(object):
Is this a base class, which implies that we need derived classes like MultiGPUTrainer, MultiNodeTrainer, EDLTrainer, etc., or is it a plain class in which both single-node and multi-node training logic should be implemented?
The current plan is "a plain class in which both single-node and multi-node training logic are implemented"; the logic is controlled by an environment variable. Here is an example: #10316
The scaffolding defines the skeleton of each class, so we can start implementing in parallel.
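A hypothetical sketch of that "plain class, environment-variable controlled" design: one Trainer class selects single-node or distributed logic from an environment variable (the variable name `TRAINING_ROLE` and its values are assumptions for illustration).

```python
import os

# Hypothetical sketch: a single plain Trainer class picks local or
# distributed training logic from an environment variable. The variable
# name TRAINING_ROLE and its values are assumptions, not the final spec.
class Trainer(object):
    def _training_mode(self):
        # A cluster launcher would set this variable on each process;
        # without it, the trainer falls back to single-node training.
        if os.environ.get("TRAINING_ROLE") in ("PSERVER", "TRAINER"):
            return "distributed"
        return "local"
```

This keeps a single user-facing class while the launcher, not the user code, decides the execution mode.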