[rllib] Add multi-agent examples for hand-coded policy, centralized VF #4554

ericl · 2019-04-03T18:18:04Z

Also, fix some bugs in mixing TF and non-TF policy graphs, and remove the deprecated compute_apply.

Linter

I've run scripts/format.sh to lint the changes in this PR.

AmplabJenkins · 2019-04-03T18:23:43Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-Perf-Integration-PRB/343/
Test PASSed.

AmplabJenkins · 2019-04-03T18:46:44Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-Perf-Integration-PRB/346/
Test PASSed.

AmplabJenkins · 2019-04-03T19:00:12Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/13477/
Test FAILed.

AmplabJenkins · 2019-04-03T19:13:43Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/13479/
Test FAILed.

ericl · 2019-04-03T20:28:39Z

python/ray/rllib/evaluation/policy_evaluator.py

-                        continue
-                    info_out[pid], _ = (
-                        self.policy_map[pid].learn_on_batch(batch))
+                builder = None


Otherwise this crashes when TF is in use but a non-TF policy graph is mixed in.
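For readers following along, here is a minimal, self-contained sketch of the kind of dispatch this comment is describing. It is not the code from this PR: `HandCodedPolicy`, `GraphPolicy`, `RunBuilder`, `build_learn_on_batch`, and `learn_on_multi_agent_batch` are hypothetical stand-ins, and the only assumption carried over from the diff is that the builder starts out as `None` and is used only for policies that support deferred execution.

```python
class HandCodedPolicy:
    """Hypothetical non-TF policy: learns eagerly, no graph or session involved."""

    def learn_on_batch(self, batch):
        return {"num_samples": len(batch)}


class GraphPolicy:
    """Hypothetical stand-in for a TF-style policy that can defer its update."""

    def learn_on_batch(self, batch):
        return {"num_samples": len(batch)}

    def build_learn_on_batch(self, builder, batch):
        # Register the update with the builder instead of running it now.
        return builder.add(lambda: {"num_samples": len(batch), "deferred": True})


class RunBuilder:
    """Hypothetical helper that collects deferred updates and runs them later."""

    def __init__(self):
        self._pending = []

    def add(self, fn):
        self._pending.append(fn)
        return len(self._pending) - 1  # handle used to fetch the result

    def get(self, handle):
        return self._pending[handle]()


def learn_on_multi_agent_batch(policy_map, batches, tf_session_present):
    # Only create a builder when a TF session exists (assumption for this sketch).
    builder = RunBuilder() if tf_session_present else None
    info_out, deferred = {}, {}
    for pid, batch in batches.items():
        policy = policy_map[pid]
        if builder is not None and hasattr(policy, "build_learn_on_batch"):
            # Graph-style policy: defer through the builder.
            deferred[pid] = policy.build_learn_on_batch(builder, batch)
        else:
            # Non-TF policy (or no session): call it directly instead of
            # assuming every policy can go through the builder.
            info_out[pid] = policy.learn_on_batch(batch)
    for pid, handle in deferred.items():
        info_out[pid] = builder.get(handle)
    return info_out
```

The point of the per-policy check is that the presence of a TF session alone does not imply every policy in the map is a TF policy graph, so the fallback path has to be chosen per policy rather than once for the whole batch.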

AmplabJenkins · 2019-04-04T21:15:33Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-Perf-Integration-PRB/367/
Test PASSed.

AmplabJenkins · 2019-04-04T21:57:18Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/13549/
Test FAILed.

AmplabJenkins · 2019-04-05T05:35:37Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-Perf-Integration-PRB/383/
Test PASSed.

AmplabJenkins · 2019-04-05T06:05:12Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/13582/
Test FAILed.

python/ray/rllib/examples/policy_evaluator_custom_workflow.py

richardliaw

Consider adding the examples to the multi-node tests.

AmplabJenkins · 2019-04-06T23:41:56Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-Perf-Integration-PRB/387/
Test FAILed.

AmplabJenkins · 2019-04-07T00:13:32Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/13602/
Test FAILed.

AmplabJenkins · 2019-04-07T03:01:19Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-Perf-Integration-PRB/392/
Test PASSed.

AmplabJenkins · 2019-04-07T03:23:57Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/13612/
Test FAILed.

AmplabJenkins · 2019-04-07T19:22:19Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-Perf-Integration-PRB/394/
Test PASSed.

AmplabJenkins · 2019-04-07T21:39:44Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/13621/
Test FAILed.

AmplabJenkins · 2019-04-08T20:35:42Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-Perf-Integration-PRB/398/
Test PASSed.

AmplabJenkins · 2019-04-08T23:01:26Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/13641/
Test FAILed.

ericl · 2019-04-09T07:36:38Z

The lint failure is unrelated.

ericl added 3 commits April 3, 2019 11:11

ma

fc0e64a

random

fd7f617

wip

e0cd33b

ericl assigned richardliaw Apr 3, 2019

wip

9ababb9

ericl commented Apr 3, 2019

Update rllib-examples.rst

96fe6de

ericl added 5 commits April 4, 2019 22:25

metrics

dfeee31

Merge branch 'more-examples-ma' of github.com:ericl/ray into more-examples-ma

5249d9e

new example

c44e034

update

4925da3

lint

6e5bc54

richardliaw reviewed Apr 6, 2019

python/ray/rllib/examples/policy_evaluator_custom_workflow.py

richardliaw approved these changes Apr 6, 2019

ericl added 2 commits April 6, 2019 16:36

Merge remote-tracking branch 'upstream/master' into more-examples-ma

b6d68d6

test

c2a9599

lint

7a72ef0

Merge remote-tracking branch 'upstream/master' into more-examples-ma

0d79232

fix nie

ef74a07

fix

0b42456

ericl merged commit 4f46d3e into ray-project:master Apr 9, 2019
