[rllib] Add rock paper scissors multi-agent example #5336

ericl · 2019-08-01T01:05:14Z

What do these changes do?

This demonstrates running the following policies in competition:
(1) heuristic policy of repeating the same move
(2) heuristic policy of beating the last opponent move
(3) LSTM/feedforward PG policies
(4) LSTM policy with custom safety loss

Related issue number

Closes #4789

Linter

I've run scripts/format.sh to lint the changes in this PR.

AmplabJenkins · 2019-08-01T04:04:16Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15855/
Test PASSed.

AmplabJenkins · 2019-08-01T19:55:33Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15882/
Test FAILed.

AmplabJenkins · 2019-08-01T20:29:47Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15883/
Test PASSed.

ericl added 3 commits July 31, 2019 17:26

add example

7e90129

add to docs

35ae3e2

fix

818adbb

ericl assigned richardliaw Aug 1, 2019

legacy compat

7a9a007

ericl added the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label Aug 1, 2019

ericl added 4 commits August 1, 2019 10:24

Merge remote-tracking branch 'upstream/master' into better-ma-example

2a3ac00

Merge remote-tracking branch 'upstream/master' into better-ma-example

cc3995f

football

a28b8c2

warn if simple

7a73ab6

richardliaw approved these changes Aug 1, 2019

View reviewed changes

ericl merged commit 20450a4 into ray-project:master Aug 1, 2019

edoakes pushed a commit to edoakes/ray that referenced this pull request Aug 9, 2019

[rllib] Add rock paper scissors multi-agent example (ray-project#5336)

cb0dd24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[rllib] Add rock paper scissors multi-agent example #5336

[rllib] Add rock paper scissors multi-agent example #5336

ericl commented Aug 1, 2019

AmplabJenkins commented Aug 1, 2019

AmplabJenkins commented Aug 1, 2019

AmplabJenkins commented Aug 1, 2019

[rllib] Add rock paper scissors multi-agent example #5336

[rllib] Add rock paper scissors multi-agent example #5336

Conversation

ericl commented Aug 1, 2019

What do these changes do?

Related issue number

Linter

AmplabJenkins commented Aug 1, 2019

AmplabJenkins commented Aug 1, 2019

AmplabJenkins commented Aug 1, 2019