[DO NOT MERGE] New memory abstraction and AMRL implementation #4374
Proposed change(s)
New memory layer abstraction that lets a memory module define its own memory size (w.r.t. the "memory size" specified in the config). The ActorCritic then has a `memory_size` property which the Policy uses to set its internal `m_size`. This allows the actual `m_size` used during training to differ from what is specified in the YAML. We might want this, for instance, if `memory_size: 64` should mean the Policy has a memory size of 64 while the Critic network has its own memory size of 64 (128 total), so the Policy object stores both during training but only exports 64 for inference.

Also includes an implementation of AMRL (https://openreview.net/forum?id=Bkl7bREtDr) using this abstraction.
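Below is a minimal sketch of what such an abstraction could look like. This is not the code in this PR: class names (`MemoryModule`, `LSTMMemory`, `AMRLMaxMemory`, `ActorCriticWithMemory`) and the wiring are illustrative assumptions, and the AMRL-style layer only captures the max-aggregation idea from the paper (it omits details such as the straight-through gradient).

```python
# Illustrative sketch only -- class names and wiring are assumptions, not
# the actual ml-agents implementation in this PR.
import abc
import torch
from torch import nn


class MemoryModule(nn.Module, abc.ABC):
    """A memory layer that reports how much recurrent state it needs."""

    @property
    @abc.abstractmethod
    def memory_size(self) -> int:
        """Size of the flattened recurrent state this module stores."""


class LSTMMemory(MemoryModule):
    """Plain LSTM memory: stores hidden + cell state, i.e. 2 * hidden_size."""

    def __init__(self, input_size: int, hidden_size: int):
        super().__init__()
        self.hidden_size = hidden_size
        self.lstm = nn.LSTM(input_size, hidden_size, batch_first=True)

    @property
    def memory_size(self) -> int:
        return 2 * self.hidden_size

    def forward(self, x: torch.Tensor, memories: torch.Tensor):
        # memories: [batch, 2 * hidden] -> (h0, c0), each [1, batch, hidden]
        h0, c0 = torch.split(memories, self.hidden_size, dim=-1)
        out, (hn, cn) = self.lstm(
            x, (h0.unsqueeze(0).contiguous(), c0.unsqueeze(0).contiguous())
        )
        return out, torch.cat([hn.squeeze(0), cn.squeeze(0)], dim=-1)


class AMRLMaxMemory(MemoryModule):
    """Rough AMRL-style layer: LSTM outputs aggregated with a running
    element-wise max carried in the recurrent state. Only a sketch of the
    idea from the paper, not a faithful implementation."""

    def __init__(self, input_size: int, hidden_size: int):
        super().__init__()
        self.hidden_size = hidden_size
        self.lstm = LSTMMemory(input_size, hidden_size)

    @property
    def memory_size(self) -> int:
        # LSTM state (h and c) plus the running-max vector.
        return self.lstm.memory_size + self.hidden_size

    def forward(self, x: torch.Tensor, memories: torch.Tensor):
        lstm_mem, running_max = torch.split(
            memories, [self.lstm.memory_size, self.hidden_size], dim=-1
        )
        out, new_lstm_mem = self.lstm(x, lstm_mem)
        # Cumulative max over the sequence, seeded with the carried-over max.
        seeded = torch.cat([running_max.unsqueeze(1), out], dim=1)
        aggregated = torch.cummax(seeded, dim=1).values[:, 1:, :]
        new_mem = torch.cat([new_lstm_mem, aggregated[:, -1, :]], dim=-1)
        return aggregated, new_mem


class ActorCriticWithMemory(nn.Module):
    """Actor and critic each own a memory module; the Policy reads
    `memory_size` to size its internal m_size, which can therefore be
    larger than the `memory_size` value written in the YAML config."""

    def __init__(self, input_size: int, config_memory_size: int):
        super().__init__()
        # Each module reports memory_size == config_memory_size here,
        # since an LSTM stores both h and c.
        self.actor_memory = LSTMMemory(input_size, config_memory_size // 2)
        self.critic_memory = LSTMMemory(input_size, config_memory_size // 2)

    @property
    def memory_size(self) -> int:
        # Total state stored during training (e.g. 128 for memory_size: 64);
        # only the actor's portion would be exported for inference.
        return self.actor_memory.memory_size + self.critic_memory.memory_size
```

Under these assumptions, swapping `LSTMMemory` for `AMRLMaxMemory` changes the reported `memory_size` automatically, which is the point of letting each memory module declare its own size rather than hard-coding it from the config.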
Types of change(s)
Checklist
Other comments