Tags · rail-berkeley/rlkit

v0.2.1

Change sampling method from randint to choice in Replay and robustify…

… policy networks in SAC (#111)

* Introduced possibility to change alpha parameter

* Fix sum operation which causes trouble for more that two batch dimensions

* Replace randint with choice to avoid duplicates

* Added replace as an option to the replay buffer and a warning if desired behaviour is not possible

Aug 10, 2020
55ace41
zip
tar.gz

v0.2.0

Initial v0.2.0 code

Apr 6, 2019
ae49265
zip
tar.gz

v0.1.2

Saved version before v0.2 with RIG and HER

Apr 4, 2019
86db9c2
zip
tar.gz

v0.1

Initial version built off of pytorch v0.3

Oct 16, 2018
838ad1a
zip
tar.gz

v0.1.1

upgrade to MuJoCo 1.5

May 8, 2018
2c8561d
zip
tar.gz

v0.1.0

First tagged version

May 8, 2018
e9ea00a
zip
tar.gz

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.2.1

v0.2.0

v0.1.2

v0.1

v0.1.1

v0.1.0

Tags: rail-berkeley/rlkit

v0.2.1

v0.2.0

v0.1.2

v0.1

v0.1.1

v0.1.0