Change sampling method from randint to choice in Replay and robustify policy networks in SAC #111

Merged
vitchyr merged 5 commits into rail-berkeley:master from ksluck:master on Aug 10, 2020