Skip to content

v0.0.4

Compare
Choose a tag to compare
@puyuan1996 puyuan1996 released this 21 Feb 10:16
· 67 commits to main since this release

Enhancement

  1. add agent configurations & polish replay video saving method (#184)
  2. polish comments in worker files
  3. polish comments in tree search files (#185)
  4. rename mcts_mode to battle_mode_in_simulation_env, add sampled alphazero config for tictactoe (#179)
  5. polish redundant data squeeze operations (#177)
  6. polish the continuous action process in sez model
  7. polish bipedalwalker env

Fix

  1. fix completed value inf bug when zero exists in action_mask in gumbel muzero (#178)
  2. fix render settings when using gymnasium (#173)
  3. fix lstm_hidden_size in sampled_efficientzero_model.py
  4. fix action_mask in bipedalwalker_cont_disc_env, fix device bug in sampled efficientzero (#168)

Full Changelog: v0.0.3...v0.0.4

Contributors: @karroyan @HarryXuancy @puyuan1996 @zjowowen