-
Notifications
You must be signed in to change notification settings - Fork 129
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
polish(pu): polish efficiency and performance on atari and DMC, add m…
…uzero_segment_collector (#292) * polish(pu): polish quantize_state_hash and deepcopy * fix(pu): fix np.array dtype bug in buffer * polish(pu): use 0 deepcopy in kv_cache operation in collect/eval phase of unizero * polish(pu): use custom deepcopy for kv_cache * polish(pu): use value_array rather than value_list in compute_target_value * polish(pu): optimize compute_target_policy_non_re * polish(pu): optimize kv_caching update() * polish(pu): kv_cache_dict no to_cpu * polish(pu): optimize custom kv_cache copy * polish(pu): kv_cache_dict no to_cpu * feature(pu): add unizero ddp config * fix(pu): fix unizero ddp * sync code * polish(pu): use de kv_cacheepcopy only in recur_infer load * sync code * polish(pu): polish suz dmc config * sync code * polish(pu): use share_polol for kv_cache in recurrent_inference and use _copy rather than clone * polish(pu): all kv_cache copy use predefined share_pool * polish(pu): unuse decoder_net and lpips in ddp config * sync code * feature(pu): add dmc save_replay_gif option * sync code * polish(pu): polish sampled muzero ctree * test(pu): add sac cheetah config * fix(pu): fix render_image in dmc_env * fix(pu): fix reanalyze in sampled unizero * polish(pu): polish policy projector * feature(pu): add muzero_segment_collector.py * polish(pu): use uniform prior in ucb_score of suz mcts * fix(pu): fix self.action_mask_dict init bug * test(pu): use clamp0.9->1 * polish(pu): polish suz * fix(pu): fix muzero_segment_collector * fix(pu): uz target-value obs also use aug when use_aug=True * sync code * fix(pu): fix last_game_segment bug in muzero_segment_collector.py * fix(pu): one episode done then return in muzero_segment_collector.py * fix(pu): fix muzero_collector * polish(pu): polish unizero config and polish sample from segments * fix(pu): fix reanalyze in uz * polish(pu): add batch config and bash * polish(pu): polish uz configs * feature(pu): add unizero buffer_reanalyze variant * fix(pu): fix uz reanalyze_buffer * polish(pu): polish configs * feature(pu): add atari_muzero_segment_config * fix(pu): fix sampled_unizero reanalyze_policy * polish(pu):polish configs * polish(pu):polish suz configs * polish(pu):polish configs * fix(pu): fix root value in suz buffer * fix(pu): fix suz ctree * polish(pu): polish uz related configs, segment collector, train_entry * polish(pu): polish unizero world_model * polish(pu): polish reanalyze in buffer * fix(pu): fix entry import and nparray object bug in buffer * polish(pu): polish configs * polish(pu): polish configs * polish(pu): fix collector, polish configs * fix(pu): fix truncation segment sample in buffer * fix(pu): fix segment sample for uz in buffer * fix(pu): use origin buffer * fix(pu): fixvaluebugV8 * sync code * fix(pu): fix target action when calculating bootstrap value in unizero * fix(pu): fix target-action in sampled_unizero buffer * polish(pu): delete wrongly added files * polish(pu): polish entry/buffer/ctree, and fix index+1 bug in compute_target_reward_value * polish(pu): polish buffer and config * polish(pu): rename train_xxx_reanalyze to train_xxx_segment * polish(pu): polish world_model * polish(pu): polish entry comments * fix(pu): fix reward shape bug in dmc * fix(pu): polish sample_orig_reanalyze_batch and fix sample_orig_data due to child_visits bug * polish(pu): polish comments in _sample_orig_reanalyze_batch * fix(pu): add pad_action_lst in muzero_collector * polish(pu): polish dmc suz configs * fix(pu): fix reanalyzed_root_sampled_action in suz buffer * fix(pu): fix logp calculation in mcts expand, use clamp_limit for sampled actions, use 1e5 as total train steps in cos_lr_decay, polish cont policy loss * polish(pu): sample init position from the whole segment * fix(pu): fix empirical_distribution_type compare bug and half sampled actions sampled from a flatten gaussian * polish(pu): polish config and reward shape * polish(pu): polish memory config * polish(pu): polish config * polish(pu): polish config and comments * polish(pu): polish comments * polish(pu): polish comments and config * polish(pu): delete unused config * polish(pu): polish comments and docstring in unizero --------- Co-authored-by: PaParaZz1 <niuyazhe314@outlook.com> Co-authored-by: dyyoungg <yangdeyu@sensetime.com> Co-authored-by: jiayilee65 <jiayilee65@163.com>
- Loading branch information
1 parent
d27f29a
commit dd7a5eb
Showing
185 changed files
with
4,001 additions
and
1,268 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.