-
Notifications
You must be signed in to change notification settings - Fork 65
Add R2D1 agents #244
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add R2D1 agents #244
Conversation
Change grandiosely used functions Fix zero-padding & torch contiguous Fix zero-padding & Change indices sampling function Change hyperparameters Remove redundant codes Add CNN compatibility to R2D1Agent Remove redundant code Implement rlpyt forward style Add previous_action & previous_reward GRU input structure Fix error Fix prev_action bug & Use make_one_hot function Fix error Update descriptions & move leading_dims functions to helper_functions.py Move valid_from_done from R2D1Loss to helper_functions.py Fix parameters r2d1_iqn loss & agent Fix GRUBrain compatible with c51 Add R2D1C51Loss Add r2d1_c51 configs Fix priority > 0 assert error Change parameters
Change parameters Add total_step to wandb log Add upndown env & configs Fix test score Fix test score sum to mean Add total step to recurrent dqn_agent Fix test log position Add framestack argument Remove upndown environment Fix no_framestack argument Add r2d1 resnet configs Delete lunarlander iqn & Fix R2D1C51 lunarlander config description Fix configs Change total_step count startpoint after warmup Chage test startpoint Fix epsilon decay Change r2d1 agent epsilon_decay Fix several issues commented
This pull request introduces 10 alerts when merging bb37068 into 815a1ca - view on LGTM.com new alerts:
|
This pull request introduces 10 alerts when merging 4066dbf into 815a1ca - view on LGTM.com new alerts:
|
This pull request introduces 8 alerts when merging b7f8201 into 815a1ca - view on LGTM.com new alerts:
|
…algorithms into feature/add_r2d1_4
This pull request introduces 8 alerts when merging 803b114 into 815a1ca - view on LGTM.com new alerts:
|
Moved to #248 Due to my dumb push. |
Add R2D1 agents which can be combined with q-learning algorithms(DQN, C51, IQN).