-
Notifications
You must be signed in to change notification settings - Fork 65
Add R2D1 agents #248
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add R2D1 agents #248
Conversation
Change grandiosely used functions Fix zero-padding & torch contiguous Fix zero-padding & Change indices sampling function Change hyperparameters Remove redundant codes Add CNN compatibility to R2D1Agent Remove redundant code Implement rlpyt forward style Add previous_action & previous_reward GRU input structure Fix error Fix prev_action bug & Use make_one_hot function Fix error Update descriptions & move leading_dims functions to helper_functions.py Move valid_from_done from R2D1Loss to helper_functions.py Fix parameters r2d1_iqn loss & agent Fix GRUBrain compatible with c51 Add R2D1C51Loss Add r2d1_c51 configs Fix priority > 0 assert error Change parameters
Change parameters Add total_step to wandb log Add upndown env & configs Fix test score Fix test score sum to mean Add total step to recurrent dqn_agent Fix test log position Add framestack argument Remove upndown environment Fix no_framestack argument Add r2d1 resnet configs Delete lunarlander iqn & Fix R2D1C51 lunarlander config description Fix configs Change total_step count startpoint after warmup Chage test startpoint Fix epsilon decay Change r2d1 agent epsilon_decay Fix several issues commented
This pull request introduces 8 alerts when merging 1d3cf20 into 815a1ca - view on LGTM.com new alerts:
|
This pull request introduces 8 alerts when merging d3a2e59 into 815a1ca - view on LGTM.com new alerts:
|
This pull request introduces 8 alerts when merging 083a447 into 815a1ca - view on LGTM.com new alerts:
|
This pull request introduces 8 alerts when merging c3056b3 into 815a1ca - view on LGTM.com new alerts:
|
This pull request introduces 8 alerts when merging abcc34b into 815a1ca - view on LGTM.com new alerts:
|
Add R2D1 agents which can be combined with q-learning algorithms(DQN, C51, IQN).
IQN(blue) vs R2D1IQN(red)

C51(blue) vs R2D1C51(red)

DQN(blue) vs R2D1DQN(red)

IQN ResNet(blue) vs R2D1DQN ResNet(red)

lunarlander C51(blue) vs R2D1C51(red)
