Skip to content

Add R2D1 agents #244

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 22 commits into from
Closed

Add R2D1 agents #244

wants to merge 22 commits into from

Conversation

jinPrelude
Copy link
Collaborator

Add R2D1 agents which can be combined with q-learning algorithms(DQN, C51, IQN).

Change grandiosely used functions

Fix zero-padding & torch contiguous

Fix zero-padding & Change indices sampling function

Change hyperparameters

Remove redundant codes

Add CNN compatibility to R2D1Agent

Remove redundant code

Implement rlpyt forward style

Add previous_action & previous_reward GRU input structure

Fix error

Fix prev_action bug & Use make_one_hot function

Fix error

Update descriptions & move leading_dims functions to helper_functions.py

Move valid_from_done from R2D1Loss to helper_functions.py

Fix parameters

r2d1_iqn loss & agent

Fix GRUBrain compatible with c51

Add R2D1C51Loss

Add r2d1_c51 configs

Fix priority > 0 assert error

Change parameters
Change parameters

Add total_step to wandb log

Add upndown env & configs

Fix test score

Fix test score sum to mean

Add total step to recurrent dqn_agent

Fix test log position

Add framestack argument

Remove upndown environment

Fix no_framestack argument

Add r2d1 resnet configs

Delete lunarlander iqn & Fix R2D1C51 lunarlander config description

Fix configs

Change total_step count startpoint after warmup

Chage test startpoint

Fix epsilon decay

Change r2d1 agent epsilon_decay

Fix several issues commented
@jinPrelude jinPrelude added the enhancement New feature or request label Jun 18, 2020
@jinPrelude jinPrelude requested review from MrSyee and cyoon1729 June 18, 2020 07:36
@jinPrelude jinPrelude self-assigned this Jun 18, 2020
@jinPrelude jinPrelude mentioned this pull request Jun 18, 2020
@lgtm-com
Copy link

lgtm-com bot commented Jun 18, 2020

This pull request introduces 10 alerts when merging bb37068 into 815a1ca - view on LGTM.com

new alerts:

  • 4 for Mismatch between signature and use of an overridden method
  • 2 for Signature mismatch in overriding method
  • 2 for __init__ method calls overridden method
  • 1 for Module is imported with 'import' and 'import from'
  • 1 for Redundant assignment

@lgtm-com
Copy link

lgtm-com bot commented Jun 22, 2020

This pull request introduces 10 alerts when merging 4066dbf into 815a1ca - view on LGTM.com

new alerts:

  • 4 for Mismatch between signature and use of an overridden method
  • 2 for Signature mismatch in overriding method
  • 2 for __init__ method calls overridden method
  • 1 for Module is imported with 'import' and 'import from'
  • 1 for Redundant assignment

@lgtm-com
Copy link

lgtm-com bot commented Jun 22, 2020

This pull request introduces 8 alerts when merging b7f8201 into 815a1ca - view on LGTM.com

new alerts:

  • 4 for Mismatch between signature and use of an overridden method
  • 2 for Signature mismatch in overriding method
  • 1 for Module is imported with 'import' and 'import from'
  • 1 for __init__ method calls overridden method

@lgtm-com
Copy link

lgtm-com bot commented Jun 23, 2020

This pull request introduces 8 alerts when merging 803b114 into 815a1ca - view on LGTM.com

new alerts:

  • 4 for Mismatch between signature and use of an overridden method
  • 2 for Signature mismatch in overriding method
  • 1 for Module is imported with 'import' and 'import from'
  • 1 for __init__ method calls overridden method

@jinPrelude
Copy link
Collaborator Author

Moved to #248 Due to my dumb push.

@jinPrelude jinPrelude closed this Jun 23, 2020
@jinPrelude jinPrelude deleted the feature/add_r2d1_4 branch June 23, 2020 06:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant