Fix A3C PyTorch implementation #2036
Merged
29 commits
51901ad  Use F.softmax instead of a pointless network layer (alok)
5d7fc19  Use correct pytorch functions (alok)
8583616  Rename argument name to out_size (alok)
18c4a4c  Fix shapes of tensors (alok)
64ae2ab  Fmt (alok)
8accdae  replace deprecated function (alok)
8645cd7  rm unnecessary Variable wrapper (alok)
47e8ebd  rm all use of torch Variables (alok)
f9e4797  Merge branch 'master' into fix-a3c-torch (alok)
884a6a8  Ensure that values are flat list (alok)
7d1b205  Fix shape error in conv nets (alok)
aeab1f3  Merge branch 'master' into fix-a3c-torch (alok)
662eaa5  Merge branch 'master' into fix-a3c-2 (alok)
f9561d3  fmt (alok)
7f06a1f  Fix shape errors (alok)
0438707  Add TODO (alok)
da8d9e6  Use correct filter size (alok)
db9804d  Add missing channel major (alok)
e865a09  Merge branch 'master' into fix-a3c-torch (alok)
27cd897  Revert reshape of action (alok)
75ea9a7  Squeeze action (alok)
87ab87e  Squeeze actions along first dimension (alok)
9acd029  try adding pytorch tests (richardliaw)
c4b8ca7  typo (richardliaw)
6a79793  fixup docker messages (richardliaw)
7cdedf3  Fix A3C for some envs (alok)
da414fc  fmt (alok)
3b9234f  nit flake (richardliaw)
9ddab77  small lint (richardliaw)
@@ -1,8 +1,10 @@
# The examples Docker image adds dependencies needed to run the examples

FROM ray-project/deploy
RUN conda install -y -c conda-forge tensorflow

# This updates numpy to 1.14 and mutes errors from other libraries
RUN conda install -y numpy
RUN apt-get install -y zlib1g-dev
RUN pip install gym[atari] opencv-python==3.2.0.8
RUN pip install gym[atari] opencv-python==3.2.0.8 tensorflow
RUN pip install --upgrade git+git://github.com/hyperopt/hyperopt.git
# RUN conda install -y -q pytorch torchvision -c soumith
RUN conda install pytorch-cpu torchvision-cpu -c pytorch
`clip_grad_norm` is deprecated in favor of the underscore version, `clip_grad_norm_`, hence the change.
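For anyone reading along, here is a minimal sketch of the underscore variant in use. It is not the code from this PR: the tiny `nn.Linear` model, the random batch, and the `max_norm` value are all made up for illustration. The trailing underscore follows PyTorch's convention for in-place operations, since the function rescales the `.grad` tensors directly.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical stand-in model and loss, only to produce some gradients.
model = nn.Linear(4, 2)
loss = model(torch.randn(8, 4)).pow(2).sum()
loss.backward()

max_norm = 0.5  # illustrative threshold, not a value taken from the PR

# Rescales the gradients in place so their combined L2 norm is at most
# max_norm, and returns the total norm measured before clipping.
norm_before = torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm)

# Recompute the combined gradient norm to confirm the clipping took effect.
norm_after = torch.sqrt(sum(p.grad.pow(2).sum() for p in model.parameters()))

print(float(norm_before), float(norm_after))
```

The call sits between `loss.backward()` and the optimizer step in a training loop, so the optimizer only ever sees the clipped gradients.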