[feature] Add experimental PyTorch support #4335

ervteng · 2020-08-11T18:47:40Z

Proposed change(s)

Experimental implementations of PPO, SAC, Curiosity, and GAIL based on PyTorch modules. Exports to .onnx and should support inference for all network architectures found in example environmeents. Use --torch in CLI or add framework: pytorch to trainer settings in YAML file to enable PyTorch backend.

Known limitations:

No GPU support
Does not currently save Curiosity and GAIL networks in checkpoints
Checkpoint manager does not currently manage ONNX files
SAC runs slower than in TF.

Types of change(s)

Checklist

Added tests that prove my fix is effective or that my feature works
Updated the changelog (if applicable)
Updated the documentation (if applicable)
Updated the migration guide (if applicable)

Other comments

…elop-add-fire

# Conflicts: # ml-agents/mlagents/trainers/policy/tf_policy.py

Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ervin T <ervin@unity3d.com>

…gents into develop-add-fire-export

This reverts commit 3d7b809.

ervteng · 2020-08-20T01:39:08Z

docs/Learning-Environment-Examples.md

@@ -460,7 +460,7 @@ you would like to contribute environments, please see our
  head, thighs, shins, feet, arms, forearms and hands.
 - Goal: The agents must move its body toward the goal direction without falling.
  - `WalkerDynamic`- Goal direction is randomized.
-  - `WalkerDynamicVariableSpeed`- Goal direction and walking speed are randomized. 
+  - `WalkerDynamicVariableSpeed`- Goal direction and walking speed are randomized.


I've tried to remove this, this delta isn't picked up by git 👿

awjuliani and others added 30 commits April 14, 2020 15:05

Begin porting work

017c3cb

Add ResNet and distributions

d99fc74

Merge remote-tracking branch 'origin/master' into develop-add-fire

c981a81

Merge remote-tracking branch 'origin/master' into develop-add-fire

6dfb8fa

Dynamically construct actor and critic

a492a9f

Initial optimizer port

5e6f4ae

Refactoring policy and optimizer

7e46bc5

Resolving a few bugs

a3a1c0f

Share more code between tf and torch policies

652b399

Slightly closer to running model

da49aaa

Training runs, but doesn’t actually work

1ae28be

Fix a couple additional bugs

b68eb20

Add conditional sigma for distribution

5e39d84

Fix normalization

a0d6823

Merge remote-tracking branch 'origin/develop-add-fire-debug' into dev…

c807190

…elop-add-fire

Support discrete actions as well

e2d7fee

Continuous and discrete now train

f5b28d3

Mulkti-discrete now working

50f5cc1

Visual observations now train as well

8c10cd3

Merge remote-tracking branch 'origin/master' into develop-add-fire

8445661

GRU in-progress and dynamic cnns

deb6e92

Fix for memories

57486ab

Remove unused arg

f6d5df5

Combine actor and critic classes. Initial export.

5521670

Support tf and pytorch alongside one another

9b9e783

Prepare model for onnx export

e98def6

Merge remote-tracking branch 'origin/master' into develop-add-fire

d6d69ad

# Conflicts: # ml-agents/mlagents/trainers/policy/tf_policy.py

Use LSTM and fix a few merge errors

2c6daac

Merge remote-tracking branch 'origin/master' into develop-add-fire

411d0c4

Fix bug in probs calculation

8b36db0

andrewcoh and others added 11 commits August 18, 2020 15:12

revert tests

6635413

Fix of the test for multi visual input

1d89489

Make reset block submodule

48e77c6

fix export input_name

6e75dd1

[add-fire] Memory class abstraction (#4375)

7660a90

make visual input channel first for export

4db512b

Merge branch 'develop-add-fire' into develop-add-fire-export

bd41761

Don't use torch.split in LSTM

47212e5

Add fire to test_simple_rl.py (#4378)

09c2dc3

Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ervin T <ervin@unity3d.com>

Merge branch 'develop-add-fire' of github.com:Unity-Technologies/ml-a…

b22f412

…gents into develop-add-fire-export

reverting unity_to_external_pb2_grpc.py

269a4c8

vincentpierre marked this pull request as ready for review August 19, 2020 21:05

vincentpierre changed the title ~~[DO NOT MERGE] Develop add fire~~ Develop add fire Aug 19, 2020

andrewcoh and others added 7 commits August 19, 2020 14:13

remove duplicate of curr documentation

3d7b809

Revert "remove duplicate of curr documentation"

1940d96

This reverts commit 3d7b809.

remove duplicated curriculum doc (#4386)

9406624

Fixed discrete models

0a8b5e0

Always export one Action tensor (#4388)

e6eb502

[add-fire] Revert unneeded changes back to master (#4389)

6f46b30

add comment

435d226

ervteng commented Aug 20, 2020

View reviewed changes

Ruo-Ping Dong and others added 3 commits August 19, 2020 22:25

fix test

1a15577

Fix export

38c1007

add fire clean up docstrings in create policies (#4391)

ddcf078

vincentpierre approved these changes Aug 20, 2020

View reviewed changes

andrewcoh approved these changes Aug 20, 2020

View reviewed changes

ervteng changed the title ~~Develop add fire~~ [feature] Add experimental PyTorch support Aug 20, 2020

[add-fire] Update changelog (#4397)

e93c746

vincentpierre merged commit 672b608 into master Aug 20, 2020

github-actions bot locked as resolved and limited conversation to collaborators Aug 21, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[feature] Add experimental PyTorch support #4335

[feature] Add experimental PyTorch support #4335

Uh oh!

ervteng commented Aug 11, 2020 •

edited

Loading

Uh oh!

ervteng Aug 20, 2020 •

edited

Loading

Uh oh!

Uh oh!

[feature] Add experimental PyTorch support #4335

[feature] Add experimental PyTorch support #4335

Uh oh!

Conversation

ervteng commented Aug 11, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Proposed change(s)

Types of change(s)

Checklist

Other comments

Uh oh!

ervteng Aug 20, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ervteng commented Aug 11, 2020 •

edited

Loading

ervteng Aug 20, 2020 •

edited

Loading