Release v0.4 #884

awjuliani · 2018-06-16T00:35:41Z

No description provided.

This PR makes the following changes:

* Moves clipping of the continuous control model into the model itself. Output is now always in [-1, 1].
* Internal model values are now clipped to [-3, 3] before being rescaled to [-1, 1] for output. This improves training performance by providing a wider range of values within which the PDF of the Gaussian can fall. The [-1, 1] output range is used to be more environment-creator friendly.
* Fixes an issue where epsilon was erroneously being used to reconstruct old probabilities during the PPO update, leading to reduced learning performance.
* Introduces a ScaleAction() function within Python to easily rescale values from [-1, 1] to an arbitrary range.
* Re-trains all CC models using the improved algorithm. All performance levels are equal or improved; in the case of Crawler, the improvement is drastic.
* Updates documentation appropriately.
* Makes miscellaneous minor code style and optimization improvements within environments.
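The clipping and rescaling described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `squash_action` and `scale_action` are hypothetical names, and the linear mapping in `scale_action` is only an assumption about what a helper like ScaleAction() might do.

```python
def squash_action(raw, bound=3.0):
    """Clip a raw model output to [-bound, bound], then rescale it to [-1, 1]."""
    clipped = max(-bound, min(bound, raw))
    return clipped / bound


def scale_action(action, low, high):
    """Linearly map an action in [-1, 1] onto [low, high] (assumed behaviour)."""
    return low + (action + 1.0) * 0.5 * (high - low)


if __name__ == "__main__":
    # Hypothetical raw outputs; values outside [-3, 3] saturate at -1 or 1.
    for raw in [-4.2, -1.5, 0.0, 3.0]:
        unit = squash_action(raw)
        print(raw, unit, scale_action(unit, 0.0, 10.0))
```

Keeping the model output in [-1, 1] and rescaling on the environment side means each environment can choose its own action range without the model needing to know about it.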

- Indent the section about providing actions to multiple brains to be in line with the rest of the step() docs.
- Move the line about what step() returns closer to the top of the docs so it's harder to overlook.
- Add a small code snippet about how to get BrainInfo belonging to a specific brain and how to get data from that BrainInfo object.
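For readers without the docs at hand, the lookup mentioned in the last item might look something like the sketch below. It runs without Unity by using stand-ins: the `BrainInfo` namedtuple, its field names, the brain names, and `fake_step()` are all assumptions for illustration, and it assumes step() returns a dictionary keyed by brain name.

```python
from collections import namedtuple

# Stand-in for the real BrainInfo object; the field names are assumptions.
BrainInfo = namedtuple("BrainInfo", ["vector_observations", "rewards", "local_done"])


def fake_step():
    """Mimic a step() call that returns one BrainInfo per brain name."""
    return {
        "WalkerBrain": BrainInfo([[0.1, 0.2]], [1.0], [False]),
        "CrawlerBrain": BrainInfo([[0.3, 0.4]], [0.5], [True]),
    }


all_info = fake_step()

# Pick out the BrainInfo that belongs to one specific brain.
walker_info = all_info["WalkerBrain"]

# Read per-agent data from that BrainInfo object.
first_agent_obs = walker_info.vector_observations[0]
first_agent_reward = walker_info.rewards[0]
first_agent_done = walker_info.local_done[0]
print(first_agent_obs, first_agent_reward, first_agent_done)
```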

eshvk · 2018-06-16T00:38:05Z

:+1:

xiaomaogy

+1

sterlingcrispin and others added 30 commits April 19, 2018 11:20

CrawlerLegContact incorrectly refers to Ground (#589)

49b8e88

No game object is named "Platform" therefore the legs never make contact with anything

Report means instead of totals for losses (#580)

5bc739d

* Report means instead of totals for losses.
* Report absolute loss for policy.

Explicitly document Basics.ipynb location.

8e2dc3d

Update Python-API.md

9946593

Hotfix 0.3.1b (#656)

4d9ad52

* [Fix] Use the stored agent info instead of the previous agent info when bootstrapping the value
* [Bug Fix] Addressed #643
* [Added Line Break]

[Cold Fix] Making the episode length and mean reward more accurate for the first episode (#657)

4e116a3

refactored the quick start and installation guide, added faq

e6da141

Merge pull request #599 from Unity-Technologies/docs-refactor

346acb1

Quick start integrated into Installation

added the video link

defcc2d

resolved comments

8eca76e

Merge pull request #703 from Unity-Technologies/develop-docs-imitation-video-link

eacf2a3

added the video link

Add ignore for plugins folder

f9b667e

Merge pull request #730 from Unity-Technologies/develop-ignore-plugins

ae512a0

Add ignore for plugins folder

Develop fix cumulative reward (#725)

48f2d15

* [Cold Fix] Split the way cumulative rewards and episode length are counted. The reward is appended at each step to the cumulative reward. The episode count is ONLY incremented when d_t+1 is false.
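One way to read this counting rule is sketched below. It is an interpretation, not the code from the commit: `track_episode` is a hypothetical helper, and it assumes "d_t+1" refers to the done flag observed after each step and that the count in question is the per-episode step count.

```python
def track_episode(rewards, dones_next):
    """Accumulate reward on every step; count a step only if the next done flag is False."""
    cumulative_reward = 0.0
    episode_length = 0
    for reward, done_next in zip(rewards, dones_next):
        # The reward is always added, including on the terminal step.
        cumulative_reward += reward
        # The length counter only advances while the episode is still running.
        if not done_next:
            episode_length += 1
    return cumulative_reward, episode_length


if __name__ == "__main__":
    # Three steps; the episode ends after the third one.
    print(track_episode([1.0, 0.5, 2.0], [False, False, True]))
```

Separating the two counters means the final reward is still recorded even though the terminal step does not extend the episode length.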

Walker Environment (#720)

8a59fe3

* Add `Walker` example environment and documentation.

Develop docs azure (#744)

3f000d9

* First draft of Azure support docs
* Correcting links to other docs
* Adding additional links and cleaning instructions
* Adding references to Azure docs in other appropriate places

added video links

4e502e5

Merge pull request #772 from Unity-Technologies/develop-doc-videolinks

54eafa1

added video links

[Fixed BC with LSTM] (#766)

ec55977

Fixes the issue raised by @hsaikia in #552. Added the memory_size variable to the BC model. Added memory_size and recurrent_out to the output nodes of the graph when using BC with LSTM.

* Add benchmark thresholds for example environments

f489dc2

Fix Explicit Documentation Issue (#776)

ffdad74

Monitor without JSON Conversion (#724)

e18938b

* [Refactor] Fixed line indentation
* Removed the library Newtonsoft.Json from the monitor
* Replaced calls to JSON conversion with manual conversion
* [Modified] The Monitor now has multiple Log methods that take different object types

[Added Ascii art on learn.py] (#727)

82daedd

* [Added Ascii art on learn.py] Note: This is by far the best feature of 0.4

Update Learning-Environment-Create-New.md (#769)

4537c4f

Some suggestions to avoid ambiguity

Update Learning-Environment-Create-New.md (#770)

f5aa0c5

Shouldn't Done(); be placed after the rewards are given?

[Update Curriculum for WallJump] Updating the curriculum for WallJump (#774)

08f93f1

Newer Ascii Art (#780)

b5c1511

Replaced UNITY ML AGENTS with the unity logo

Fixed path variables for Anaconda3

f07ae99

awjuliani and others added 19 commits June 15, 2018 14:50

Fix for “missing” pretrained model (#862)

e561f75

Fix typo (#864)

4d26483

Make ball opaque (#870)

7ebb7b8

Fix BananaIL frozen agent material (#865)

ee535eb

* Fix BananaIL frozen agent material
* [Fix] Added texture on the imitation learning scene and linked the models to the internal brains

Replaced Collision enter with collision stay (#867)

cff4ad4

[Removed log when lookDir is 0 in Bouncer] (#861)

060a9ef

Also moved the look code into Update

[Hotfix] Remove code in PushAgentBasic that presumes that there is only one brain

3f855c4

Merge pull request #872 from Unity-Technologies/release-v0.4-hallwayil-bugfix

579a33a

[Hotfix] Remove code in PushAgentBasic that presumes that there is only one brain

Several final improvements to docs, scene and configs. (#871)

967bdf2

* Added missing declaration to docs sample code.
* Added pretrained model as default graph in Internal brain of Tennis scene
* Disabled PlayerBrain in Tennis by default.
* Removed accidental config.

Fix wall material (#874)

b191a88

Fix for visual observation w/ curiosity (#873)

5aca9a6

some hack to make Windows save the model on Ctrl+C

ebf5b79

Update document (#875)

217b6cd

Merge pull request #876 from Unity-Technologies/release-windows-save-model-fix

81fe4da

some hack to make Windows save the model on Ctrl+C

removed the internal brain from all the visual scenes, since they've not been trained for them (#877)

3b06c61

Fix for Discrete observations + Curiosity (#866)

1f39ddf

change the visual scenes to default to player brain (#879)

2520a79

Replaced message printed in Python and in documentation. (#881)

906c372

Error message when using ODD and Curiosity (#883)

9ca939b

* Remove extra bouncer brain hyperparameters
* Add error when using curiosity+odd

awjuliani requested review from mmattar, eshvk, xiaomaogy and vincentpierre June 16, 2018 00:35

Merge branch 'master' into release-v0.4

562d49d

eshvk approved these changes Jun 16, 2018

xiaomaogy approved these changes Jun 16, 2018

xiaomaogy merged commit 20569f9 into master Jun 16, 2018

awjuliani deleted the release-v0.4 branch June 29, 2018 18:04

github-actions bot locked as resolved and limited conversation to collaborators May 19, 2021
