-
Notifications
You must be signed in to change notification settings - Fork 4.3k
Documentation for Goal conditioning #5149
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
||
It is possible for agents to collect observations that will be treated as "goal". | ||
A goal is used to condition the policy of the Agent, meaning that if the goal | ||
changes, the behavior of the Agent will change as well. Note that this is true |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we should say that goals change the policy, which is a mapping from (observation -> action)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we want to refer to it as a "goal signal" instead of a goal?
Co-authored-by: Arthur Juliani <awjuliani@gmail.com>
Co-authored-by: Arthur Juliani <awjuliani@gmail.com>
@@ -560,6 +562,36 @@ between -1 and 1. | |||
of an entity to the `BufferSensor`. | |||
- Normalize the entities observations before feeding them into the `BufferSensor`. | |||
|
|||
### Goal Signals | |||
|
|||
It is possible for agents to collect observations that will be treated as "goal." |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It is possible for agents to collect observations that will be treated as "goal." | |
It is possible for agents to collect observations that will be treated as a "goal." |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think within this paragraph we should also call these goal signals, and not goals.
Co-authored-by: Ervin T. <ervin@unity3d.com>
Proposed change(s)
Documentation for Goal conditioning
Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)
Types of change(s)
Checklist
Other comments