Clean up random seed code #3645

chriselion · 2020-03-16T23:57:14Z

Proposed change(s)

Remove some "seed pool" code that was (almost) unreachable
Change how seeds as determined for multiple environments.

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

https://jira.unity3d.com/browse/MLA-766
https://www.johndcook.com/blog/2016/01/29/random-number-generator-seed-mistakes/

Types of change(s)

Checklist

Added tests that prove my fix is effective or that my feature works
Updated the changelog (if applicable)
Updated the documentation (if applicable)
Updated the migration guide (if applicable)

Other comments

chriselion · 2020-03-16T23:58:04Z

ml-agents/mlagents/trainers/learn.py

    docker_target_name: Optional[str],
    no_graphics: bool,
-    seed: Optional[int],
+    seed: int,


This is always non-None.

chriselion · 2020-03-17T00:00:01Z

ml-agents/mlagents/trainers/learn.py

        worker_id: int, side_channels: List[SideChannel]
    ) -> UnityEnvironment:
-        env_seed = seed
-        if not env_seed:


This would only happen if you set --seed=0 or 1/10000 times if this block

ml-agents/ml-agents/mlagents/trainers/learn.py

Lines 492 to 494 in 811825b

if options.seed == -1:

run_seed = np.random.randint(0, 10000)

run_training(run_seed, options)

picked 0 for the seed.

np.random's seed isn't set at this point, so if you set seed=0 you wouldn't get reproducible results.

chriselion · 2020-03-17T00:02:01Z

ml-agents/mlagents/trainers/learn.py

-        if not env_seed:
-            env_seed = seed_pool[worker_id % len(seed_pool)]
+        # Make sure that each environment gets a different seed
+        env_seed = seed + worker_id


See also https://www.johndcook.com/blog/2016/01/29/random-number-generator-seed-mistakes/

I don't know how to prove which would be better or worse here - the environments should diverge after a few steps, but seeding them the same feels strange for multiple environments.

I think @harperj did some experiments with Walker and distributed training here - don't remember if there was a huge improvement but I remember the same-seeding being an issue in terms of diversity of experiences

Yeah, actually this code has been problematic for a while but I just never got around to cleaning it up. In practice for most environments it is not an issue. I can't think of any reason we'd practically want all of the environments to be the same seed, so far we have just been fortunate that there are other sources of randomness which make it fine either way.

chriselion · 2020-03-17T00:04:27Z

docs/Python-API.md

 - `worker_id` indicates which port to use for communication with the
  environment. For use in parallel training regimes such as A3C.
 - `seed` indicates the seed to use when generating random numbers during the
-  training process. In environments which do not involve physics calculations,


"no physics" does not imply "deterministic"

Agree. We called out physics before though just to let people know that the physics is non-determinstic. So just because you don't use a Random call in your code, it doesn't mean you are guaranteed deterministic.

The inverse of the statement is also not true: physics simulations are not inherently non-deterministic (although it sounds like Unity's implementation is).

harperj

LGTM

harperj · 2020-03-17T18:22:00Z

ml-agents/mlagents/trainers/learn.py

-        if not env_seed:
-            env_seed = seed_pool[worker_id % len(seed_pool)]
+        # Make sure that each environment gets a different seed
+        env_seed = seed + worker_id


Yeah, actually this code has been problematic for a while but I just never got around to cleaning it up. In practice for most environments it is not an issue. I can't think of any reason we'd practically want all of the environments to be the same seed, so far we have just been fortunate that there are other sources of randomness which make it fine either way.

remove obsolete code, offset worker seeds

3080e1e

chriselion commented Mar 16, 2020

View reviewed changes

chriselion commented Mar 17, 2020

View reviewed changes

chriselion requested a review from harperj March 17, 2020 00:16

harperj approved these changes Mar 17, 2020

View reviewed changes

chriselion merged commit 2625ba5 into master Mar 17, 2020

delete-merged-branch bot deleted the develop-MLA-766-random-seeds branch March 17, 2020 22:41

github-actions bot locked as resolved and limited conversation to collaborators May 15, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Clean up random seed code #3645

Clean up random seed code #3645

Uh oh!

chriselion commented Mar 16, 2020 •

edited by harperj

Loading

Uh oh!

chriselion Mar 16, 2020

Uh oh!

chriselion Mar 17, 2020

Uh oh!

chriselion Mar 17, 2020

Uh oh!

ervteng Mar 17, 2020

Uh oh!

harperj Mar 17, 2020

Uh oh!

chriselion Mar 17, 2020

Uh oh!

awjuliani Mar 17, 2020

Uh oh!

chriselion Mar 17, 2020

Uh oh!

awjuliani Mar 17, 2020

Uh oh!

harperj left a comment

Uh oh!

harperj Mar 17, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

	if options.seed == -1:
	run_seed = np.random.randint(0, 10000)
	run_training(run_seed, options)

Clean up random seed code #3645

Clean up random seed code #3645

Uh oh!

Conversation

chriselion commented Mar 16, 2020 • edited by harperj Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Proposed change(s)

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

Types of change(s)

Checklist

Other comments

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

harperj left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

chriselion commented Mar 16, 2020 •

edited by harperj

Loading