[wingman -> rllib] Improved stats changes in AsyncSamplesOptimizer #3966
Test FAILed.
@@ -85,7 +84,9 @@ def _init(self,
self.timers = {k: TimerStat() for k in ["train", "sample"]}
Does it also make sense to remove the timers?
Well, I think so. Replaced self.timers with self._optimizer_step_timer.
Test FAILed.
Test FAILed.
Just tried it out, looks good.
Test PASSed.
Test PASSed.
Test FAILed.
Pushed some lint fixes. Note that the number of remote_evaluators should still be > 0, since the async samples optimizer doesn't have fallback code for the 0-workers case like the other optimizers do.
Test FAILed.
"num_weight_syncs": self.num_weight_syncs,
"num_steps_replayed": self.num_replayed,
"timing_breakdown": timing,
"learner_queue": self.learner.learner_queue_size.stats(),
}
}, **self.get_mean_stats_and_reset())
This syntax is needed for Python 2.
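For context, a minimal sketch of the kind of dict merge this comment seems to refer to. It assumes the point is that the `{**a, **b}` literal form is Python 3.5+ only, while `dict(a, **b)` also works on Python 2 as long as the keys of the second dict are strings. The helper `merge_stats` and the sample values are hypothetical and are not taken from the PR.

```python
def merge_stats(base, extra):
    """Return a new dict holding the keys of `base` plus those of `extra`."""
    # dict(mapping, **kwargs) runs on both Python 2 and Python 3.
    # It requires every key in `extra` to be a string, and on a key
    # clash the value from `extra` wins.
    return dict(base, **extra)


# Hypothetical stats values, for illustration only.
base = {"num_weight_syncs": 3, "num_steps_replayed": 0}
extra = {"sample_throughput": 1250.0}
merged = merge_stats(base, extra)
```

The call builds a new dict, so `base` is left unchanged.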
Test FAILed.
jenkins retest this please
Test PASSed.
Merged, thanks!
Stats gathering in AsyncSamplesOptimizer is improved: samples_throughput and train_throughput were giving bad values (mostly 0, with periodic peaks) instead of a steady graph showing throughput per second.
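To illustrate the idea of a steady per-second throughput, here is a rough sketch that averages a count over the time elapsed since the last report and then resets. It is an assumption about the general approach, not the PR's actual code: the class `ThroughputMeter`, its methods, and the injectable `clock` argument are all hypothetical names.

```python
import time


class ThroughputMeter(object):
    """Hypothetical helper: mean units per second since the last reset."""

    def __init__(self, clock=time.time):
        # `clock` is injectable so the example can run with a fake clock.
        self._clock = clock
        self._count = 0
        self._start = clock()

    def push(self, n):
        # Record `n` more units (e.g. sampled or trained steps).
        self._count += n

    def mean_and_reset(self):
        # Average over the whole interval since the last report, so the
        # value does not depend on whether a batch happened to arrive
        # right before the report.
        now = self._clock()
        elapsed = now - self._start
        rate = self._count / elapsed if elapsed > 0 else 0.0
        self._count = 0
        self._start = now
        return rate


# Usage with a fake clock: 1000 steps recorded over 2 seconds.
fake_now = [0.0]
meter = ThroughputMeter(clock=lambda: fake_now[0])
meter.push(500)
meter.push(500)
fake_now[0] = 2.0
rate = meter.mean_and_reset()
```

Averaging over the reporting interval is one plausible way to avoid a graph that sits at 0 and spikes occasionally; the real optimizer may compute its stats differently.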