speedup accessing actions one agent at a time #4261

chriselion · 2020-07-22T22:40:14Z

Proposed change(s)

Retrieving the previous action one agent at a time showed up as a non-trivial portion of the run time (14 s out of 204 s) when profiling with cProfile.

Real-life profiling with 3DBall:

master
2020-07-22 13:27:45 INFO [stats.py:112] 3DBall: Step: 492000. Time Elapsed: 193.277 s Mean Reward: 100.000. Std of Reward: 0.000. Training.

optimized
2020-07-22 14:56:14 INFO [stats.py:112] 3DBall: Step: 492000. Time Elapsed: 187.601 s Mean Reward: 100.000. Std of Reward: 0.000. Training.
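
For scale, the numbers above can be turned into percentages. This is only a sketch of the arithmetic on the figures quoted in this description; the variable names are illustrative, and a single run per branch is not a rigorous benchmark.

```python
# Share of profiled time spent retrieving previous actions (from the cProfile run).
profiled_share = 14 / 204

# Wall-clock time to reach step 492000, taken from the two log lines above.
master_s = 193.277
optimized_s = 187.601

saved_s = master_s - optimized_s
relative_saving = saved_s / master_s

print(f"profiled share: {profiled_share:.1%}")
print(f"wall-clock saved: {saved_s:.3f} s ({relative_saving:.1%})")
```

The profiled share comes out at roughly 7%, and the end-to-end saving on this run at roughly 3%.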

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

Types of change(s)

Checklist

Added tests that prove my fix is effective or that my feature works
Updated the changelog (if applicable)
Updated the documentation (if applicable)
Updated the migration guide (if applicable)

Other comments

ervteng · 2020-07-23T22:31:49Z

ml-agents/mlagents/trainers/policy/tf_policy.py

@@ -353,6 +353,16 @@ def retrieve_previous_action(self, agent_ids: List[str]) -> np.ndarray:
                action_matrix[index, :] = self.previous_action_dict[agent_id]
        return action_matrix

+    def retrieve_previous_action_single(self, agent_id: str) -> np.ndarray:


I think this method needs to be moved to policy.py with the other retrieve methods (#4254).
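
The body of the new method is not shown in the hunk above. As a minimal sketch of the idea, assuming the policy keeps a `previous_action_dict` keyed by agent ID (as the existing method does), the single-agent path can skip building a matrix and do one dictionary lookup. The class name `PreviousActionStore`, the `num_actions` attribute, the zero-filled default for unknown agents, and the use of plain lists instead of NumPy arrays are all assumptions made so the example runs without dependencies; it is not the code from this PR.

```python
from typing import Dict, List


class PreviousActionStore:
    """Hypothetical stand-in for the policy's previous-action bookkeeping."""

    def __init__(self, num_actions: int) -> None:
        self.num_actions = num_actions
        self.previous_action_dict: Dict[str, List[int]] = {}

    def retrieve_previous_action(self, agent_ids: List[str]) -> List[List[int]]:
        # Batch path: allocate one zero-filled row per agent, then fill in
        # the rows for agents that have a stored action.
        action_matrix = [[0] * self.num_actions for _ in agent_ids]
        for index, agent_id in enumerate(agent_ids):
            if agent_id in self.previous_action_dict:
                action_matrix[index] = list(self.previous_action_dict[agent_id])
        return action_matrix

    def retrieve_previous_action_single(self, agent_id: str) -> List[int]:
        # Single-agent path: one dictionary lookup, no matrix allocation.
        # Returning zeros for an unknown agent mirrors the batch path (assumption).
        stored = self.previous_action_dict.get(agent_id)
        if stored is None:
            return [0] * self.num_actions
        # Return a copy so callers cannot mutate the stored action.
        return list(stored)


store = PreviousActionStore(num_actions=2)
store.previous_action_dict["agent-0"] = [1, 3]
single = store.retrieve_previous_action_single("agent-0")
batch_row = store.retrieve_previous_action(["agent-0"])[0]
```

Both paths return the same values for a known agent; the single-agent path simply avoids creating and indexing a one-row matrix on every step.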

ervteng · 2020-07-23T22:59:57Z

ml-agents/mlagents/trainers/agent_processor.py

@@ -137,7 +137,7 @@ def _process_step(
                action_pre = None
            action_probs = stored_take_action_outputs["log_probs"][idx]
            action_mask = stored_decision_step.action_mask
-            prev_action = self.policy.retrieve_previous_action([global_id])[0, :]
+            prev_action = self.policy.retrieve_previous_action_single(global_id)


Would you be opposed to renaming the new method to retrieve_previous_action and the old method to retrieve_previous_actions? That seems more descriptive and closer to the methods in the LL-API.

speedup accessing actions one agent at a time

a0da66a

chriselion requested a review from ervteng July 22, 2020 22:40

ervteng reviewed Jul 23, 2020

chriselion closed this Dec 8, 2020

chriselion deleted the MLA-1173-retrieve_previous_action branch December 18, 2020 23:56

github-actions bot locked as resolved and limited conversation to collaborators Dec 19, 2021
