Always return full state dict (even when only one agent) #328

sethmnielsen · 2019-08-31T02:01:37Z

Describe the bug
In some cases, generic code needs to be compatible with any of the Holodeck worlds (in my application, any world that contains a UavAgent). Currently, it is not possible to access the state after ticking with the same syntax for all worlds containing a UavAgent. This is because some worlds have multiple agents while others have only one: in a multi-agent world the returned dictionary maps agent names to sensor dicts, whereas in a single-agent world it maps sensor names directly to sensor data.

I understand that the setup code for the two different world types needs to be different, but in my case, I have separated the setup code from the main script, and the setup code is different depending on which world is specified. My main script, however, contains a lot of logic and operations that are meant to work for a UavAgent in any world, and I need to access the state multiple times throughout.

To Reproduce
The following code works fine for a multi-agent world:

uav_state = env.tick()['uav0']

but fails if the world has only one agent, because the returned dict is then keyed by sensor name rather than agent name.

Expected behavior
It seems to me that the syntax should be consistent regardless of the number of agents. I don't see a strong need for a different interface between one or multiple agents (other than for slightly simpler code in the single-agent case), and I am of the opinion that the state should simply always return as an agent name to sensor dict.

I tested changing the following lines of environments.py (in the reset function) from

if self.num_agents == 1:
    self._default_state_fn = self._get_single_state
else:
    self._default_state_fn = self._get_full_state

to

self._default_state_fn = self._get_full_state

and the code for accessing the state in a multi-agent world (the Ocean world) had no problems accessing the state in a single-agent world (UrbanCity) as well.

I realize I may be missing something and am open to other solutions.
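
For anyone who cannot patch environments.py, the same normalization can be sketched on the caller's side. This is only an illustration under assumptions: as_full_state is a hypothetical helper (not part of Holodeck), the caller supplies the list of agent names, and the sensor name and values below are placeholder data.

```python
def as_full_state(state, agent_names):
    """Return `state` in agent-name -> sensor-dict form.

    Hypothetical helper, not part of Holodeck. `agent_names` is the list of
    agent names in the world, supplied by the caller.
    """
    if len(agent_names) == 1:
        # Single-agent worlds return {sensor_name: data}; wrap it.
        return {agent_names[0]: state}
    # Multi-agent worlds already return {agent_name: {sensor_name: data}}.
    return state


# Illustrative data only; the sensor name and values are placeholders.
single = {"LocationSensor": [0.0, 0.0, 1.0]}
multi = {"uav0": {"LocationSensor": [0.0, 0.0, 1.0]}, "boat0": {}}

# The same lookup now works for both shapes.
uav_state_single = as_full_state(single, ["uav0"])["uav0"]
uav_state_multi = as_full_state(multi, ["uav0", "boat0"])["uav0"]
```

Note that this assumes the unpatched behavior described above. If the environment already returns the full dict for a single agent, the wrapper would nest it one level too deep.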

Version Information:

OS: Ubuntu
Version: 18.04
Holodeck Version: 0.2.2dev (boat-freeze branch)
World/Scenario version: 0.2.2dev

Additional context
Also, I noticed that self._default_state_fn is set in both the __init__ and reset functions of HolodeckEnvironment. Isn't one or the other redundant?


sethmnielsen · 2019-08-31T23:28:51Z

After further testing, I found that the following code works in both situations, without modifying the python source code:

uav = env.agents['uav0']
env.tick()
state = uav.agent_state_dict

Should I just stick with this workaround?

FYI, it took a good while of searching through the code to find this solution. I don't see the agents dict in the documentation, and the agent_state_dict was kind of hidden. So if this is a good solution, then maybe it should be made clearer in the examples/docs?
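
In case it helps others, the workaround above can be wrapped in a small function. This is a minimal sketch: tick_and_read is a hypothetical name, and it relies only on env.agents, env.tick() and agent_state_dict as used in the snippet above. I have not checked how stable those attributes are across Holodeck versions.

```python
def tick_and_read(env, agent_name):
    """Tick once and return the named agent's state dict.

    Hypothetical helper. It relies only on `env.agents`, `env.tick()` and
    `agent.agent_state_dict`, as used in the workaround above.
    """
    agent = env.agents[agent_name]
    env.tick()  # the return value is ignored, so its shape does not matter
    return agent.agent_state_dict


# Usage (with an existing environment `env`):
# uav_state = tick_and_read(env, "uav0")
```

Because the return value of env.tick() is ignored, the call reads the same in single-agent and multi-agent worlds.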

nickwalton · 2019-09-05T21:19:57Z

Hi Seth,

Good point. It would probably make sense to make the single-agent and multi-agent syntax the same. We'll look into what the best fix is.

jaydenmilne · 2019-09-11T17:48:19Z

I believe that it was done this way so that with one agent the API would be the same as OpenAI Gym, since that is one of the goals of Holodeck. Since there is no standard multiagent API in Gym, the other method was used.

nickwalton · 2019-09-11T19:19:34Z

We discussed this and decided that env.step() should follow the OpenAI Gym API, but env.tick() should behave the same whether it's a single-agent or multi-agent world. @allisoncl8

daniekpo · 2019-10-24T16:16:59Z

@nickwalton, can we close this issue?

sethmnielsen added the bug label Aug 31, 2019

nickwalton assigned allisoncl8 Sep 11, 2019

vinhowe unassigned allisoncl8 Apr 29, 2021
