
[rllib] Observations are forced to be a float numpy array? #13432

Open
deeplearningrobotics opened this issue Jan 14, 2021 · 4 comments
Labels: bug (Something that is supposed to be working; but isn't), rllib (RLlib related issues)

Comments

@deeplearningrobotics

What is the problem?

Ray version 1.0.1 with TF 2.2

I have complex observations consisting of strings as well as float arrays. It seems RLlib only accepts float numpy arrays, or dicts/lists of those, as observations from a custom environment.

Is there any documentation on how to get complex non-float observations to work?
At the moment, complex observations either always end up as a float tensor, or RLlib throws an error because it assumes that the observations are a numpy array.

Reproduction (REQUIRED)

I will work on providing a standalone example as soon as I can, but this requires a bit more work since I need to supply both a custom environment and a custom model. In the meantime, the sketch below illustrates the failure mode.
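A minimal sketch of the kind of environment this is about (the class name, keys, and shapes are illustrative assumptions, not an actual repro):

```python
# Hypothetical minimal example of the failure mode; all names and
# shapes here are illustrative assumptions, not an actual repro.
import gym
import numpy as np


class MixedObsEnv(gym.Env):
    def __init__(self, config=None):
        # gym has no space type for strings, so only the float part of
        # the observation can be declared in the observation space.
        self.observation_space = gym.spaces.Dict({
            "features": gym.spaces.Box(-1.0, 1.0, shape=(4,), dtype=np.float32),
        })
        self.action_space = gym.spaces.Discrete(2)

    def reset(self):
        return self._obs()

    def step(self, action):
        return self._obs(), 0.0, False, {}

    def _obs(self):
        # RLlib's preprocessor assumes every leaf is a float numpy
        # array; the string leaf either gets cast to a float tensor or
        # triggers an error, as described above.
        return {
            "features": np.zeros(4, dtype=np.float32),
            "text": "some non-tensor data",
        }
```

Note that because gym has no string space, the "text" leaf cannot even be declared in the observation space, which is where RLlib's float-array assumption starts to bite.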

@deeplearningrobotics added the bug and triage labels on Jan 14, 2021
@richardliaw added the rllib label and removed the triage label on Jan 14, 2021
@richardliaw (Contributor)

cc @sven1977

@ericl added this to the RLlib Bugs milestone on Mar 11, 2021
@ericl removed the rllib label on Mar 11, 2021
@Bam4d (Contributor) commented Mar 18, 2021

@deeplearningrobotics I have been working around this by putting observations that are not tensors into the 'info' dict and then retrieving them later in the policy by overriding compute_actions_from_input_dict. It's pretty hacky, but it works 🚀 (rough sketch below).
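A hedged sketch of what that looks like, written against Ray ~1.x. The env step() and the policy class are illustrative assumptions; compute_actions_from_input_dict is the actual RLlib Policy method being overridden. As the next comment notes, whether the infos actually show up in the input dict seems to vary by version.

```python
# Hedged sketch of the workaround described above (Ray ~1.x). Names
# other than compute_actions_from_input_dict are illustrative.
import numpy as np
from ray.rllib.policy.policy import Policy
from ray.rllib.policy.sample_batch import SampleBatch


def step(self, action):
    # Env side: keep the observation itself tensor-friendly and route
    # the non-tensor data out through the info dict instead.
    obs = np.zeros(4, dtype=np.float32)
    info = {"text": "some non-tensor data"}
    return obs, 0.0, False, info


class InfoAwarePolicy(Policy):
    # In practice you would subclass whichever concrete policy your
    # algorithm uses; the generic base is shown here only to mark the
    # override point.
    def compute_actions_from_input_dict(self, input_dict, explore=None,
                                        timestep=None, **kwargs):
        # Whether "infos" is populated here appears to depend on the
        # Ray version -- see the next comment in this thread.
        if SampleBatch.INFOS in input_dict:
            infos = input_dict[SampleBatch.INFOS]
            # ... use the non-tensor observations here ...
        return super().compute_actions_from_input_dict(
            input_dict, explore=explore, timestep=timestep, **kwargs)
```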

@maxsnijders

@Bam4d if you have any details on how to get the info dict inside compute_actions_from_input_dict, that would be very helpful! There's a kwarg for info_dict, but if I set a breakpoint there, the "info" dict that the environment returns as the last return value of step() doesn't seem to be present.

@Bam4d (Contributor) commented Sep 27, 2021

I think this bug was fixed in one of the more recent versions, perhaps 1.6.0.

@richardliaw added the rllib label on Oct 5, 2021
@anyscalesam removed this from the RLlib Bugs milestone on Jun 15, 2024