Skip to content

Conversation

@RezaTorbati
Copy link

No description provided.

This program will load in the task logs in `log_dir` and use them to generate actions in the `navigation2` task.

Information about the arguments can be found by running `python3 NavigationExpert.py -h`

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add an example command

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This file doesn't appear to actually run the expert algorithm, rather it constructions the agent action from the expert. Do you plan to merge the expert into this file? Maybe we call this file something like navigation_select_actions_from_expert.py

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How do you anticipate we will integrate these expert demonstrations into the RLPD architecture. I imaging that the files in log_dir will act like our expert dataset, which we will then call to rollout an episode on the environment. Before pushing this to main, I think we should figure out this integration. For example, I think we could create Collector class that does "symmetric sampling", wherein it pulls have the batch from the expert and half from the policy. See the following link: https://pytorch.org/rl/reference/collectors.html

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have verified this file runs! Good job

obs = []
for s in range(n_steps):
step += 1
print(f"Step {step}")
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Instead of printing steps, please use a progress bar like tqdm

@piercehowell
Copy link
Collaborator

@shalin-j Just letting you know I plan to merge this in the next day or two!

@shalin-jain
Copy link

@piercehowell sounds good. I'll commit the updated collision reward tonight to handle obstacles. I also have a different version I'm working on locally to separate some of the expert logic from the scenario logic, but I think we would want to discuss how to structure that at some point to ensure everything works correctly with the expert integration in rlpd, so holding off on committing those changes to this branch.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants