-
Notifications
You must be signed in to change notification settings - Fork 0
Rezas expert #3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Rezas expert #3
Conversation
… example files. Now just need to use the map...
… on what it does with obstacles
| This program will load in the task logs in `log_dir` and use them to generate actions in the `navigation2` task. | ||
|
|
||
| Information about the arguments can be found by running `python3 NavigationExpert.py -h` | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add an example command
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This file doesn't appear to actually run the expert algorithm, rather it constructions the agent action from the expert. Do you plan to merge the expert into this file? Maybe we call this file something like navigation_select_actions_from_expert.py
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How do you anticipate we will integrate these expert demonstrations into the RLPD architecture. I imaging that the files in log_dir will act like our expert dataset, which we will then call to rollout an episode on the environment. Before pushing this to main, I think we should figure out this integration. For example, I think we could create Collector class that does "symmetric sampling", wherein it pulls have the batch from the expert and half from the policy. See the following link: https://pytorch.org/rl/reference/collectors.html
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have verified this file runs! Good job
expert/NavigationExpert.py
Outdated
| obs = [] | ||
| for s in range(n_steps): | ||
| step += 1 | ||
| print(f"Step {step}") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Instead of printing steps, please use a progress bar like tqdm
|
@shalin-j Just letting you know I plan to merge this in the next day or two! |
|
@piercehowell sounds good. I'll commit the updated collision reward tonight to handle obstacles. I also have a different version I'm working on locally to separate some of the expert logic from the scenario logic, but I think we would want to discuss how to structure that at some point to ensure everything works correctly with the expert integration in rlpd, so holding off on committing those changes to this branch. |
…gentSimulator into RezasExpert
No description provided.