@matthewtrepte matthewtrepte commented May 15, 2025

Description

Adds unit tests that train and evaluate environments across all workflows using an input config.

The config determines which environments to train, how long to train them, and the KPI thresholds the training must meet.
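A config of this shape might look like the sketch below (the keys, environment name, and threshold values are illustrative assumptions, not the actual schema used in the PR):

```python
# Hypothetical benchmark config (illustrative only; not the actual schema).
BENCHMARK_CONFIG = {
    "mode": "full",  # "full" mode also emits the KPI payload for Grafana
    "environments": {
        "Isaac-Cartpole-v0": {
            "workflow": "rsl_rl",          # which training workflow to use
            "max_iterations": 150,         # how long to train
            "kpi_thresholds": {            # minimums the run must reach
                "mean_reward": 150.0,
                "mean_episode_length": 400.0,
            },
        },
    },
}
```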

Each training + evaluation run is a single pytest test, which fails if any of the thresholds are not met.
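One way to turn each configured environment into its own pytest test is parametrization; the sketch below assumes a hypothetical `train_and_evaluate` helper and config shape (not the actual test code in this PR):

```python
# Sketch: one pytest test per environment, failing if any KPI threshold
# is missed. `train_and_evaluate` is a hypothetical placeholder.
import pytest

CONFIG = {
    "Isaac-Cartpole-v0": {"mean_reward": 150.0},
}


def train_and_evaluate(env_name):
    # Placeholder: a real run would train a policy in `env_name`
    # and return the evaluation metrics.
    return {"mean_reward": 200.0}


@pytest.mark.parametrize("env_name", list(CONFIG))
def test_training_kpis(env_name):
    metrics = train_and_evaluate(env_name)
    for kpi, threshold in CONFIG[env_name].items():
        assert metrics[kpi] >= threshold, f"{env_name}: {kpi} below {threshold}"
```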

A KPI JSON can be created to summarize the training runs. When the config's full mode is enabled, this JSON also serves as a KPI payload that OSMO CI uploads to a Grafana dashboard every week, which can be used to track improvements and regressions.
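The summary/payload step could be sketched as below; the field names are illustrative assumptions, not the actual payload schema expected by the dashboard:

```python
# Sketch: build a KPI summary JSON from per-environment results.
# Field names are hypothetical, not the actual Grafana payload schema.
import json
import time


def build_kpi_payload(results):
    # `results` maps environment name -> dict of evaluated metrics.
    return {
        "timestamp": int(time.time()),
        "kpis": [{"env": env, "metrics": m} for env, m in results.items()],
    }


payload = build_kpi_payload({"Isaac-Cartpole-v0": {"mean_reward": 200.0}})
print(json.dumps(payload, indent=2))
```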

Dashboard - [LINK](https://grafana.nvidia.com/d/cejwy2w46gtmob/24b94e54-6b8b-50a4-969a-121dc573a34c?orgId=105)

Type of change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

Screenshots

Please attach before and after screenshots of the change if applicable.

Checklist

  • [x] I have run the pre-commit checks with ./isaaclab.sh --format
  • [ ] I have made corresponding changes to the documentation
  • [ ] My changes generate no new warnings
  • [ ] I have added tests that prove my fix is effective or that my feature works
  • [ ] I have updated the changelog and the corresponding version in the extension's config/extension.toml file
  • [ ] I have added my name to the CONTRIBUTORS.md or my name already exists there


@kellyguo11 kellyguo11 left a comment


Thanks for putting this together! Based on Alexander's concerns right now, let's add the regression tests to TESTS_TO_SKIP in test_settings.py for now and work with him to create a new job for these that can be manually triggered.

@kellyguo11 kellyguo11 changed the title adding training benchmark unit tests with input config Adds training benchmark unit tests with input config May 22, 2025
@kellyguo11 kellyguo11 merged commit 4737968 into main May 23, 2025
3 of 4 checks passed
@kellyguo11 kellyguo11 deleted the mtrepte/auto-regression-tests branch May 23, 2025 21:59
fan-ziqi pushed a commit to fan-ziqi/IsaacLab that referenced this pull request Jun 5, 2025
Co-authored-by: Kelly Guo <kellyg@nvidia.com>
Co-authored-by: Kelly Guo <kellyguo123@hotmail.com>
yrh012 pushed a commit to aica-technology/isaac-lab that referenced this pull request Jun 16, 2025
harry-wf-cheung pushed a commit to harry-wf-cheung/IsaacLab-Harry that referenced this pull request Jul 30, 2025