Preserving train inputs and targets through transforms #3044
hvarfner pushed a series of commits to hvarfner/botorch that referenced this pull request (Oct 14–16, 2025), force-pushing 2466e2b → e17f003 → 24f101c and later 9460560 → a4163d0. Each commit message repeats the PR summary below; from Oct 15 onward it also notes a notebook explaining the effect with some plots (N8342965) and "Reviewed By: Balandat".
Codecov Report: ✅ All modified and coverable lines are covered by tests.

Coverage Diff (main vs. #3044):
Coverage  99.98% → 99.98%
Files     216 → 216
Lines     20581 → 20635 (+54)
Hits      20577 → 20631 (+54)
Misses    4 → 4
Further commits referencing this pull request were pushed on Oct 17, 2025, force-pushing a4163d0 → 50dc3ba → 9109fe4 → 9341715 → 82223a3 → ce7c08f → 2fdfe78 → c47e45c, again with the PR summary as the commit message; the final version lists "Reviewed By: Balandat, saitcakmak".
This pull request has been merged in b0d492d.
Labels: CLA Signed · Do not delete this pull request or issue due to inactivity. · fb-exported · Merged · meta-exported
Summary:
This PR preserves botorch transforms (specifically outcome_transforms, like Standardize) through state_dict loading. The fix also ensures that train_targets of a Leave-one-out model with outcome transforms will, in the default case, have the same targets as a base model, minus the point left out.
Longer explanation:
Transforms, and specifically learnable outcome transforms like Standardize, currently:
a. Learn their parameters at initialization of the GP
b. Transform the train_Ys to the standardized space
Then, when we load a state dict, we:
a. Impose new standardization parameters on already-standardized data
b. Potentially make the transforms re-learnable, nullifying the change made by the state dict
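A minimal sketch of the two steps above (illustrative data and shapes; not code from this PR), using the public BoTorch API:

```python
import torch
from botorch.models import SingleTaskGP
from botorch.models.transforms.outcome import Standardize

train_X = torch.rand(10, 2, dtype=torch.double)
train_Y = torch.randn(10, 1, dtype=torch.double)

# (a) Standardize learns its means/stdvs when the GP is constructed,
# (b) and the model's train_targets are stored in the standardized space:
model = SingleTaskGP(train_X, train_Y, outcome_transform=Standardize(m=1))
print(model.train_targets.mean(), model.train_targets.std())  # ~0.0 and ~1.0

# Loading a state dict from another model now overwrites the Standardize
# buffers (means/stdvs) while train_targets stay standardized under the
# old parameters -- the mismatch described above.
```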
This has undesired consequences for cross-validation: all cross-validated models effectively see different training data. In essence, we don't simply leave one point out; we leave one out and re-standardize. When the data contain outliers, this leads to substantially different predictions whenever the outlier is left out, since the outlier strongly influences the outcome transform parameters.
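To see why outliers make this worse, here is a hedged illustration (synthetic data) of how much a single outlier moves the learned Standardize parameters when it is left out:

```python
import torch
from botorch.models.transforms.outcome import Standardize

Y = torch.randn(20, 1, dtype=torch.double)
Y[0] = 100.0  # a single outlier

full = Standardize(m=1)
full(Y)      # learns means/stdvs from all 20 points
loo = Standardize(m=1)
loo(Y[1:])   # learns means/stdvs with the outlier left out

# The two transforms map the *shared* 19 points to very different values,
# so the LOO model's training data is not just "one point fewer".
print(full.means, full.stdvs)
print(loo.means, loo.stdvs)
```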
TODO:
- Account for non-invertible transforms

Differential Revision: D84571407
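A hedged check of the invariant this PR targets (illustrative; leaving out the last point for simplicity): after the fix, a leave-one-out model that loads the base model's state dict should carry the base model's train_targets, minus the held-out point.

```python
import torch
from botorch.models import SingleTaskGP
from botorch.models.transforms.outcome import Standardize

train_X = torch.rand(10, 2, dtype=torch.double)
train_Y = torch.randn(10, 1, dtype=torch.double)

base = SingleTaskGP(train_X, train_Y, outcome_transform=Standardize(m=1))
loo = SingleTaskGP(
    train_X[:-1], train_Y[:-1], outcome_transform=Standardize(m=1)
)
loo.load_state_dict(base.state_dict())

# With this fix, the LOO targets should match the base targets with the
# held-out point removed; before it, they differed because the LOO model
# re-standardized its subset of the data.
assert torch.allclose(loo.train_targets, base.train_targets[..., :-1])
```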