How to specify perturbation function #2575

samuelkim16 · 2024-10-15T12:47:40Z

samuelkim16
Oct 15, 2024

I want to specify a function for the perturbation_set argument in InputPerturbation. I am using this for risk-adverse optimization, following closely the tutorial here. The documentation says that

Alternatively, this can be a callable that returns batch x n_p x d-dim tensor of perturbations for input of shape batch x d.

It works fine on training the model. However, when optimizing the acquisition function, instead of passing an input of shape batch x d, it passes an input of shape batch x a x d, where I don't know what this extra dimension corresponds to. This results in a shape error.

For context, this is the function I am using for perturbation_set:

def perturb_input(x):
    # draw_sobol_normal_samples returns size (n, d)
    # Draw random samples and expand batch dimension
    x_perturb = draw_sobol_normal_samples(d=problem_dim, n=N_W, **tkwargs)[None, :, :]
    # size (batch, n_p, d)
    x_perturb = x_perturb.repeat(x.shape[0], 1, 1)

    # perturbn dim0
    dim0_std = 0.05 * x[..., 0:1] / 2
    x_perturb[:, :, 0] *= dim0_std

    # perturb dim1
    x_perturb[:, :, 1] *= 0.01

    return x_perturb

Here is the training function (mostly taken from the tutorial) and optimization function:

bounds = torch.stack([torch.zeros(problem_dim), torch.ones(problem_dim)]).to(**tkwargs)

def train_model(train_X: Tensor, train_Y: Tensor) -> SingleTaskGP:
    r"""Returns a `SingleTaskGP` model trained on the inputs"""
    intf = InputPerturbation(
        # perturbation_set=draw_sobol_normal_samples(d=problem_dim, n=N_W, **tkwargs) * STD_DEV,
        perturbation_set=perturb_input,
        bounds=bounds,
    )
    model = SingleTaskGP(
        train_X, train_Y, input_transform=intf, outcome_transform=Standardize(m=1)
    )
    mll = ExactMarginalLogLikelihood(model.likelihood, model)
    fit_gpytorch_mll(mll)
    return model

risk_measure = VaR(alpha=ALPHA, n_w=N_W)

def optimize_acqf_and_get_observation():
    r"""Optimizes the acquisition function, and returns a new candidate and observation."""
    acqf = qNoisyExpectedImprovement(
        model=model,
        X_baseline=train_X,
        sampler=SobolQMCNormalSampler(sample_shape=torch.Size([128])),
        objective=risk_measure,
        prune_baseline=True,
    )

    candidate, _ = optimize_acqf(
        acq_function=acqf,
        bounds=bounds,
        q=BATCH_SIZE,
        num_restarts=NUM_RESTARTS,
        raw_samples=RAW_SAMPLES,
    )

    new_observations = fitness_fun(candidate)
    return candidate, new_observations

I would expect the input shape to perturb_input is [5, 2] but during optimization, the input shape is [128, 5, 2].

Here is the full error:

---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
Cell In[15], line 6
      4 print(f"Starting iteration {i}, total time: {time() - start_time:.3f} seconds.")
      5 # optimize the acquisition function and get the observations
----> 6 candidate, observations = optimize_acqf_and_get_observation()
      8 # update the model with new observations
      9 train_X = torch.cat([train_X, candidate], dim=0)

Cell In[14], line 13
      4 r"""Optimizes the acquisition function, and returns a new candidate and observation."""
      5 acqf = qNoisyExpectedImprovement(
      6     model=model,
      7     X_baseline=train_X,
   (...)
     10     prune_baseline=True,
     11 )
---> 13 candidate, _ = optimize_acqf(
     14     acq_function=acqf,
     15     bounds=bounds,
     16     q=BATCH_SIZE,
     17     num_restarts=NUM_RESTARTS,
     18     raw_samples=RAW_SAMPLES,
     19 )
     21 new_observations = fitness_fun(candidate)
     22 return candidate, new_observations

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\optim\optimize.py:567, in optimize_acqf(acq_function, bounds, q, num_restarts, raw_samples, options, inequality_constraints, equality_constraints, nonlinear_inequality_constraints, fixed_features, post_processing_func, batch_initial_conditions, return_best_only, gen_candidates, sequential, ic_generator, timeout_sec, return_full_tree, retry_on_optimization_warning, **ic_gen_kwargs)
    544     gen_candidates = gen_candidates_scipy
    545 opt_acqf_inputs = OptimizeAcqfInputs(
    546     acq_function=acq_function,
    547     bounds=bounds,
   (...)
    565     ic_gen_kwargs=ic_gen_kwargs,
    566 )
--> 567 return _optimize_acqf(opt_acqf_inputs)

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\optim\optimize.py:588, in _optimize_acqf(opt_inputs)
    585     return _optimize_acqf_sequential_q(opt_inputs=opt_inputs)
    587 # Batch optimization (including the case q=1)
--> 588 return _optimize_acqf_batch(opt_inputs=opt_inputs)

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\optim\optimize.py:275, in _optimize_acqf_batch(opt_inputs)
    272     batch_initial_conditions = opt_inputs.batch_initial_conditions
    273 else:
    274     # pyre-ignore[28]: Unexpected keyword argument `acq_function` to anonymous call.
--> 275     batch_initial_conditions = opt_inputs.get_ic_generator()(
    276         acq_function=opt_inputs.acq_function,
    277         bounds=opt_inputs.bounds,
    278         q=opt_inputs.q,
    279         num_restarts=opt_inputs.num_restarts,
    280         raw_samples=opt_inputs.raw_samples,
    281         fixed_features=opt_inputs.fixed_features,
    282         options=options,
    283         inequality_constraints=opt_inputs.inequality_constraints,
    284         equality_constraints=opt_inputs.equality_constraints,
    285         **opt_inputs.ic_gen_kwargs,
    286     )
    288 batch_limit: int = options.get(
    289     "batch_limit",
    290     (
   (...)
    294     ),
    295 )
    297 def _optimize_batch_candidates() -> Tuple[Tensor, Tensor, List[Warning]]:

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\optim\initializers.py:417, in gen_batch_initial_conditions(acq_function, bounds, q, num_restarts, raw_samples, fixed_features, options, inequality_constraints, equality_constraints, generator, fixed_X_fantasies)
    415 while start_idx < X_rnd.shape[0]:
    416     end_idx = min(start_idx + batch_limit, X_rnd.shape[0])
--> 417     Y_rnd_curr = acq_function(
    418         X_rnd[start_idx:end_idx].to(device=device)
    419     ).cpu()
    420     Y_rnd_list.append(Y_rnd_curr)
    421     start_idx += batch_limit

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\torch\nn\modules\module.py:1511, in Module._wrapped_call_impl(self, *args, **kwargs)
   1509     return self._compiled_call_impl(*args, **kwargs)  # type: ignore[misc]
   1510 else:
-> 1511     return self._call_impl(*args, **kwargs)

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\torch\nn\modules\module.py:1520, in Module._call_impl(self, *args, **kwargs)
   1515 # If we don't have any hooks, we want to skip the rest of the logic in
   1516 # this function, and just call forward.
   1517 if not (self._backward_hooks or self._backward_pre_hooks or self._forward_hooks or self._forward_pre_hooks
   1518         or _global_backward_pre_hooks or _global_backward_hooks
   1519         or _global_forward_hooks or _global_forward_pre_hooks):
-> 1520     return forward_call(*args, **kwargs)
   1522 try:
   1523     result = None

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\utils\transforms.py:305, in concatenate_pending_points.<locals>.decorated(cls, X, **kwargs)
    303 if cls.X_pending is not None:
    304     X = torch.cat([X, match_batch_shape(cls.X_pending, X)], dim=-2)
--> 305 return method(cls, X, **kwargs)

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\utils\transforms.py:259, in t_batch_mode_transform.<locals>.decorator.<locals>.decorated(acqf, X, *args, **kwargs)
    257 # add t-batch dim
    258 X = X if X.dim() > 2 else X.unsqueeze(0)
--> 259 output = method(acqf, X, *args, **kwargs)
    260 if hasattr(acqf, "model") and is_ensemble(acqf.model):
    261     # IDEA: this could be wrapped into SampleReducingMCAcquisitionFunction
    262     output = (
    263         output.mean(dim=-1) if not acqf._log else logmeanexp(output, dim=-1)
    264     )

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\acquisition\monte_carlo.py:274, in SampleReducingMCAcquisitionFunction.forward(self, X)
    254 @concatenate_pending_points
    255 @t_batch_mode_transform()
    256 def forward(self, X: Tensor) -> Tensor:
    257     r"""Computes the acquisition value associated with the input `X`. Weighs the
    258     acquisition utility values by smoothed constraint indicators if `constraints`
    259     was passed to the constructor of the class. Applies `self.sample_reduction` and
   (...)
    272         batch shape of model and input `X`.
    273     """
--> 274     non_reduced_acqval = self._non_reduced_forward(X=X)
    275     return self._sample_reduction(self._q_reduction(non_reduced_acqval))

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\acquisition\monte_carlo.py:287, in SampleReducingMCAcquisitionFunction._non_reduced_forward(self, X)
    277 def _non_reduced_forward(self, X: Tensor) -> Tensor:
    278     """Compute the constrained acquisition values at the MC-sample, q level.
    279 
    280     Args:
   (...)
    285         A Tensor with shape `sample_sample x batch_shape x q`.
    286     """
--> 287     samples, obj = self._get_samples_and_objectives(X)
    288     samples = repeat_to_match_aug_dim(target_tensor=samples, reference_tensor=obj)
    289     acqval = self._sample_forward(obj)  # `sample_sample x batch_shape x q`

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\acquisition\monte_carlo.py:597, in qNoisyExpectedImprovement._get_samples_and_objectives(self, X)
    594 X_full = torch.cat([match_batch_shape(self.X_baseline, X), X], dim=-2)
    595 # TODO: Implement more efficient way to compute posterior over both training and
    596 # test points in GPyTorch (https://github.com/cornellius-gp/gpytorch/issues/567)
--> 597 posterior = self.model.posterior(
    598     X_full, posterior_transform=self.posterior_transform
    599 )
    600 if not self._cache_root:
    601     samples_full = super().get_posterior_samples(posterior)

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\models\gpytorch.py:385, in BatchedMultiOutputGPyTorchModel.posterior(self, X, output_indices, observation_noise, posterior_transform, **kwargs)
    382 self.eval()  # make sure model is in eval mode
    383 # input transforms are applied at `posterior` in `eval` mode, and at
    384 # `model.forward()` at the training time
--> 385 X = self.transform_inputs(X)
    386 with gpt_posterior_settings():
    387     # insert a dimension for the output dimension
    388     if self._num_outputs > 1:

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\models\model.py:229, in Model.transform_inputs(self, X, input_transform)
    227     return input_transform(X)
    228 try:
--> 229     return self.input_transform(X)
    230 except AttributeError:
    231     return X

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\torch\nn\modules\module.py:1511, in Module._wrapped_call_impl(self, *args, **kwargs)
   1509     return self._compiled_call_impl(*args, **kwargs)  # type: ignore[misc]
   1510 else:
-> 1511     return self._call_impl(*args, **kwargs)

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\torch\nn\modules\module.py:1520, in Module._call_impl(self, *args, **kwargs)
   1515 # If we don't have any hooks, we want to skip the rest of the logic in
   1516 # this function, and just call forward.
   1517 if not (self._backward_hooks or self._backward_pre_hooks or self._forward_hooks or self._forward_pre_hooks
   1518         or _global_backward_pre_hooks or _global_backward_hooks
   1519         or _global_forward_hooks or _global_forward_pre_hooks):
-> 1520     return forward_call(*args, **kwargs)
   1522 try:
   1523     result = None

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\models\transforms\input.py:78, in InputTransform.forward(self, X)
     76 elif self.transform_on_eval:
     77     if fantasize.off() or self.transform_on_fantasize:
---> 78         return self.transform(X)
     79 return X

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\models\transforms\input.py:1417, in InputPerturbation.transform(self, X)
   1395 r"""Transform the inputs by adding `perturbation_set` to each input.
   1396 
   1397 For each `1 x d`-dim element in the input tensor, this will produce
   (...)
   1411     A `batch_shape x (q * n_p) x d`-dim tensor of perturbed inputs.
   1412 """
   1413 # NOTE: If we had access to n_p without evaluating _perturbations when the
   1414 # perturbation_set is a function, we could move this into `_transform`.
   1415 # Further, we could remove the two `transpose` calls below if one were
   1416 # willing to accept a different ordering of the transformed output.
-> 1417 self._perturbations = self._expanded_perturbations(X)
   1418 # make space for n_p dimension, switch n_p with n after transform, and flatten.
   1419 return self._transform(X.unsqueeze(-3)).transpose(-3, -2).flatten(-3, -2)

File c:\Users\username\AppData\Local\anaconda3\envs\envname\Lib\site-packages\botorch\models\transforms\input.py:1444, in InputPerturbation._expanded_perturbations(self, X)
   1442     p = p.expand(X.shape[-2], *p.shape)  # p is batch_shape x n x n_p x d
   1443 else:
-> 1444     p = p(X) if self.indices is None else p(X[..., self.indices])
   1445 return p.transpose(-3, -2)

Cell In[10], line 56
     54 # perturbn dim0
     55 dim0_std = 0.05 * x[..., 0:1] / 2
---> 56 x_perturb[:, :, 0] *= dim0_std
     58 # perturb dim1
     59 x_perturb[:, :, 1] *= 0.01

RuntimeError: The size of tensor a (128) must match the size of tensor b (5) at non-singleton dimension 1

Answered by saitcakmak

Oct 15, 2024

Hi @samuelkim16.

I would expect the input shape to perturb_input is [5, 2] but during optimization, the input shape is [128, 5, 2].

In BoTorch, we use batch evaluations within acquisition functions.

        perturbation_set: An `n_p x d`-dim tensor denoting the perturbations

            to be added to the inputs. Alternatively, this can be a callable that
            returns `batch x n_p x d`-dim tensor of perturbations for input of
            shape `batch x d`. This is useful for heteroscedastic perturbations.

The batch x d here refers to an arbitrary batch shape, which in your case is 128 x 5. You should update perturb_input to support batch inputs. Here's an (untested) example o…

View full answer

saitcakmak · 2024-10-15T15:12:31Z

saitcakmak
Oct 15, 2024
Collaborator

Hi @samuelkim16.

I would expect the input shape to perturb_input is [5, 2] but during optimization, the input shape is [128, 5, 2].

In BoTorch, we use batch evaluations within acquisition functions.

        perturbation_set: An `n_p x d`-dim tensor denoting the perturbations

            to be added to the inputs. Alternatively, this can be a callable that
            returns `batch x n_p x d`-dim tensor of perturbations for input of
            shape `batch x d`. This is useful for heteroscedastic perturbations.

The batch x d here refers to an arbitrary batch shape, which in your case is 128 x 5. You should update perturb_input to support batch inputs. Here's an (untested) example of what this could look like:

def perturb_input(x):
    batch_shape = x.shape[:-1]
    # draw_sobol_normal_samples returns size (n, d)
    # Draw random samples and expand batch dimension
    x_perturb = draw_sobol_normal_samples(d=problem_dim, n=N_W, **tkwargs)
    # size (batch, n_p, d)
    x_perturb = x_perturb.repeat(*batch_shape, 1, 1)

    # perturb dim0
    dim0_std = 0.05 * x[..., 0:1] / 2
    x_perturb[..., 0] = x_perturb * dim0_std

    # perturb dim1
    x_perturb[..., 1] = x_perturb[..., 1] * 0.01

    return x_perturb

I am also avoiding in-place modification of tensors here since it can lead to issues with autograd.

3 replies

samuelkim16 Oct 15, 2024
Author

Great, thank you!

For posterity, my perturbation function now looks something like this:

def perturb_input(x):
    batch_shape = x.shape[:-1]
    # draw_sobol_normal_samples returns size (n, d)
    # Draw random samples and expand batch dimension
    x_perturb = draw_sobol_normal_samples(d=problem_dim, n=N_W, **tkwargs)
    # size (batch, n_p, d)
    x_perturb = x_perturb.repeat(*batch_shape, 1, 1)

    # perturb dim0
    dim0_std = 0.05 * x[..., 0] / 2
    x_perturb[..., 0] = x_perturb[..., 0].clone() * dim0_std[..., None]

    # perturb dim1
    x_perturb[..., 1] = x_perturb[..., 1].clone() * 0.01

    return x_perturb

where .clone() had to be added to avoid the in-place modifications.

saitcakmak Oct 15, 2024
Collaborator

I guess x_perturb will later get added to x to create a new tensor that retains grad history from x (x_perturb does not have grad up to that point), so in-place modifications should not matter here.

samuelkim16 Oct 15, 2024
Author

It was giving me an error on in-place modifications at those two lines, which is why I added the .clone().

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to specify perturbation function #2575

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment 3 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

How to specify perturbation function #2575

Uh oh!

Uh oh!

samuelkim16 Oct 15, 2024

Replies: 1 comment · 3 replies

Uh oh!

saitcakmak Oct 15, 2024 Collaborator

Uh oh!

samuelkim16 Oct 15, 2024 Author

Uh oh!

saitcakmak Oct 15, 2024 Collaborator

Uh oh!

samuelkim16 Oct 15, 2024 Author

samuelkim16
Oct 15, 2024

Replies: 1 comment 3 replies

saitcakmak
Oct 15, 2024
Collaborator

samuelkim16 Oct 15, 2024
Author

saitcakmak Oct 15, 2024
Collaborator

samuelkim16 Oct 15, 2024
Author