
Refactor of model - sampler interactions #398

Merged
merged 33 commits into secondmind-labs:develop on Dec 6, 2021

Conversation

henrymoss
Collaborator

This PR is a prerequisite for @sebastianober's MCEI acquisition function PR and for adding the functionality from our S-GP-TS paper.

Previously we had a set of models and a separate set of samplers (of varying types) defined alongside our acquisition functions. Certain samplers only worked with certain models, and we dealt with this by scattering error traps that checked the suitability of a model whenever we attempted to sample from it. This setup does not scale, especially now that we have @sebastianober's new deep GP models and their associated custom samplers.

We have three types of samplers:

  1. Reparameterization samplers: used when optimizing Monte-Carlo acquisition functions, i.e. BatchReparameterizationSampler and @sebastianober's new reparameterization sampler for deep GPs.
  2. Trajectory samplers: e.g. using random Fourier features to generate sample paths from models. Note that decoupled Thompson sampling (from GPflux) would also count as a trajectory sampler.
  3. Thompson samplers: samplers that return samples of the model's minimum value, e.g. Gumbel/ExactThompsonSampling.
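The three sampler families above could be captured as abstract interfaces along these lines. This is a minimal sketch: the class and method names are simplified approximations of the PR's interfaces, not the exact Trieste signatures.

```python
from abc import ABC, abstractmethod
from typing import Any, Callable, Sequence


class ReparametrizationSampler(ABC):
    """Used inside Monte-Carlo acquisition functions,
    e.g. a batch reparameterization sampler."""

    def __init__(self, sample_size: int, model: Any):
        self._sample_size = sample_size
        self._model = model

    @abstractmethod
    def sample(self, at: Sequence) -> Sequence:
        """Return reparameterized samples of the model output at ``at``."""


class TrajectorySampler(ABC):
    """Builds approximate sample paths from a model,
    e.g. via random Fourier features."""

    def __init__(self, model: Any):
        self._model = model

    @abstractmethod
    def get_trajectory(self) -> Callable:
        """Return a deterministic function sampled from the model."""


class ThompsonSampler(ABC):
    """Returns samples of the model's minimum value,
    e.g. Gumbel or exact Thompson sampling."""

    @abstractmethod
    def sample(self, model: Any, sample_size: int, at: Sequence) -> Sequence:
        """Return ``sample_size`` samples of the model minimum over ``at``."""
```

Splitting the hierarchy this way lets each acquisition function depend only on the sampler family it actually needs.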

This PR has two key parts:

  1. I have split our samplers into the three separate types explained above. Note that this change required enabling a jitter parameter for our IndependentReparameterizationSampler, which is probably a nice feature to have anyway (it makes it consistent with the batch version) and is perhaps not that contentious, as IndependentReparameterizationSampler is not actually used in our code base!
  2. I have created two new model methods, reparam_sampler and trajectory_sampler, which are defined when defining a new model and return the reparameterization and trajectory samplers relevant to that model. This makes the link between models and their supported samplers explicit, and has allowed me to move the relevant samplers into the models part of the code base. For example, the RFF sampler now lives near GaussianProcessRegression and BatchReparameterizationSampler now lives near GPFlowPredictor. Note that the Thompson samplers still live near the acquisition functions.
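In sketch form, the two new model methods look like this. The class bodies are hypothetical simplifications, but the default NotImplementedError behaviour mirrors the snippet shown in the review below, where a model that overrides the method opts in to the corresponding sampler.

```python
class ProbabilisticModel:
    """Base model: by default a model supports neither sampler type."""

    def reparam_sampler(self, num_samples: int):
        raise NotImplementedError(
            f"Model {self!r} does not have a reparametrization sampler"
        )

    def trajectory_sampler(self):
        raise NotImplementedError(
            f"Model {self!r} does not have a trajectory sampler"
        )


class BatchReparametrizationSampler:
    """Stand-in for the real sampler; stores its configuration only."""

    def __init__(self, sample_size: int, model: ProbabilisticModel):
        self.sample_size = sample_size
        self.model = model


class GaussianProcessRegression(ProbabilisticModel):
    """A model opts in by overriding the method with its supported sampler."""

    def reparam_sampler(self, num_samples: int) -> BatchReparametrizationSampler:
        return BatchReparametrizationSampler(num_samples, self)
```

A model that does not override a method fails loudly the moment a sampler is requested, rather than deep inside an acquisition function.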

Crucially, this PR paves the way for @sebastianober to define his custom GPflux trajectory/reparameterization samplers in a way that makes them easily accessible from the GPflux model, so that his new MCEI acquisition function can work with our existing models and our MC acquisition functions can work with his (i.e. they all just require a model with a model.reparam_sampler method, rather than a list of explicit names of supported models).

I have updated the sampling used within the MES and GIBBON acquisition functions and the discrete Thompson sampling rule to work with the new changes; however, this part of the code will become much nicer as soon as this PR is closed. Using the work from this PR, we can now just pass in our desired type of Thompson sampler when defining these functions/rules, which is more Pythonic and requires a lot less code.
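The "pass in your desired Thompson sampler" idea could be sketched as follows; the rule and sampler classes here are simplified placeholders, not the real Trieste implementations:

```python
class ExactThompsonSampler:
    """Placeholder exact sampler: picks the smallest candidate values."""

    def sample(self, model, sample_size, at):
        return sorted(at)[:sample_size]


class GumbelSampler:
    """Placeholder Gumbel-based sampler (the real one samples estimates
    of the model minimum rather than raw candidates)."""

    def sample(self, model, sample_size, at):
        return sorted(at)[:sample_size]


class DiscreteThompsonSampling:
    """The rule receives its Thompson sampler as an argument instead of
    branching on the model type internally."""

    def __init__(self, num_query_points: int, thompson_sampler=None):
        self._num_query_points = num_query_points
        self._sampler = thompson_sampler or ExactThompsonSampler()

    def acquire(self, model, candidate_points):
        return self._sampler.sample(
            model, self._num_query_points, candidate_points
        )
```

The rule no longer needs per-model error traps: swapping sampling strategies is just a constructor argument.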

@henrymoss henrymoss requested review from uri-granta, hstojic, sebastianober, vpicheny and apaleyes and removed request for uri-granta November 1, 2021 10:16
@uri-granta (Collaborator) left a comment:

(still looking, but a couple of early questions)

@sebastianober (Collaborator):

Looks good to me in general. My main issue is that TrajectorySampler implicitly seems to be assuming we'll use RFF (or similar) for all models, but I don't think this should necessarily be the case, and should allow for a broader range of trajectory samplers.

@henrymoss (Collaborator, Author):

> Looks good to me in general. My main issue is that TrajectorySampler implicitly seems to be assuming we'll use RFF (or similar) for all models, but I don't think this should necessarily be the case, and should allow for a broader range of trajectory samplers.

Yeah. The idea is that you define what sort of trajectory sampler you want when you define the model. For now this defaults to RFF, but as more methods become available (e.g. quadrature or even exact sampling) you will be able to choose between them. This is similar to GPflux, where you need to define your chosen kernel decomposition when building the model.
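The construction-time choice described in this reply might look like the sketch below. The sampler classes and the constructor parameter name are hypothetical; only the RFF default reflects the PR.

```python
class RFFTrajectorySampler:
    """Default: random Fourier feature trajectories."""
    name = "rff"


class QuadratureTrajectorySampler:
    """Hypothetical future alternative mentioned in the reply above."""
    name = "quadrature"


class GaussianProcessRegression:
    """The trajectory sampler type is chosen when the model is built,
    defaulting to RFF."""

    def __init__(self, gpflow_model, trajectory_sampler_cls=RFFTrajectorySampler):
        self._model = gpflow_model
        self._trajectory_sampler_cls = trajectory_sampler_cls

    def trajectory_sampler(self):
        # Instantiate whichever sampler family the user selected.
        return self._trajectory_sampler_cls()
```

This mirrors GPflux's approach of fixing the kernel decomposition at model-build time rather than at sampling time.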

@uri-granta (Collaborator) left a comment:
Reset function looks good; remaining question about the stack model.

@uri-granta (Collaborator) left a comment:
Just a couple more comments

@@ -27,9 +27,10 @@
 from ...data import Dataset
 from ...types import TensorType
 from ...utils import DEFAULTS, jit
-from ..interfaces import FastUpdateModel, TrainableProbabilisticModel
+from ..interfaces import FastUpdateModel, TrainableProbabilisticModel, TrajectorySampler
Collaborator:

Same as with FastUpdateModel, we should probably have ReparametrizationSamplerModel and TrajectorySamplerModel interfaces; then we can also be more precise in each acquisition function about what type of probabilistic model the user needs.

(Although we still need to check the existence of the methods at runtime, as users might not use mypy.)

"""
raise NotImplementedError(f"Model {self!r} does not have a reparametrization sampler")

def trajectory_sampler(self) -> TrajectorySampler:
Collaborator:

Now that we have the FastUpdateModel interface, we should probably move these methods to new interfaces that we would put here; see my comment above.

@@ -255,13 +254,21 @@ def prepare_acquisition_function(
     # hypervolume improvement in this area
     _partition_bounds = prepare_default_non_dominated_partition_bounds(_reference_pt, _pf.front)

-    sampler = BatchReparametrizationSampler(self._sample_size, model)
+    try:
+        sampler = model.reparam_sampler(self._sample_size)
Collaborator:
If we implement the ReparametrizationSamplerModel interface, then we can also be more precise above in specifying the type of the probabilistic model.

     ) # [S, 0]

-    def sample(self, at: TensorType) -> TensorType:
+    def sample(self, model: ProbabilisticModel, sample_size: int, at: TensorType) -> TensorType:
Collaborator:
If we implement the new interface, then we can be more precise here with TrajectorySamplerModel.

@henrymoss henrymoss merged commit 28b1651 into secondmind-labs:develop Dec 6, 2021