Unify flow matching and score-based models #1497

Merged (77 commits, Mar 25, 2025)

Conversation

@StarostinV (Collaborator) commented on Mar 19, 2025

What does this PR do?

This PR unifies score-based models with flow matching under a common API. Furthermore, based on the paper https://arxiv.org/abs/2410.02217, the vector field learned with flow matching can be used to compute the time-dependent score and marginal distributions, as well as the drift and diffusion functions for SDE-based sampling. This effectively enables all score-based methods (SDE-based sampling, iid inference, gradient evaluation, MAP with score-based gradients, guidance, etc.) to be used with flow matching.
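As a rough illustration of the conversion the paper enables (not the PR's actual implementation): for an affine Gaussian probability path x_t = alpha_t * x_1 + sigma_t * eps, the marginal score can be recovered from the learned vector field by rearranging the known identity between the two. A minimal sketch; the function name, argument layout, and time convention are assumptions:

import torch


def score_from_vector_field(
    v: torch.Tensor,  # learned vector field v_t(x)
    x: torch.Tensor,  # current state
    alpha: float,     # path scale alpha_t
    d_alpha: float,   # time derivative of alpha_t
    sigma: float,     # path noise scale sigma_t
    d_sigma: float,   # time derivative of sigma_t
) -> torch.Tensor:
    """Recover the time-dependent score from a flow-matching vector field.

    Assumes an affine Gaussian path x_t = alpha_t * x_1 + sigma_t * eps, for which
        v_t(x) = (d_alpha / alpha) * x
                 + sigma**2 * (d_alpha / alpha - d_sigma / sigma) * score_t(x),
    so the score follows by rearranging. Names and conventions are illustrative only.
    """
    ratio = d_alpha / alpha
    # The denominator vanishes for scale-only paths where d_alpha / alpha == d_sigma / sigma.
    denom = sigma**2 * (ratio - d_sigma / sigma)
    return (v - ratio * x) / denom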

The API of FMPE and NPSE did not change, but they are now wrappers around the same class, VectorFieldInference, and differ only in the following (a usage sketch follows the list):

  • the default sample_with value in the build_posterior method ("sde" for NPSE, "ode" for FMPE),
  • the estimator builder (FlowMatchingEstimator and ScoreBasedEstimator, both subclasses of ConditionalVectorFieldEstimator),
  • the loss function and the concrete SDE and ODE functions provided by the estimator.
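A minimal usage sketch of the two wrappers, assuming the standard sbi training loop; the exact import paths, the toy simulator, and the defaults shown here are assumptions and may differ from the final PR:

import torch
from sbi.inference import FMPE, NPSE
from sbi.utils import BoxUniform

# Toy setup: 2-d parameters and a trivial Gaussian "simulator".
prior = BoxUniform(low=-2 * torch.ones(2), high=2 * torch.ones(2))
theta = prior.sample((1000,))
x = theta + 0.1 * torch.randn_like(theta)

# Flow matching: ODE-based sampling is the default.
fmpe = FMPE(prior=prior)
fmpe.append_simulations(theta, x).train()
posterior_ode = fmpe.build_posterior(sample_with="ode")

# Score-based: SDE-based sampling is the default, but the same options apply.
npse = NPSE(prior=prior)
npse.append_simulations(theta, x).train()
posterior_sde = npse.build_posterior(sample_with="sde")

samples = posterior_ode.sample((100,), x=x[:1])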

Additionally, the ODE solver is isolated into a separate API, which makes it possible to swap ODE backends; so far, only the zuko backend is implemented.
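A backend-agnostic solver interface might look roughly like the sketch below; the class and method names here are hypothetical, not the ones introduced by the PR (the actual zuko-backed implementation lives under sbi/samplers/ode_solvers/):

from abc import ABC, abstractmethod
from typing import Callable

import torch


class NeuralODESolver(ABC):
    """Integrates dx/dt = v(x, t) for a learned vector field (hypothetical interface)."""

    def __init__(self, vector_field: Callable[[torch.Tensor, torch.Tensor], torch.Tensor]):
        self.vector_field = vector_field

    @abstractmethod
    def solve(self, x0: torch.Tensor, t0: float, t1: float) -> torch.Tensor:
        """Return the state at t1, starting from x0 at t0."""


class EulerSolver(NeuralODESolver):
    """A toy fixed-step backend; a zuko-backed solver would plug in the same way."""

    def __init__(self, vector_field, num_steps: int = 100):
        super().__init__(vector_field)
        self.num_steps = num_steps

    def solve(self, x0: torch.Tensor, t0: float, t1: float) -> torch.Tensor:
        x = x0
        dt = (t1 - t0) / self.num_steps
        for i in range(self.num_steps):
            t = torch.as_tensor(t0 + i * dt)
            x = x + dt * self.vector_field(x, t)
        return x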

Does this close any issues?

#1440 and #1462

Anything else we should know?

The following features have been implemented:

  • Add ode_solvers to make ODE backends swappable
  • Extend the API of ConditionalVectorFieldEstimator to enable SDE-based sampling (see the interface sketch after this list)
  • Make FlowMatchingEstimator a subclass of ConditionalVectorFieldEstimator
  • Implement score-based methods in FlowMatchingEstimator
  • Implement unified VectorFieldPotential, VectorFieldPosterior, and VectorFieldInference classes
  • Update the FMPE and NPSE classes
  • Update score_fn_iid to support the new API
  • Remove tests that rely on the old API and on DirectPosterior for flow matching
  • Improve documentation
  • Add tests for the new functionality
  • Add benchmark tests
  • Remove the previous implementation of score-based methods
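For orientation, here is a hypothetical sketch of the kind of interface such a unified estimator could expose; all method names below are assumptions for illustration, not the PR's exact signatures:

from abc import ABC, abstractmethod

import torch
from torch import nn


class VectorFieldEstimatorSketch(nn.Module, ABC):
    """A single estimator exposing both ODE- and SDE-related quantities (hypothetical)."""

    @abstractmethod
    def forward(self, theta: torch.Tensor, x: torch.Tensor, t: torch.Tensor) -> torch.Tensor:
        """Vector field v(theta, t | x), used for ODE-based sampling."""

    @abstractmethod
    def score(self, theta: torch.Tensor, x: torch.Tensor, t: torch.Tensor) -> torch.Tensor:
        """Time-dependent score; for flow matching it is derived from the vector field."""

    @abstractmethod
    def drift(self, theta: torch.Tensor, t: torch.Tensor) -> torch.Tensor:
        """Drift term of the sampling SDE."""

    @abstractmethod
    def diffusion(self, theta: torch.Tensor, t: torch.Tensor) -> torch.Tensor:
        """Diffusion term of the sampling SDE."""

    @abstractmethod
    def loss(self, theta: torch.Tensor, x: torch.Tensor) -> torch.Tensor:
        """Training loss (flow-matching or denoising score-matching objective)."""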

Related issues

✅ Checklist

Put an x in the boxes that apply. If you're unsure about any of them, no worries — just ask!

  • I have read and followed the contribution guidelines
  • I have added helpful comments to my code where needed
  • I have added tests for new functionality
  • (If applicable) I have reported how long new tests run and marked them with pytest.mark.slow

For reviewers:

  • I have reviewed every file
  • All comments have been addressed

…ple_and_log_prob

The test relies on the sample and log_prob methods of the estimators; FlowMatchingEstimator does not implement these methods.
VectorFieldPosterior does not yet support norm_posterior.
The test assumes DirectPosterior and does not support the new vector field implementation of FMPE.
@manuelgloeckler marked this pull request as ready for review on March 20, 2025, 08:21
@janfb (Contributor) commented on Mar 25, 2025

> @janfb @manuelgloeckler I have some good news: the iid methods work great with flow matching! However, so far the metrics in (slow iid) tests are not so great, and that is because of the neural network architecture. I trained a model independently using a very simple architecture and c2st is much better for iid sampling with flow matching. That is another PR though.

that's great! but what is the difference in the NN architecture between ours and your simple one?

@janfb (Contributor) left a comment

Thanks again @StarostinV for another round of big effort! This looks almost done now 👍

Added a couple of minor comments and questions.

@StarostinV requested a review from janfb on March 25, 2025, 16:46
@StarostinV (Collaborator, Author) commented:

> @janfb @manuelgloeckler I have some good news: the iid methods work great with flow matching! However, so far the metrics in (slow iid) tests are not so great, and that is because of the neural network architecture. I trained a model independently using a very simple architecture and c2st is much better for iid sampling with flow matching. That is another PR though.
>
> that's great! but what is the difference in the NN architecture between ours and your simple one?

I haven't looked into the current architecture, but we ran some quick tests during the hackathon and it was clear that the current flow-matching architecture performs substantially worse than the simple one below. However, this could also be due to the number of parameters. It will become clear after unifying the net builders for scores and flows.

I just use an MLP with skip connections and time embeddings for tests, but I wouldn't advertise it since it could be improved in many ways :) I'll just put it here for reference:

import torch
from torch import nn

# VectorFieldNet and SinusoidalTimeEmbedding are assumed to come from the existing
# sbi neural-net code; they are not defined in this snippet.


class SimpleNet(VectorFieldNet):
    def __init__(
        self,
        in_dim: int = 2,
        condition_dim: int = 2,
        out_dim: int = 2,
        hid_dim: int = 256,
        time_emb_dim: int = 16,
        num_blocks: int = 3,
    ):
        super().__init__()
        self.time_embedding = SinusoidalTimeEmbedding(time_emb_dim)

        # First block takes [theta, condition, time embedding]; the rest are hidden-to-hidden.
        in_dims = [in_dim + time_emb_dim + condition_dim] + [hid_dim] * (num_blocks - 1)
        out_dims = [hid_dim] * (num_blocks - 1) + [out_dim]

        self.net = nn.Sequential(
            *[
                ResidualBlock(block_in, block_out, hid_dim=hid_dim)
                for block_in, block_out in zip(in_dims, out_dims)
            ],
        )

    def forward(self, theta: torch.Tensor, x: torch.Tensor, t: torch.Tensor) -> torch.Tensor:
        h = torch.cat([theta, x, self.time_embedding(t)], dim=-1)
        return self.net(h)


class ResidualBlock(nn.Module):
    def __init__(self, in_dim: int, out_dim: int, hid_dim: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, hid_dim),
            nn.LeakyReLU(),
            nn.LayerNorm(hid_dim),
            nn.Linear(hid_dim, out_dim),
        )

        # Linear skip connection so that input and output dimensions may differ.
        self.residual = nn.Linear(in_dim, out_dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x) + self.residual(x)
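For completeness, a hypothetical shape check of the network above; it assumes SinusoidalTimeEmbedding maps a batch of times to a (batch, time_emb_dim) tensor:

net = SimpleNet(in_dim=2, condition_dim=3, out_dim=2)
theta = torch.randn(16, 2)
x_cond = torch.randn(16, 3)
t = torch.rand(16)
v = net(theta, x_cond, t)  # expected shape: (16, 2)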

@janfb (Contributor) left a comment

Heroic effort @StarostinV 🏅 🚀

Looks all good now, thanks for your patience with all my comments! 🙏
Looking forward to seeing this in action.

codecov bot commented on Apr 25, 2025

Codecov Report

Attention: Patch coverage is 90.73171% with 38 lines in your changes missing coverage. Please review.

Project coverage is 79.56%. Comparing base (8900ca0) to head (369477d).
Report is 33 commits behind head on main.

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| ...i/neural_nets/estimators/flowmatching_estimator.py | 85.45% | 8 Missing ⚠️ |
| .../inference/trainers/npse/vector_field_inference.py | 95.42% | 7 Missing ⚠️ |
| sbi/neural_nets/estimators/base.py | 75.00% | 7 Missing ⚠️ |
| sbi/inference/potentials/vector_field_potential.py | 75.00% | 6 Missing ⚠️ |
| sbi/inference/potentials/score_fn_iid.py | 80.00% | 5 Missing ⚠️ |
| sbi/inference/posteriors/vector_field_posterior.py | 91.30% | 2 Missing ⚠️ |
| sbi/samplers/ode_solvers/base.py | 92.85% | 2 Missing ⚠️ |
| sbi/samplers/ode_solvers/ode_builder.py | 85.71% | 1 Missing ⚠️ |
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1497      +/-   ##
==========================================
- Coverage   89.45%   79.56%   -9.90%     
==========================================
  Files         128      133       +5     
  Lines       10170    10201      +31     
==========================================
- Hits         9098     8116     -982     
- Misses       1072     2085    +1013     
| Flag | Coverage Δ |
| --- | --- |
| unittests | 79.56% <90.73%> (-9.90%) ⬇️ |

Flags with carried forward coverage won't be shown.

| Files with missing lines | Coverage Δ |
| --- | --- |
| sbi/diagnostics/sbc.py | 92.95% <100.00%> (ø) |
| sbi/inference/__init__.py | 100.00% <ø> (ø) |
| sbi/inference/posteriors/__init__.py | 100.00% <100.00%> (ø) |
| sbi/inference/potentials/__init__.py | 100.00% <100.00%> (ø) |
| sbi/inference/trainers/fmpe/fmpe.py | 100.00% <100.00%> (+5.71%) ⬆️ |
| sbi/inference/trainers/npse/npse.py | 100.00% <100.00%> (+2.79%) ⬆️ |
| sbi/neural_nets/estimators/__init__.py | 100.00% <ø> (ø) |
| sbi/neural_nets/estimators/score_estimator.py | 93.25% <100.00%> (+0.35%) ⬆️ |
| sbi/samplers/ode_solvers/__init__.py | 100.00% <100.00%> (ø) |
| sbi/samplers/ode_solvers/zuko_ode.py | 100.00% <100.00%> (ø) |

... and 10 more

... and 33 files with indirect coverage changes

Labels: hackathon, score-matching-performance (Improving the performance of score- and flow-matching methods)
Projects: None yet
Development: Successfully merging this pull request may close these issues.
3 participants