feat: add differential abundance for scvi-tools models #3618

lordy5 · 2025-11-24T20:35:33Z

@ori-kron-wis @canergen I've finished fixing some issues with the code and have now added tests. Let me know if there's anything else I should do.

for more information, see https://pre-commit.ci

codecov · 2025-11-24T20:44:38Z

Codecov Report

❌ Patch coverage is 95.34884% with 2 lines in your changes missing coverage. Please review.
✅ Project coverage is 74.92%. Comparing base (7ce63e0) to head (210ddf0).

Files with missing lines	Patch %	Lines
src/scvi/model/base/_base_model.py	33.33%	2 Missing ⚠️

❗ There is a different number of reports uploaded between BASE (7ce63e0) and HEAD (210ddf0). Click for more details.

HEAD has 118 uploads less than BASE

Flag BASE (7ce63e0) HEAD (210ddf0)

121 3

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #3618      +/-   ##
==========================================
- Coverage   84.70%   74.92%   -9.78%     
==========================================
  Files         225      225              
  Lines       21637    21680      +43     
==========================================
- Hits        18327    16244    -2083     
- Misses       3310     5436    +2126

Files with missing lines	Coverage Δ
src/scvi/model/base/_vaemixin.py	`92.24% <100.00%> (-2.14%)`	⬇️
src/scvi/model/base/_base_model.py	`76.02% <33.33%> (-5.04%)`	⬇️

... and 44 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

…data

for more information, see https://pre-commit.ci

ori-kron-wis

Thanks @lordy5, I have a few questions/clarifications, mainly engineering wise.
The general idea is to do as much code reuse, add more tests and have the support for the generalization of this function to all models in a proper way.

So far, we had the DA support for CytoVi and MrVI only.

Is this implementation here should be general enough for all models?
If this is the case, I would expect this PR to have more tests, for other models as well. I would at least add tests for SCANVI, TOTALVI (with mudata input) and a spatial model (RESOLVI, SCVIVA). This is something users are going to try out and lets make sure it works.
If theres no support for specific models I would add a warning of "DA is currently not supported" - This is something we did for get_normalized_expression function.
However, (3) was achieved because get_normalized_expression and for that manner, differential_expression, are implemented in _vaemixin.py. So model that does not inherit this mixin will not have it and the message of "not implemented" will appear. In your implementation all is implemented directly in base path in da_pytoch, but like _de_core, we might be missing the wrapper to the DA in _vaemixin.py (like we do for DE) - so this is how I would recommend it to be implemented, otherwise it is has less control and will likely break for users who try out anything.
So, seems CytoVi and mrVI added their own things on top of your DA, its fine and we have many cases that a model expand on a base/mixin function, question is whether your implementation can support those models implementations with minimum additions, if so, much code can be removed (we want to reuse as much as possible). In such case need to update those models as well and validate for correctness in their tutorials.
(5) is also relvant as @canergen responded to a user that we are going to have the memory saving version of DA in MRVI soon (#3624 (comment)) however as I wrote right now, the implementation here has no connection to the MRVI implementation unless we will do something about it and we should (update mrvi's DA or have this one supports it).
Add changlog

canergen · 2025-12-14T10:22:19Z

Some comments

Is this implementation here should be general enough for all models?
Yes, we should check with @florianingelfinger for cytoVI though.

If this is the case, I would expect this PR to have more tests, for other models as well. I would at least add tests for SCANVI, TOTALVI (with mudata input) and a spatial model (RESOLVI, SCVIVA). This is something users are going to try out and lets make sure it works.
Sounds good. Let’s not blast up test time though.

If theres no support for specific models I would add a warning of "DA is currently not supported" - This is something we did for get_normalized_expression function.
It should be for all models except Stereoscope and CondSCVI/DestVI.

However, (3) was achieved because get_normalized_expression and for that manner, differential_expression, are implemented in _rnamixin.py. So model that does not inherit this mixin will not have it and the message of "not implemented" will appear. In your implementation all is implemented directly in base path in da_pytoch, but like _de_core, we might be missing the wrapper to the DA in _rnamixin.py (like we do for DE) - so this is how I would recommend it to be implemented, otherwise it is has less control and will likely break for users who try out anything.
It shouldn’t be in rnamixin but yes we should also print an error.

So, seems CytoVi and mrVI added their own things on top of your DA, its fine and we have many cases that a model expand on a base/mixin function, question is whether your implementation can support those models implementations with minimum additions, if so, much code can be removed (we want to reuse as much as possible). In such case need to update those models as well and validate for correctness in their tutorials.
The MrVI changes don’t need to be reflected. For Jax it requires its own function but I wouldn’t write a wrapper to have same API in Jax.

lordy5 · 2025-12-18T08:13:45Z

Hi Ori and Can,
Thanks for the feedback and thoughts. I was also thinking about many of these things.

@ori-kron-wis

Yes however I haven't tested with other models yet as you said, and I guess we're not sure about cytovi yet.
Sounds, good, I will go ahead and add tests for other models. I was planning to, but wasn't sure which models it would be relevant for. I think I know which models to test with now after Can's feedback above.
Makes sense.
Yes, I definitely agree giving an error like (3) is a good way to handle unsupported models. However, I don't fully understand which way you're suggesting to do this. Is it that I should keep the base DA functionality in da_pytorch, and then make a wrapper in one of the mixins? Then for models that don't inherit from these mixins, there will be an error when trying to call DA? If that's the case, where would make the most sense to put this DA wrapper?
I'll take a look at the CytoVI implementation but I'm pretty sure I can use my implementation to remove duplicate code. But just to clarify, @canergen, were you saying above that the DA for MrVI should be standalone?
Ok will make sure to add that.

florianingelfinger · 2026-01-15T14:48:00Z

I think for CytoVI I would keep the DA as is (otherwise there may be some adaptation needed). I would also expose the aggregation function as for many cases the logsumexp favors high DA scores for outlier samples compared to sth like the logmedian.

add torch differential abundance code

1ca3a7d

lordy5 added the on-merge: backport to 1.4.x on-merge: backport to 1.4.x label Nov 24, 2025

[pre-commit.ci] auto fixes from pre-commit.com hooks

a2f0b0d

for more information, see https://pre-commit.ci

lordy5 and others added 20 commits November 27, 2025 09:34

Merge branch 'main' into alex-differential-abundance

f3fdaa3

refactor: remove downsample param in favor of passing in a subset ann…

5ce218b

…data

[pre-commit.ci] auto fixes from pre-commit.com hooks

e328829

for more information, see https://pre-commit.ci

fix: use adata instead of self.adata

848bc9a

feat: add anndata dataloader in case user passes in a subset anndata

894fc9e

remove use of return_dist param in get_latent_representation

6c6efae

[pre-commit.ci] auto fixes from pre-commit.com hooks

6afa9db

for more information, see https://pre-commit.ci

fix: update method headers

1711505

tests: add skeleton for da tests

66abadc

use batch_size param in call of get_aggregated_posterior

4c0d207

remove sample param from get_aggregated_posterior

9b130d0

tests: finish da tests

fc6c9be

Merge branch 'main' into alex-differential-abundance

d02d681

tests: fix calls to DA and AP methods

e1c58c6

tests: fix da import

1bdf79f

tests: fix da testing and da code to pass tests

949f2ac

tests: fix da function to pass tests

51beb40

tests: pass in numpy arrays for indices, fix indices check

0e55d88

tests: fix empty numpy array in ap test

c153dc6

tests: add else statement for when key is None in DA test

39b89fc

lordy5 marked this pull request as ready for review December 14, 2025 00:13

lordy5 requested review from canergen and ori-kron-wis December 14, 2025 00:13

ori-kron-wis reviewed Dec 14, 2025

View reviewed changes

lordy5 added 16 commits January 6, 2026 18:17

Merge branch 'main' into alex-differential-abundance

7e5d3b4

move DA code to vaemixin

2febf39

update tests

e6f8cf7

add DA not implemented error to BaseModelClass

fe2fad0

fix DA errors

210ddf0

add DA tests for more models

c80b6c6

add DA support for MuData

852628b

Merge branch 'main' into alex-differential-abundance

0eead29

finish mudata tests

08d5c8a

fix tests

8dbd706

Merge branch 'main' into alex-differential-abundance

1603931

add error to ensure samples are categorical

4c5d564

fix DA test for spatial models

b908217

remove raised error from da function

bd353ab

fix labels key in scanvi test

8768d8b

fix totalvi test

cd443d3

lordy5 added 4 commits January 19, 2026 16:44

Merge branch 'main' into alex-differential-abundance

003d526

fix totalvi da test

68e1344

modify resolvi test

c310dad

fix branch logic

7f6d3d7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add differential abundance for scvi-tools models #3618

feat: add differential abundance for scvi-tools models #3618

Uh oh!

lordy5 commented Nov 24, 2025 •

edited

Loading

Uh oh!

codecov bot commented Nov 24, 2025 •

edited

Loading

Uh oh!

ori-kron-wis left a comment •

edited

Loading

Uh oh!

canergen commented Dec 14, 2025

Uh oh!

lordy5 commented Dec 18, 2025

Uh oh!

florianingelfinger commented Jan 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

feat: add differential abundance for scvi-tools models #3618

Are you sure you want to change the base?

feat: add differential abundance for scvi-tools models #3618

Uh oh!

Conversation

lordy5 commented Nov 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Nov 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

ori-kron-wis left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

canergen commented Dec 14, 2025

Uh oh!

lordy5 commented Dec 18, 2025

Uh oh!

florianingelfinger commented Jan 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

lordy5 commented Nov 24, 2025 •

edited

Loading

codecov bot commented Nov 24, 2025 •

edited

Loading

ori-kron-wis left a comment •

edited

Loading