Add KeOps MMD detector #548

arnaudvl · 2022-07-06T11:13:42Z

Add MMD detector using the KeOps (PyTorch) backend to further accelerate drift detection and scale up to larger datasets. This PR needs to be made compatible with the optional dependency management (incl. #538 and related).

This PR includes:

Once this PR is merged, it will be followed up by a similar implementation for the Learned (Deep) Kernel detector.

review-notebook-app · 2022-07-06T13:20:29Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

ascillitoe · 2022-07-14T09:17:24Z

@arnaudvl I shall resolve these conflicts and then review once we have #537 merged.

I'm also adding @mauicv for review specifically to check my additions wrt to optional dependency handling (once I've added!)

ascillitoe · 2022-07-14T09:20:07Z

Additional note: Need to check suitable error is raised when passed to save_detector. Implementing save/load functionality can be left to a future PR.

jklaise

Please refer to my comments.

alibi_detect/cd/keops/mmd.py

ascillitoe

Generally looks very nice!

Just a few minor comments on my end, plus I have not investigated the final task of testing sigma_mean in batch and non-batch settings.

README.md

doc/source/cd/methods/mmddrift.ipynb

alibi_detect/cd/keops/tests/test_mmd_keops.py

ascillitoe · 2022-08-15T17:24:32Z

alibi_detect/cd/mmd.py

+        batch_size_permutations
+            KeOps computes the n_permutations of the MMD^2 statistics in chunks of batch_size_permutations.
+            Only relevant for 'keops' backend.


Maybe we should open an issue for this?

ascillitoe · 2022-08-15T17:33:45Z

alibi_detect/cd/mmd.py

+        elif backend == 'pytorch' and has_pytorch:
+            pop_kwargs += ['batch_size_permutations']
+            detector = MMDDriftTorch
+        else:


I mistakingly opened a new issue #576 (comment) for this as thought these "if statements" were already present. Since they are actually added here (for the new pop_kwargs bit), it would make more sense to fix here.

The has_tensorflow and has_pytorch are unnecessary as BackendValidator should have already raised an error if backend='tensorflow' and has_tensorflow=False, or the PyTorch equivalent.

ascillitoe · 2022-08-15T17:40:11Z

alibi_detect/cd/pytorch/mmd.py

@@ -89,7 +89,7 @@ def __init__(
        # initialize kernel
        sigma = torch.from_numpy(sigma).to(self.device) if isinstance(sigma,  # type: ignore[assignment]
                                                                      np.ndarray) else None
-        self.kernel = kernel(sigma) if kernel == GaussianRBF else kernel
+        self.kernel = kernel(sigma).to(self.device) if kernel == GaussianRBF else kernel


Not sure we actually fixed this?? I have opened an issue (#586) and will fix it tomorrow.

alibi_detect/utils/frameworks.py

ascillitoe · 2022-08-15T17:59:50Z

alibi_detect/utils/keops/kernels.py

+        return self.log_sigma.exp()
+
+    def forward(self, x: LazyTensor, y: LazyTensor, infer_sigma: bool = False) -> LazyTensor:
+


I agree with opening an issue since this applies to all kernels (even if the issue is just to review docstring conventions with the conclusion being keep-as-is!).

codecov-commenter · 2022-08-16T15:00:52Z

Codecov Report

❗ No coverage uploaded for pull request base (master@ed519e3). Click here to learn what that means.
The diff coverage is n/a.

@@            Coverage Diff            @@
##             master     #548   +/-   ##
=========================================
  Coverage          ?   83.51%           
=========================================
  Files             ?      207           
  Lines             ?    13777           
  Branches          ?        0           
=========================================
  Hits              ?    11506           
  Misses            ?     2271           
  Partials          ?        0

jklaise

LGTM!

arnaudvl added 9 commits June 24, 2022 15:43

first commit keops

b322dbf

update kernel and mmd keops

29ad4d9

allow multiple kernel bandwidths for keops

dd2d13e

fix bug

0a7944c

update mmd

17d1662

remove learned kernel and base kernel_matrix MMD function

cf528f7

unify batched mmd2

38ec19b

update keops mmd

e943c6c

update docs and kernel import

0487cb2

bugfixes

a6a4641

arnaudvl changed the title ~~Add KeOps MMD detector~~ WIP: Add KeOps MMD detector Jul 8, 2022

arnaudvl added 2 commits July 8, 2022 14:38

remove unused imports

e2b27f5

add benchmarking example

d442be8

arnaudvl requested review from ascillitoe and jklaise July 8, 2022 13:50

arnaudvl added 5 commits July 8, 2022 14:54

update test mmd

244ceb2

add test mmd keops

f913f5b

update readme

2da4a9a

bugfix kernel and update mmd test

c49f1d2

remove print from test

75481cc

arnaudvl changed the title ~~WIP: Add KeOps MMD detector~~ [WIP] Add KeOps MMD detector Jul 8, 2022

update keops tests

e6996b9

arnaudvl changed the title ~~[WIP] Add KeOps MMD detector~~ Add KeOps MMD detector Jul 8, 2022

ascillitoe requested a review from mauicv July 14, 2022 09:17

ascillitoe added 3 commits July 26, 2022 16:19

Merge master and resolve conflicts

afda208

Add save warning and update tests

c87109f

Update setup and associated docs

eb307b6

jklaise suggested changes Jul 29, 2022

View reviewed changes

mauicv mentioned this pull request Jul 29, 2022

Document ERROR_TYPES in missing_optional_dependencies #577

Closed

ascillitoe added 3 commits July 29, 2022 14:23

Skip keops tests on MacOS

74ff992

Add note to docs about linux-only support for keops

7c5e70d

Add batch_size_permutations to pydantic models

fc12b9d

mauicv reviewed Aug 8, 2022

View reviewed changes

alibi_detect/cd/keops/mmd.py Show resolved Hide resolved

arnaudvl added 12 commits August 9, 2022 11:41

remove print

f6b331b

remove unnecessary comment

ace20cc

change default bandwidth fn to None

718fb85

update infer sigma

5922e3f

update test warning, update and clarify keops kernels logic

b8adfbe

clean up

015cc5e

update docstring

148019a

fix bug

7453368

undo unnecessary kwarg removal

2d88bfc

make test consistent with torch/tf backends

54df257

add _mmd2 test

211eeb9

remove unused import

f98fd83

ascillitoe approved these changes Aug 15, 2022

View reviewed changes

mauicv mentioned this pull request Aug 16, 2022

Correct import checks in framework.py #590

Open

arnaudvl added 2 commits August 16, 2022 15:13

clarify docs, remove redundant framework checks

751d3a0

remove print

7c2d781

arnaudvl added 4 commits August 16, 2022 16:19

update docs keops

3f69740

batched version of sigma_mean part 1

ac5fe64

remove unused import

4ce018b

update keops kernels test

95634a1

jklaise approved these changes Aug 18, 2022

View reviewed changes

arnaudvl merged commit 705b718 into SeldonIO:master Aug 19, 2022

arnaudvl mentioned this pull request Aug 26, 2022

Learned kernel MMD with KeOps backend #602

Merged

9 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add KeOps MMD detector #548

Add KeOps MMD detector #548

arnaudvl commented Jul 6, 2022 •

edited

Loading

review-notebook-app bot commented Jul 6, 2022

ascillitoe commented Jul 14, 2022 •

edited

Loading

ascillitoe commented Jul 14, 2022

jklaise left a comment

ascillitoe left a comment

ascillitoe Aug 15, 2022

ascillitoe Aug 15, 2022

ascillitoe Aug 15, 2022

ascillitoe Aug 15, 2022

codecov-commenter commented Aug 16, 2022 •

edited

Loading

jklaise left a comment

		return self.log_sigma.exp()

		def forward(self, x: LazyTensor, y: LazyTensor, infer_sigma: bool = False) -> LazyTensor:

Add KeOps MMD detector #548

Add KeOps MMD detector #548

Conversation

arnaudvl commented Jul 6, 2022 • edited Loading

review-notebook-app bot commented Jul 6, 2022

ascillitoe commented Jul 14, 2022 • edited Loading

ascillitoe commented Jul 14, 2022

jklaise left a comment

Choose a reason for hiding this comment

ascillitoe left a comment

Choose a reason for hiding this comment

ascillitoe Aug 15, 2022

Choose a reason for hiding this comment

ascillitoe Aug 15, 2022

Choose a reason for hiding this comment

ascillitoe Aug 15, 2022

Choose a reason for hiding this comment

ascillitoe Aug 15, 2022

Choose a reason for hiding this comment

codecov-commenter commented Aug 16, 2022 • edited Loading

Codecov Report

jklaise left a comment

Choose a reason for hiding this comment

arnaudvl commented Jul 6, 2022 •

edited

Loading

ascillitoe commented Jul 14, 2022 •

edited

Loading

codecov-commenter commented Aug 16, 2022 •

edited

Loading