Add PreprocessingPipeline #3438

chrishalcrow · 2024-09-25T10:13:57Z

Add a PreprocessingPipeline class, which contains ordered preprocessing steps and their kwargs in a dictionary.

You can call apply_pipeline on a recording to make a preprocessed recording:

preprocessor_dict = {'bandpass_filter': {'freq_max': 3000}, 'common_reference': {}}

from spikeinterface.preprocessing import apply_pipeline
preprocessed_recording = apply_pipeline(recording, preprocessor_dict)

Under the hood, this uses the apply method of the new PreprocessingPipeline. Users can also use the class directly:

from spikeinterface.preprocessing import PreprocessingPipeline
pipeline = PreprocessingPipeline(preprocessor_dict)
# the pipeline already holds the steps, so only the recording is passed here
preprocessed_recording = pipeline.apply(recording)
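
To illustrate the idea of an ordered dictionary of steps and kwargs, here is a minimal, library-free sketch. It is not the implementation in this PR: `STEP_REGISTRY`, `scale`, `offset` and `apply_steps` are hypothetical names, and plain lists of floats stand in for recordings.

```python
from typing import Any, Callable, Dict, List

Signal = List[float]


def scale(signal: Signal, factor: float = 1.0) -> Signal:
    """Toy step: multiply every sample by `factor`."""
    return [x * factor for x in signal]


def offset(signal: Signal, value: float = 0.0) -> Signal:
    """Toy step: add `value` to every sample."""
    return [x + value for x in signal]


# Hypothetical registry mapping step names to the functions that implement them.
STEP_REGISTRY: Dict[str, Callable[..., Signal]] = {"scale": scale, "offset": offset}


def apply_steps(signal: Signal, steps: Dict[str, Dict[str, Any]]) -> Signal:
    """Apply each step in dictionary order, passing its kwargs to the step function."""
    for name, kwargs in steps.items():
        if name not in STEP_REGISTRY:
            raise KeyError(f"Unknown step: {name!r}")
        signal = STEP_REGISTRY[name](signal, **kwargs)
    return signal


toy_steps = {"scale": {"factor": 2.0}, "offset": {"value": 1.0}}
result = apply_steps([1.0, 2.0, 3.0], toy_steps)  # [3.0, 5.0, 7.0]
```

Because Python dicts preserve insertion order, swapping the two entries of `toy_steps` changes the result, which is why the order of the steps in the dictionary matters.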

This PR also adds a function that takes a provenance.json file and builds a preprocessor_dict from it, so it is easy to extract the preprocessing steps from a saved recording.

from spikeinterface.preprocessing import get_preprocessing_dict_from_json
my_dict = get_preprocessing_dict_from_json('/path/to/provenance.json')
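
As a rough sketch of how such an extraction could work, the example below walks a nested, provenance-like dictionary and collects the steps in the order they were applied. The layout is an assumption made for illustration (each node has a `class` and `kwargs`, with the parent stored under `kwargs["recording"]`), and `steps_from_provenance` and the `toy.*` class names are hypothetical. The real provenance.json may be structured differently.

```python
import json
from typing import Any, Dict, List, Tuple


def steps_from_provenance(node: Dict[str, Any]) -> Dict[str, Dict[str, Any]]:
    """Collect (step name -> kwargs) from a nested provenance-like dict.

    Assumes the outermost node is the last step applied and that each node
    stores its parent under kwargs["recording"]. Repeated step names would
    overwrite each other in this simplified version.
    """
    chain: List[Tuple[str, Dict[str, Any]]] = []
    while isinstance(node, dict) and "class" in node:
        kwargs = dict(node.get("kwargs", {}))
        parent = kwargs.pop("recording", None)
        chain.append((node["class"].split(".")[-1], kwargs))
        node = parent
    # The innermost node is the raw source, not a preprocessing step: drop it,
    # then reverse so the first-applied step comes first.
    return dict(reversed(chain[:-1]))


toy_provenance = json.loads("""
{
  "class": "toy.CommonReference",
  "kwargs": {
    "recording": {
      "class": "toy.BandpassFilter",
      "kwargs": {
        "freq_max": 3000,
        "recording": {"class": "toy.RawSource", "kwargs": {"file_path": "raw.bin"}}
      }
    }
  }
}
""")

toy_dict = steps_from_provenance(toy_provenance)
# {'BandpassFilter': {'freq_max': 3000}, 'CommonReference': {}}
```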

After loading this dict, you can either apply the precomputed kwargs it contains or ignore them and recompute them on application:

# Apply the precomputed kwargs, such as the `M` and `W` matrices from whitening:
pp_rec = apply_pipeline(rec, my_dict, ignore_precomputed_kwargs=False)
# Ignore the precomputed kwargs and recompute them on application:
pp_rec = apply_pipeline(rec, my_dict, ignore_precomputed_kwargs=True)
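
One way to picture what ignoring the precomputed kwargs means is to strip them from the dictionary before the steps are applied, so that each step recomputes them itself. The sketch below is only an illustration under that assumption: `PRECOMPUTABLE` and `drop_precomputed` are hypothetical names, and the PR may handle this differently.

```python
from typing import Any, Dict, Set

# Hypothetical table of kwargs that hold precomputed values, keyed by step name.
PRECOMPUTABLE: Dict[str, Set[str]] = {"whiten": {"W", "M"}}


def drop_precomputed(
    steps: Dict[str, Dict[str, Any]],
    precomputable: Dict[str, Set[str]] = PRECOMPUTABLE,
) -> Dict[str, Dict[str, Any]]:
    """Return a copy of `steps` without the precomputed kwargs; the input is not modified."""
    return {
        name: {k: v for k, v in kwargs.items() if k not in precomputable.get(name, set())}
        for name, kwargs in steps.items()
    }


loaded = {
    "bandpass_filter": {"freq_max": 3000},
    "whiten": {"W": [[1.0]], "M": [0.0], "dtype": "float32"},
}
recompute = drop_precomputed(loaded)
# {'bandpass_filter': {'freq_max': 3000}, 'whiten': {'dtype': 'float32'}}
```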

This PR allows for some cool things:

Users can pass a single dictionary to construct a preprocessed recording (as above). This completes the “dictionary workflow”, since dicts can already be used for sorting in run_sorter and for postprocessing in compute.
Users can easily visualise their preprocessing pipeline using the repr, including an HTML repr in Jupyter notebooks.
It increases portability between labs, and should make giving advice to users easier (from us, and from spike sorting developers), since we can just say "Oh, for KS4 NP2.0 we use this dict for preprocessing".
It increases the usefulness of our provenance system, since you can reconstruct human-readable preprocessing steps from the provenance.json file without the original recording (and without worrying about paths).
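
For the repr point above, a pipeline-like object can offer both a text repr and an HTML repr by defining `__repr__` and `_repr_html_`. The latter is the method Jupyter looks for when displaying an object. `ToyPipeline` below is a hypothetical stand-in, not the class added in this PR, and its output format is invented for illustration.

```python
from html import escape
from typing import Any, Dict


class ToyPipeline:
    """Hypothetical container of ordered steps, used only to show the two reprs."""

    def __init__(self, steps: Dict[str, Dict[str, Any]]):
        self.steps = dict(steps)

    def __repr__(self) -> str:
        lines = [f"{type(self).__name__}: {len(self.steps)} step(s)"]
        for i, (name, kwargs) in enumerate(self.steps.items(), start=1):
            args = ", ".join(f"{k}={v!r}" for k, v in kwargs.items())
            lines.append(f"  {i}. {name}({args})")
        return "\n".join(lines)

    def _repr_html_(self) -> str:
        # Jupyter calls this method, if present, to render the object as HTML.
        items = "".join(
            f"<li><code>{escape(name)}</code>: {escape(repr(kwargs))}</li>"
            for name, kwargs in self.steps.items()
        )
        return f"<ol>{items}</ol>"


toy = ToyPipeline({"bandpass_filter": {"freq_max": 3000}, "common_reference": {}})
print(toy)
```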

The repr currently looks like this:

chrishalcrow added 2 commits September 25, 2024 11:06

add PreprocessingPipeline

d7bb297

Merge branch 'main' into preprocessing-pipeline

d0e74f7

chrishalcrow added enhancement New feature or request preprocessing Related to preprocessing module labels Sep 25, 2024

alejoe91 modified the milestone: 0.101.2 Oct 1, 2024

chrishalcrow and others added 4 commits December 6, 2024 15:57

add motion correct and nice html repr

8252f8f

add preprocessing names_to_funcitons dict

8436eb2

delete pp_name_to_function

5c19765

Merge branch 'main' into preprocessing-pipeline

5e202c3

chrishalcrow mentioned this pull request Feb 18, 2025

Add parents in HTML representation and always print class name #3700

Merged

chrishalcrow and others added 7 commits February 25, 2025 09:26

Unifty rerp with Extractors

36fa35a

Merge branch 'main' into preprocessing-pipeline

26eac1f

refactor

95de48f

add future

a3070fd

add first test

3b13d57

add tests and docs

5bcca60

test and doc improvements

2cc22a4

chrishalcrow mentioned this pull request Jun 3, 2025

Remove classes from extractor and preprocessing __init__ #3898

Merged

alejoe91 mentioned this pull request Jun 6, 2025

Save the preprocessing pipelines for simple reuse in curation GUI #1103

Open
