[REF TEST] Rework DWI EPI pipeline #902

Merged

Conversation

NicolasGensollen
Member

@NicolasGensollen NicolasGensollen commented Mar 31, 2023

This PR proposes to refactor the epi_pipeline which is a sub-workflow of the dwi_preprocessing_using_t1 pipeline. The "Description" section below gives an overview of the changes made in this PR and explains the reasons behind them.

Description

The base idea was to split the epi_pipeline into two workflows, perform_ants_registration and perform_dwi_epi_correction, in order to simplify things.
The epi_pipeline now does nothing more than call these two sub-workflows with proper parameter passing.

These workflows are tested in isolation and therefore now have proper writing capabilities, implemented through the output_dir argument. When a value is provided for this argument, a data sink node is added at the end of the workflow and the workflow's outputs are written to the corresponding location.
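As a rough sketch of this pattern (not Clinica's actual code; the function and argument names here are made up for illustration), the idea is that the sink step is only attached when output_dir is given:

```python
from pathlib import Path

def run_workflow(outputs, output_dir=None):
    """Toy sketch: the workflow computes its outputs, and only when
    output_dir is provided is a 'data sink' step appended that writes
    them to the corresponding location."""
    if output_dir is not None:
        sink = Path(output_dir)
        sink.mkdir(parents=True, exist_ok=True)
        for name, content in outputs.items():
            (sink / name).write_text(content)
    return outputs
```

In the real pipelines this is done with a Nipype DataSink node connected to the workflow's output node, but the control flow is the same: no output_dir, no writing.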

Perform ants registration

Testing the perform_ants_registration pipeline gave me some trouble because it relies on the ANTs RegistrationSynQuick utility, which is stochastic. In order to be able to compare the output to a reference value, I had to add a random_seed parameter, exposed to the user through the CLI, leveraging the recent work done in #916. This means that the non-regression test of perform_ants_registration only works with ANTs versions recent enough to support the random seed option.
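For illustration (this is not Clinica's code, and the helper name is invented): recent antsRegistrationSyNQuick.sh versions accept a -e flag to fix the random seed, so a seed forwarded from the CLI could be appended to the command only when set, keeping older ANTs versions usable when no seed is requested:

```python
def build_syn_quick_command(fixed, moving, output_prefix, random_seed=None):
    """Sketch: assemble an antsRegistrationSyNQuick.sh call, adding the
    -e flag (fix random seed) only when a seed is explicitly given."""
    cmd = [
        "antsRegistrationSyNQuick.sh",
        "-d", "3",            # 3D images
        "-f", fixed,          # fixed (reference) image
        "-m", moving,         # moving image
        "-o", output_prefix,  # prefix for output files
    ]
    if random_seed is not None:
        cmd += ["-e", str(random_seed)]
    return cmd
```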

Perform dwi epi correction

The perform_dwi_epi_correction pipeline is where things get difficult. The pipeline works on each DWI direction separately and produces intermediary data for each of them (warped image, warp field, Jacobian...). The amount of intermediary data produced is quite large, mostly for the following reasons:

  • The number of DWI directions (139 with the PREVDEMALS data used in the CI)
  • The data type, which is float64 by default

We could tackle the first point by switching to a new test dataset with fewer directions, but that is not considered in this PR; we can always do it later, independently of the changes made here.

For the second point, I used the fact that some (though unfortunately not all) utilities expose a parameter controlling precision. The main problem comes from the AntsApplyTransforms utility, which only exposes a float boolean flag defaulting to False (i.e. computations are done in double, 64-bit precision). If set to True, it uses 32-bit precision. I therefore added a use_double_precision boolean parameter to control this. I haven't exposed it to users through the CLI, but we can do so if needed.
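Some back-of-the-envelope arithmetic (not from the PR itself, just the numbers mentioned in it) shows why the precision matters. Each (208, 256, 256) volume holds about 13.6 million voxels, and with 139 directions a single set of per-direction volumes already occupies roughly 14 GiB at float64, before counting the multiple files (warped image, warp field, Jacobian...) produced per direction:

```python
# Rough size estimate for per-direction intermediary data,
# using the dimensions from this PR: 208x256x256 volumes, 139 directions.
voxels_per_volume = 208 * 256 * 256   # ~13.6 million voxels
n_directions = 139

def gib(n_bytes):
    """Convert a byte count to GiB."""
    return n_bytes / 2**30

float64_total = voxels_per_volume * n_directions * 8  # 8 bytes per voxel
float32_total = voxels_per_volume * n_directions * 4  # 4 bytes per voxel
```

Switching to 32-bit precision halves the footprint, which mitigates the problem but does not make it disappear.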

This mitigates the issue to some extent, but the pipeline still needs a lot of disk space to run. The delete_cache option, implemented in #766, enables the pipeline to clean these intermediary data at the end of the perform_dwi_epi_correction pipeline. Nonetheless, if the pipeline crashes a few times without cleaning up, it will very quickly fill up the temp folder of any machine we use for CI.

A possible improvement that I didn't explore would be to specify a working directory for the ANTs utilities (which is different from Clinica's working_dir), such that they would write their intermediary data to a dedicated mass-storage volume.

The perform_dwi_epi_correction workflow was previously using a custom function to call antsApplyTransforms as a subprocess. This function was deleted in favor of the Nipype interface, which simplifies the code.

The perform_dwi_epi_correction workflow produces a single output: a (208, 256, 256, 139) NIfTI image weighing 5.8 GB when using 64-bit precision. The non-regression test compares this output to a reference value. Because of the size of the image, the usual similarity_measure function we use for tests was crashing (I suppose because of memory issues, but I haven't investigated much...). I ended up implementing a few testing utility functions in clinica.utils.testing_utils to compare two NIfTI images. The assert_large_nifti_almost_equal function compares each pair of (208, 256, 256) volumes iteratively, only loading two such volumes into memory at a time. This is still very slow, so I might end up sampling volumes instead of comparing them all...
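The idea behind the volume-by-volume comparison can be sketched as follows (this is an illustration, not the actual code in clinica.utils.testing_utils; in the real function the inputs would be lazy nibabel proxies such as image.dataobj, so that slicing only materializes one volume at a time):

```python
import numpy as np

def assert_4d_almost_equal_by_volume(img1, img2, decimal=6):
    """Compare two 4D images one 3D volume at a time, so that only two
    volumes need to be held in memory simultaneously. img1/img2 can be
    anything sliceable along the last axis (numpy arrays, nibabel
    array proxies, ...)."""
    assert img1.shape == img2.shape, "shape mismatch"
    for t in range(img1.shape[-1]):
        vol1 = np.asarray(img1[..., t])  # materialize a single volume
        vol2 = np.asarray(img2[..., t])
        np.testing.assert_array_almost_equal(vol1, vol2, decimal=decimal)
```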

Required data for testing

Requires data PR https://github.com/aramis-lab/clinica_data_ci/pull/51

@NicolasGensollen NicolasGensollen force-pushed the rework-dwi-epi-pipeline branch 4 times, most recently from 294c093 to 7c5874c Compare July 6, 2023 07:13
@NicolasGensollen NicolasGensollen marked this pull request as ready for review July 7, 2023 09:42
@NicolasGensollen
Member Author

Alright, I believe this is finally ready for review.
The PR is quite large, so I did my best to explain the different changes and the reasons behind them in the opening message of the PR.
Let me know if anything is unclear.

Contributor

@MatthieuJoulot MatthieuJoulot left a comment

LGTM ! Just one little thing !
Also, I will test with the new data.

Copy link
Contributor

@MatthieuJoulot MatthieuJoulot left a comment

LGTM !

@NicolasGensollen NicolasGensollen merged commit ada5b61 into aramis-lab:dev Oct 1, 2023
18 checks passed
@NicolasGensollen NicolasGensollen deleted the rework-dwi-epi-pipeline branch October 1, 2023 09:56