Update stats plots, add longitudinal sample size calculation #98

PaulBautin · 2021-01-25T17:07:21Z

This PR intends to homogenize notations and conventions between the graphs presented in the manuscript and the "csa_atrophy" repo.

Done:

Update plots: sample_size, error_function_of_csa
Update legends: boxplot_atrophy, boxplot_csa
Draw bissection automatically on the boxplot_atrophy graph
Add graph showing error in function of CSA
Add longitudinal study sample size computation and update ref for sample size computation
Update README

FIX #95, FIX #92, FIX #100, FIX #80, FIX #103

- update plots: sample_size, error_function_of_csa - update legends: boxplot_atrophy, boxplot_csa

- expected trend and mean values on CSA boxplot - mean values on atrophy boxplot - add function for adding pearson's r and p-value stats - add diff and std diff column in dataframe for sample size computation - add longitudinal study sample size change: - correct difference between means (atrophy_% * CSA) on sample size plot - correct x_label on error_function_of_CSA - correct x_label on error_function_of_intra_cov - automatize the detection of outliers on error_function_of_intra_cov_outlier

# Conflicts: # csa_rescale_stat.py

…ore this diff was computed from mean CSA across transformations)

PaulBautin · 2021-05-13T21:43:13Z

Up to now the computed longitudinal sample sizes were small (< 1). After investigation, we observed that the SD of the difference of measured CSA across subjects were computed using mean CSA across transformations ex:
diff(sI, rX) = Mean[CSA(sI, r1, :)] - Mean[CSA(sI, rX, :)] (1)
This had for consequence that the SD of differences did not take into account the variability due to transformations.

Implemented in 7874439, contrary to formula (1) program does not mean CSA across transformations but randomly samples a CSA value for each subject. ex:
diff(sI, rX) = CSA(sI, r1, tY) - CSA(sI, rX, tZ) (2)
The results with this method are much closer to what was previously found in literature. However, it also puts in evidence a large variability in longitudinal sample sizes due to transformations. This is surprising because the mean intra-subject SD is relatively small (hence, I would not expect such an influence of the transformation-related variability).

jcohenadad · 2021-05-14T19:59:27Z

csa_rescale_stat.py

+    df_sub['perc_error'] = 100 * (df_sub['mean'] - df_sub['theoretic_csa']).div(df_sub['theoretic_csa'])
+    diff = []
+    for rescale, group in df.groupby('rescale'):
+        for sub, subgroup in group.groupby('subject'):


add comment/explanation

I've added explanations for the sample size function in commit f50f18a

- add comments

PaulBautin · 2021-05-20T15:36:07Z

With commit 05d7fcd using difference formula (tY and tZ are two different transforms):

diff(sI, rX) = CSA(sI, r1, tY) - CSA(sI, rX, tZ) (2)

longitudinal sample size variability is relatively important

Also note that, when looking at between-group differences (vs. paired differences as described above), the formula was also updated as follows:

CSA(sI, rX) = CSA(sI, rX, tZ) and not CSA(sI, rX) = MEAN[CSA(sI, rX, :)]

Results seem to vary much less: SD of sample size no more than 3% of sample size (between groups).

Therefore, my best guess is that the important variability found computing longitudinal sample sizes are mostly due to the variability of CSA measures between scalings (which has already been shown in article).

@jcohenadad, should we continue with these results? My idea is now to keep the Monte Carlo simulations for both sample size computations.

jcohenadad · 2021-05-20T17:15:53Z

@PaulBautin This is an interesting investigation but I need more guidance to understand the formula described in #98 (comment). Without the context of the code I cannot advise on what is the most appropriate solution. I suggest we discuss it in a meeting.

- normalize by the square of rescale for df['Normalized CSA in mm²'] - use poly1d when plotting trends in plots

- append fake values for pearson and and p_value for rescale =1

- un-comment concatenating csv files

- change iteration number for computing sample size and print message

PaulBautin · 2021-05-27T13:13:16Z

@jcohenadad, could you review? I think this PR is ready to be merged (plots in PR match plots in article).

PaulBautin · 2021-07-09T15:59:56Z

@jcohenadad, could you review? This PR should be merged into master because plots and stats for the article are based on this PR.

jcohenadad · 2021-07-12T22:19:31Z

sorry-- realistically i will not have time to review

update plots to match manuscript

fdd81a1

- update plots: sample_size, error_function_of_csa - update legends: boxplot_atrophy, boxplot_csa

PaulBautin marked this pull request as draft January 25, 2021 17:22

PaulBautin added 4 commits March 11, 2021 13:09

- update README for longitudinal study sample size

d48a7b8

update ref in csa_rescale_stat for sample size computation

645dcc1

correct rescale_estimated_subject in README

f4f5c4b

PaulBautin marked this pull request as ready for review March 11, 2021 18:27

jcohenadad changed the title ~~Update plots to match manuscript~~ Update stats plots, add longitudinal sample size calculation Mar 17, 2021

jcohenadad and others added 11 commits March 17, 2021 17:01

Create output folder if does not exist

79e1269

change std to var in sample size

08b1d01

Added TODOs

c0f0596

Merge remote-tracking branch 'origin/graph' into graph

deb15c6

# Conflicts: # csa_rescale_stat.py

- correct sample size formula

de28fc3

- correct diff formula by integrating transformation variability (bef…

f72e107

…ore this diff was computed from mean CSA across transformations)

- correct sample size formula

d39cc25

- remove replacement in diff formula for df_sub

ab54398

- use absolute for mean diff and make difference subject dependant

a67f773

- remove diff abs because useless (SD uses squared difference anyway)

ab1eae1

- remove diff abs because useless

7874439

jcohenadad reviewed May 14, 2021

View reviewed changes

PaulBautin added 2 commits May 20, 2021 10:43

- change formula for sample size

e599ab7

- add comments

- Monte Carlo simulation for between group sample size computation

05d7fcd

PaulBautin added 5 commits May 20, 2021 15:06

- add comments for sample size function

f50f18a

- improve comments for formula var and var_diff

be0ebb4

- limit usage of rescale_area to plots

31d2ab1

- remove rounding before boxplots for csa and atrophy

02c47e7

- normalize by the square of rescale for df['Normalized CSA in mm²'] - use poly1d when plotting trends in plots

- scatter colorbar takes discrete values

bb990c8

- append fake values for pearson and and p_value for rescale =1

PaulBautin added 2 commits May 27, 2021 08:55

- compute sample size using 500 itterations

7676fcb

- un-comment concatenating csv files

- update sample size plot to match article

08ff3a5

- change iteration number for computing sample size and print message

- remove ceil sur chaque calcul de sample size

d35b6f2

sandrinebedard and others added 16 commits July 25, 2024 17:06

add egg info

120fa41

change sct_deepseg for sct_deepseg_sc

773c191

change disc file label

e745a3c

add fix for qform sfrom and fix typo disc labels

25524fd

change for sct

656e174

modify for python 3.9 compatibility

c849a24

setup for compute canada

9511c21

rsync disc file

96c1b1d

rm qc from sct_deepseg

08bfc54

fix missing eextension

6e36cbf

remove extra QC

a5942d4

remove one qc report

28b8c78

remove all QC reports

5d37e0a

remove all qc reports

bc3e882

add logging

0b40fb6

Merge branch 'sb/update-for-ca-python3.9' into graph

d7c5bf9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update stats plots, add longitudinal sample size calculation #98

Update stats plots, add longitudinal sample size calculation #98

PaulBautin commented Jan 25, 2021 •

edited

Loading

PaulBautin commented May 13, 2021 •

edited by jcohenadad

Loading

jcohenadad May 14, 2021

PaulBautin May 20, 2021

PaulBautin commented May 20, 2021 •

edited by jcohenadad

Loading

jcohenadad commented May 20, 2021

PaulBautin commented May 27, 2021

PaulBautin commented Jul 9, 2021

jcohenadad commented Jul 12, 2021

Update stats plots, add longitudinal sample size calculation #98

Are you sure you want to change the base?

Update stats plots, add longitudinal sample size calculation #98

Conversation

PaulBautin commented Jan 25, 2021 • edited Loading

PaulBautin commented May 13, 2021 • edited by jcohenadad Loading

jcohenadad May 14, 2021

Choose a reason for hiding this comment

PaulBautin May 20, 2021

Choose a reason for hiding this comment

PaulBautin commented May 20, 2021 • edited by jcohenadad Loading

jcohenadad commented May 20, 2021

PaulBautin commented May 27, 2021

PaulBautin commented Jul 9, 2021

jcohenadad commented Jul 12, 2021

PaulBautin commented Jan 25, 2021 •

edited

Loading

PaulBautin commented May 13, 2021 •

edited by jcohenadad

Loading

PaulBautin commented May 20, 2021 •

edited by jcohenadad

Loading