The project is based around the SegTHOR challenge data, which Caroline Petitjean (challenge organizer) kindly allowed us to use for the course. The challenge originally focused on the segmentation of different organs: heart, aorta, esophagus and trachea.
The project is decomposed into two main parts: weekly individual assignments, and a group project:
- Assignment 01: slicing of the data and 2D, 3D visualization;
- Assignment 02: (to be revealed) something affine;
- Assignment 03: running the baseline, stitching slices into a 3D volume and computing metrics;
- Group project: improve the baseline by adding elements and ideas from the course.
For deadlines and submission guidelines, see the individual assignment pages and the Canvas page.
This codebase is given as a starting point, providing an initial neural network that converges during training. (For broader context, it is itself a fork of an older conference tutorial we gave a few years ago.) It also provides facilities to run some tests locally on a laptop, with a toy dataset and a dummy network.
Summary of codebase (in PyTorch)
- slicing the 3D Nifti files to 2D `.png`; to implement as assignment 01;
- stitching 2D `.png` slices into a 3D volume compatible with the initial Nifti files; to implement as assignment 03;
- basic 2D segmentation network;
- basic training and printing with cross-entropy loss and Adam;
- partial cross-entropy alternative as a loss (to disable one class during training; see the sketch after this list);
- debug options and facilities (CPU version, "dummy" network, smaller datasets);
- saving of predictions as `.png`;
- logging of the 2D DSC and cross-entropy over time, with basic plotting;
- tool to compare different segmentations (`viewer/viewer.py`).
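The partial cross-entropy deserves a word: conceptually, it simply zeroes the contribution of the disabled class in the loss. A minimal sketch of the idea (not the actual implementation in this repo; the function name and shapes are assumptions):

```python
import torch
import torch.nn.functional as F

def partial_cross_entropy(logits, target, ignored_class: int):
    """Cross-entropy where one class is excluded from the loss.

    logits: (B, K, H, W) raw network outputs; target: (B, H, W) integer labels.
    """
    log_probs = F.log_softmax(logits, dim=1)                        # (B, K, H, W)
    one_hot = F.one_hot(target, logits.shape[1]).permute(0, 3, 1, 2).float()
    one_hot[:, ignored_class] = 0                                   # disable that class
    return -(one_hot * log_probs).sum() / one_hot.sum().clamp(min=1)
```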
Some recurring questions may be addressed here directly. As such, expect small changes and additions to this readme over time.
In the following, a line starting with $ is meant to be typed in the terminal (bash, zsh, fish, ...), whereas the absence of a symbol indicates Python code.
$ git clone https://github.com/HKervadec/ai4mi_project.git
$ cd ai4mi_project
$ git submodule init
$ git submodule update
This codebase was written for a somewhat recent Python (3.10 or newer). (Note: Ubuntu and some other Linux distributions might make the distasteful choice of having python point to a 2.x version, requiring you to type python3 explicitly.) The required packages are listed in requirements.txt, and a virtual environment can easily be created from it through pip:
$ python -m venv ai4mi
$ source ai4mi/bin/activate
$ which python # ensure this is not your system's python anymore
$ python -m pip install -r requirements.txt
Conda is an alternative to pip, but it is recommended not to mix conda install and pip install.
These steps assume you are using Git Bash + Anaconda + an IDE (e.g., PyCharm).
Step 1: open Git Bash and run:
$ git clone https://github.com/HKervadec/ai4mi_project.git
$ cd ai4mi_project
$ git submodule init
$ git submodule update
Step 2:
# 1) Create a fresh conda env with Python 3.10+ (matches project note)
$ conda create -n ai4mi python=3.10 -y
# 2) Activate it
$ conda activate ai4mi
# 3) (Optional but nice) make sure pip is present/updated
$ python -m pip install --upgrade pip
# 4) From the repo folder, install dependencies with pip
$ python -m pip install -r requirements.txt
Some common troubleshooting for Windows users:
If Git Bash reports `conda: command not found`, open the Anaconda Prompt and run:
$ conda init bash
# Find where conda is installed
$ where conda
You should get something like `C:\Users\<YourName>\anaconda3`.
Close and reopen Git Bash, then change CONDA_HOME in the snippet below to that path:
$ CONDA_HOME="/c/Users/<YourName>/anaconda3"
if [ -f "$CONDA_HOME/etc/profile.d/conda.sh" ]; then
. "$CONDA_HOME/etc/profile.d/conda.sh"
else
export PATH="$CONDA_HOME:$CONDA_HOME/Scripts:$CONDA_HOME/Library/bin:$PATH"
fi
Alternatively, you can create the conda environment directly in the Anaconda Prompt.
The synthetic dataset is generated randomly, whereas for SegTHOR you need to put the file segthor_train.zip (requires a UvA account) in the data/ folder. If the computer running it is powerful enough, the recipe for data/SEGTHOR in the Makefile can be modified to enable multi-processing (-p -1 option; see python slice_segthor.py --help or its code directly).
$ make data/TOY2
$ make data/SEGTHOR
Windows users can use the following instead:
$ rm -rf data/TOY2_tmp data/TOY2
$ python gen_two_circles.py --dest data/TOY2_tmp -n 1000 100 -r 25 -wh 256 256
$ mv data/TOY2_tmp data/TOY2
$ sha256sum -c data/segthor_train.sha256
$ unzip -q data/segthor_train.zip
$ rm -rf data/SEGTHOR_tmp data/SEGTHOR
$ python slice_segthor.py --source_dir data/segthor_train --dest_dir data/SEGTHOR_tmp \
--shape 256 256 --retain 10
$ mv data/SEGTHOR_tmp data/SEGTHOR
Solution from scratch (make data/SEGTHOR_CLEAN CFLAGS=-O; read up on what python -O does):
$ make data/SEGTHOR_CLEAN CFLAGS=-O -n # Will display the commands that will run, easy to inspect:
rm -rf data/segthor_fixed_tmp data/segthor_fixed
python -O sabotage.py --mode inv --source_dir data/segthor_train --dest_dir data/segthor_fixed_tmp -K 2 --regex_gt "GT.nii.gz" -p 4
mv data/segthor_fixed_tmp data/segthor_fixed
rm -rf data/SEGTHOR_CLEAN_tmp data/SEGTHOR_CLEAN
python -O slice_segthor.py --source_dir data/segthor_fixed --dest_dir data/SEGTHOR_CLEAN_tmp \
--shape 256 256 --retain 10 -p -1
mv data/SEGTHOR_CLEAN_tmp data/SEGTHOR_CLEAN
$ make data/SEGTHOR_CLEAN CFLAGS=-O # Or, for windows users, copy the previously displayed commands
$ python slice_segthor.py --help # May be useful for the project
Alternatively, if you had saved the "fixed" scans within the data/segthor_train folder under the GT_fixed.nii.gz name, you can trivially create a data/segthor_fixed folder with:
cp -r data/segthor_train data/segthor_fixed
rm data/segthor_fixed/train/*/GT.nii.gz
for p in data/segthor_fixed/train/*/; do mv "$p/GT_fixed.nii.gz" "$p/GT.nii.gz"; done
The data can be viewed in different ways:
- looking directly at the `.png` files in the sliced folders (`data/TOY2`, `data/SEGTHOR`);
- using the provided "viewer" to compare segmentations (see below);
- opening the Nifti files from `data/segthor_train` with 3D Slicer or ITK Snap.
Comparing some predictions with the provided viewer (right-click to go to the next set of images, left-click to go back), or simply looking at the data:
$ python viewer/viewer.py --img_source data/TOY2/val/img \
data/TOY2/val/gt \
--show_img -C 256 --no_contour
Note: when using it from an SSH session, X forwarding is required (Unix/BSD, Windows), and it also needs to be enabled on the server side.
For Segthor, comparing for instance the original data, the fixed ones, and the validation predictions at epoch 10:
$ python viewer/viewer.py --img_source data/SEGTHOR_CLEAN/val/img \
data/SEGTHOR/val/gt data/SEGTHOR_CLEAN/val/gt results/segthor/ce/iter010/val \
-n 2 -C 5 --remap "{63: 1, 126: 2, 189: 3, 252: 4}" \
--legend --class_names background esophagus heart trachea aorta
3D Slicer and ITK Snap are two popular viewers for medical data, here comparing GT.nii.gz and the corresponding stitched prediction Patient_01.nii.gz:

Zooming on the prediction with smoothing disabled:

Groups will have to submit:
- archive of the git repo with the whole project, which includes:
- slicing (if any) and any other pre-processing;
- training;
- post-processing when applicable;
- inference;
- metrics computation/scripts to run the metrics submodule;
- the best trained model;
- predictions on the test set (requires an @uva.nl account) (`sha256sum -c data/test.zip.sha256` as optional checksum);
- predictions on the group's internal validation set, the labels of their validation set, and the metrics they computed (akin to Assignment 3).
The main criteria for scoring will include (listed here only for convenience; please see Canvas for the reference rubric):
- improvement (or lack thereof) in performance over the baseline;
- code quality/clear git use;
- the final choice of metrics (they need to be in 3D; see the Dice sketch after this list);
- correctness of the computed metrics (on the validation set);
- oral presentation.
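As a reminder, "in 3D" means the metric is computed per patient on the stitched volume, not averaged over individual 2D slices. A hedged sketch of a per-class 3D Dice (illustrative only, not the grading code):

```python
import numpy as np

def dice_3d(pred: np.ndarray, gt: np.ndarray, K: int) -> np.ndarray:
    """Per-class Dice computed on full (D, H, W) label volumes."""
    dices = np.zeros(K)
    for k in range(K):
        p, g = (pred == k), (gt == k)
        denom = p.sum() + g.sum()
        dices[k] = 2 * np.logical_and(p, g).sum() / denom if denom else 1.0
    return dices
```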
$ git bundle create group-XX.bundle master
torch.save(net, args.dest / "bestmodel-group-XX.pkl")
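Since torch.save(net, ...) pickles the whole module, it is worth checking that the file reloads in a fresh session before submitting. A sketch, assuming a recent PyTorch (older versions do not have the weights_only argument):

```python
import torch

# weights_only=False is required on recent PyTorch to unpickle a full nn.Module.
net = torch.load("bestmodel-group-XX.pkl", map_location="cpu", weights_only=False)
net.eval()
```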
All files should be grouped in a single folder with the following structure:
group-XX/
test/
pred/
Patient_41.nii.gz
Patient_42.nii.gz
...
val/
pred/
Patient_21.nii.gz
Patient_32.nii.gz
...
gt/
Patient_21.nii.gz
Patient_32.nii.gz
...
metric01.npz
metric02.npz
...
group-XX.bundle
bestmodel-group-XX.pkl
The metrics should be .npz archives that map patient ID (e.g., Patient_21) to an ndarray of shape K×D (or K if D = 1), with K the number of classes and D the eventual dimensionality of the metric (which can simply be 1). Ultimately it is the same format as Distorch from Assignment 3.
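For concreteness, a minimal sketch of writing and reading such an archive (the values here are random placeholders, with K=5 and D=1):

```python
import numpy as np

# One entry per patient; each value has shape (K,) or (K, D). Here K=5, D=1.
metrics = {"Patient_21": np.random.rand(5),
           "Patient_32": np.random.rand(5)}
np.savez("metric01.npz", **metrics)

loaded = np.load("metric01.npz")
print(loaded["Patient_21"].shape)  # (5,)
```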
The folder should then be tarred and compressed, e.g.:
Example using Zstandard:
$ tar cf - group-XX/ | zstd -T0 -3 > group-XX.tar.zst
Example using gzip:
$ tar czf group-XX.tar.gz group-XX/
Some installs (probably due to a Python/PyTorch version mismatch) throw an error about an inability to pickle lambda functions (at the dataloader stage). Short of reinstalling everything, setting the number of workers to 0 seems to get around the problem (--num_workers 0).
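If you would rather keep several workers, the root cause is usually a lambda passed somewhere in the dataset transforms; replacing it with a module-level function is a common fix. A minimal illustration (the transform is hypothetical):

```python
import pickle

# An inline lambda cannot be pickled, which multi-worker dataloaders
# rely on when spawning workers:
# pickle.dumps(lambda x: x / 255)  # fails with a PicklingError

# A module-level named function is picklable and behaves identically:
def scale_intensities(x):
    return x / 255

pickle.dumps(scale_intensities)  # works
```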
It may happen that PyTorch, when installed through pip, was compiled for Numpy 1.x, which creates some inconsistencies. Downgrading Numpy seems to solve it: pip install --upgrade "numpy<2"
Windows has different path names (\ instead of /), so the default regex in the viewer needs to be changed to --id_regex=".*\\\\(.*).png".
