Skip to content

Evaluation, Reproducibility, Benchmarks Meeting 1

AReinke edited this page Sep 24, 2020 · 1 revision

Minutes of meeting 1

Date: 25th May 2020

Membership: Kevin Zhou (Lead), Lena Maier-Hein (Lead), Nicola Rieke, Stephen Aylward, Jens Petersen, Paul Jäger, Annika Reinke, Carole Sudre, David Zimmerer, Dan Tudosiu


TOP 1: Introduction of attendees

Each member briefly introduced himself and described the personal interest in the benchmarking working group.


TOP 2: Introduction of MONAI

Stephen, Lena and Kevin introduced the concept of MONAI.


TOP 3: Existing initiatives


TOP 4: What members would be willing to contribute

  • Tool for visualization of benchmarking results (DKFZ-CAMI)
  • Implementation of metrics (image wide, component-wise, multi-label) - classification segmentation regression (Carole)

TOP 5: What do we need?

  • Metrics depending on tasks with implementation
  • Comparability/Reproducibility:
    • Scripts for validating on a specific data sets
    • Semantic description of how the training was performed (including data sets used)
    • Addressing randomness in training/testing (e.g. random seeds)
  • Models as state-of-the-art/baseline methods (model zoo)
    • Baselines for a challenge for novice DL researchers ("easy to use")
  • Platform for benchmarking and publishing new methods
    • Similar to papers with code (+ automated)
    • Similar to open (post challenge) leaderboards for commonly used tasks/datasets
  • Quality control for "MONAI certified" data sets
  • Best practices for making trained model (+ "inference script") public
  • Incentives for data sharing
  • Infrastructure for participating in a challenge (including download scripts)
  • Supporting challenge organization
  • Best practices on reporting speed/memory requirements
    • Network benchmarks (DL speed/memory benchmarks to evaluate/score hardware)
  • Supporting fast inference (e.g. for docker-based evaluation)
  • Identifying performance bottlenecks in pipelines

TOP 6: Group organization

  • Slack channel (Paul, DKFZ): #project-monai-benchmark-wg
  • Github (Jens, DKFZ)
  • GoogleDrive: Use the one from MONAI (Lena)
  • Find meeting slot (Lena):
    • Wednesday, 2-3pm CEST monthly
    • Next meeting: 1st July
Clone this wiki locally