Predicting gene essentiality and drug response from perturbation screens in preclinical cancer models
LEAP is a novel ensemble framework designed to improve robustness and generalization in predicting perturbation responses from molecular profiles. By leveraging multiple DAMAE (Data Augmented Masked Autoencoder) representations and LASSO regressors, LEAP consistently outperforms state-of-the-art approaches in predicting gene essentiality and drug responses across diverse biological contexts.
make installThis will install uv (if needed) and all project dependencies.
Run quality checks before committing:
make checks # Run pre-commit hooks (linting, formatting, type checking)
make tests # Run tests with coveragemake cleanRemoves all temporary files, caches, and build artifacts.
make clean-dataRemoves all processed data, trained models, and results. Warning: This action cannot be undone!
sh run_pipeline.shRuns the full LEAP pipeline end-to-end:
- Pretrain representations
- Train regression heads for multiple tasks, models, and seeds
- Ensemble predictions
The pipeline is configured to run on task 1 with the mae_ps_enet model across 5 different seeds.
If you use LEAP in your research, please cite our paper:
@misc{bodinier2025predictinggeneessentialitydrug,
title={Predicting gene essentiality and drug response from perturbation screens in preclinical cancer models with LEAP: Layered Ensemble of Autoencoders and Predictors},
author={Barbara Bodinier and Gaetan Dissez and Lucile Ter-Minassian and Linus Bleistein and Roberta Codato and John Klein and Eric Durand and Antonin Dauvin},
year={2025},
eprint={2502.15646},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2502.15646},
}This project is licensed under the MIT License - see the LICENSE file for details.
- Barbara Bodinier
- Gaetan Dissez
- Lucile Ter-Minassian
- Linus Bleistein
- Roberta Codato
- John Klein
- Eric Durand
- Antonin Dauvin