Risk markers by sex for in-hospital mortality in patients with acute coronary syndrome: a machine learning approach
This repository identified in-hospital mortality markers for women and men in Acute Myocardial Infarction (ACS) sub-populations from a public database of EHR using machine learning methods.
This repository is organized as follows:
- SQL_extraction: this folder contains a set of scripts to extract the STEMI and NSTEMI patients from MIMIC III database based on SQL. The folder describes the view creates to extract structure and clinical notes. Also, it is described step by step as the data were extracted for further processing.
- Notebooks: contains the jupyter notebooks for exploring, pre-processing and splitting the EHR data.
- Hyperparameter_optimization: contains the code used to hyperparameter optimization with 10 repetitions of stratified 10-fold cross-validation. The models evaluated Logistic Regression, Random Forest Classifier,Support Vector Machine, and XGBoost.
- Sex_specific_risk_markers: contains the codes used for obtaining the sex-specific markers for each ACS-subpopulation.
- Cox model: contain the codes used for computing a multivariable Cox regression model.
- Python 3.6
- Numpy
- Pandas
- Matplotlib
- xgboost
- statsmodels
- jupiter notebook
You can create a conda environment with all the dependencies using the environment.yml file in this repository.
conda env create --file markers_ACS.yml
@article{VAZQUEZ2021100791,
title = {Risk markers by sex for in-hospital mortality in patients with acute coronary syndrome: A machine learning approach},
journal = {Informatics in Medicine Unlocked},
volume = {27},
pages = {100791},
year = {2021},
issn = {2352-9148},
doi = {https://doi.org/10.1016/j.imu.2021.100791},
url = {https://www.sciencedirect.com/science/article/pii/S2352914821002616},
author = {Blanca Vázquez and Gibran Fuentes-Pineda and Fabian García and Gabriela Borrayo and Juan Prohías},
}