A machine learning pipeline for trustworthy and green models, enabling responsible AI:
- Explainable AI, using SHAP, LIME or both.
- Uncertainty estimation, using Bayesian dropout for neural networks.
- Carbon emissions tracking and reporting, using CodeCarbon.
d2m lets you easily create and evaluate machine learning models for tabular and time series data, with built-in data profiling and feature engineering.
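
The uncertainty estimation item above can be illustrated with a small sketch. "Bayesian dropout" is commonly implemented as Monte Carlo dropout: dropout stays active at prediction time, the network is run several times, and the spread of the outputs serves as an uncertainty estimate. The code below is not taken from d2m. The network weights, the function names `forward` and `mc_dropout_predict`, and the parameter defaults are all hypothetical, and only the Python standard library is used.

```python
import random
import statistics

# Hypothetical fixed weights for a tiny network: 2 inputs, 3 hidden units, 1 output.
W1 = [[0.5, -0.2], [0.1, 0.8], [-0.3, 0.4]]
B1 = [0.0, 0.1, -0.1]
W2 = [0.7, -0.5, 0.9]
B2 = 0.05


def forward(x, drop_rate, rng):
    """One stochastic forward pass with dropout applied to the hidden units."""
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    keep = 1.0 - drop_rate
    out = B2
    for w_row, b, w_out in zip(W1, B1, W2):
        hidden = max(0.0, sum(w * xi for w, xi in zip(w_row, x)) + b)  # ReLU
        # Inverted dropout: keep the unit with probability `keep` and rescale it,
        # so the expected output matches the network without dropout.
        if rng.random() < keep:
            out += w_out * hidden / keep
    return out


def mc_dropout_predict(x, drop_rate=0.2, n_samples=200, seed=0):
    """Return (mean, standard deviation) over repeated stochastic forward passes."""
    rng = random.Random(seed)
    samples = [forward(x, drop_rate, rng) for _ in range(n_samples)]
    return statistics.mean(samples), statistics.stdev(samples)


if __name__ == "__main__":
    mean, std = mc_dropout_predict([1.0, 2.0])
    print(f"prediction: {mean:.3f} +/- {std:.3f}")
```

A larger standard deviation indicates that the prediction is sensitive to which units are dropped, which is read as higher model uncertainty. With `drop_rate=0` every pass is identical and the spread is zero.
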
Tested on:
- Linux
- macOS
- Windows with WSL 2

Usage:
- Clone or download this repository.
- Place your data files (CSV) in a folder with the name of your dataset (`DATASET`) inside `assets/data/raw/`, so the path to the files is `assets/data/raw/[DATASET]/`.
- Update `params.yaml` with the name of your dataset (`DATASET`), the target variable, and other configuration parameters.
- Build the Docker container:
  `docker build -t d2m -f Dockerfile .`
- Run the container:
  `docker run -p 5000:5000 -it -v $(pwd)/assets:/usr/d2m/assets -v $(pwd)/.dvc:/usr/d2m/.dvc d2m`
- Open the website at `localhost:5000` to use the graphical user interface.

To run the pipeline from the command line instead of the graphical user interface:
- Copy `params.yaml` from the host to the container (find `CONTAINER_NAME` by running `docker ps`):
  `docker cp params.yaml [CONTAINER_NAME]:/usr/d2m/params.yaml`
- From a terminal on the host, run the pipeline inside the container:
  `docker exec [CONTAINER_NAME] dvc repro`
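
Before building the container, it can help to confirm that the data files sit where the setup steps above expect them, under `assets/data/raw/[DATASET]/`. The helper below is a minimal sketch and not part of d2m. The function name `find_csv_files` and the fallback dataset name `my_dataset` are hypothetical, and the check only assumes the folder layout described in this README.

```python
import sys
from pathlib import Path


def find_csv_files(dataset, root="assets/data/raw"):
    """Return the CSV files in <root>/<dataset>/, sorted by name."""
    folder = Path(root) / dataset
    if not folder.is_dir():
        raise FileNotFoundError(f"Expected dataset folder at {folder}")
    files = sorted(
        p for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() == ".csv"
    )
    if not files:
        raise ValueError(f"No CSV files found in {folder}")
    return files


if __name__ == "__main__":
    # "my_dataset" is a hypothetical placeholder; pass your dataset name as an argument.
    name = sys.argv[1] if len(sys.argv) > 1 else "my_dataset"
    try:
        for path in find_csv_files(name):
            print(path)
    except (FileNotFoundError, ValueError) as err:
        print(f"Layout problem: {err}")
```

Run it from the repository root with the dataset name as the only argument. It lists the files it finds, or reports a missing folder or an empty one.
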