Automated Model Monitoring & Retraining Dashboard

This project demonstrates a complete, closed-loop MLOps system built entirely within a GitHub repository. It automates the process of monitoring a machine learning model for performance degradation (like data drift) and orchestrates a human-in-the-loop workflow for seamless retraining and deployment.

%%{init: {'theme': 'dark'}}%%
graph LR
    %% Style Definitions for attractiveness %%
    classDef automation fill:#243B53,stroke:#fff,color:#fff
    classDef human fill:#D65D0E,stroke:#fff,color:#fff
    classDef success fill:#427B58,stroke:#fff,color:#fff
    classDef failure fill:#9D2929,stroke:#fff,color:#fff

    %% Node Definitions %%
    Monitor["📅 Scheduled<br/>Monitoring"]
    DriftCheck{"Drift<br/>Detected?"}
    Healthy["✅ System Healthy"]
    Issue["🚩 Issue Created"]
    
    Human["👤 Human<br/>Applies Label"]
    
    Retrain["🤖 Automated<br/>Retraining"]
    Validate{"New Model<br/>Better?"}
    Discard["❌ Model Discarded"]
    PR["🔀 PR Opened"]
    Merge["👩‍💻 Review &<br/>Merge"]

    %% Connections %%
    Monitor --> DriftCheck
    DriftCheck -- No --> Healthy
    DriftCheck -- Yes --> Issue
    
    Issue ---> Human
    Human ---> Retrain
    
    Retrain --> Validate
    Validate -- No --> Discard
    Validate -- Yes --> PR
    PR ---> Merge

    %% Apply Styles %%
    class Monitor,Retrain automation
    class Human,Merge human
    class Healthy,PR success
    class Issue,Discard failure

🚀 Core Features

Automated Monitoring: A scheduled GitHub Actions workflow runs daily to check for data drift and model performance issues using the Evidently AI library.
Live Dashboard: The monitoring report is automatically generated as an interactive HTML file and published to a live GitHub Pages URL for easy viewing.
Intelligent Alerting: If significant drift is detected, the workflow automatically creates a GitHub Issue, notifying maintainers and providing a link to the dashboard.
Human-in-the-Loop Trigger: The retraining process is initiated only when a human maintainer applies a specific label (e.g., retrain-model) to the alert issue.
Automated Retraining & Validation: A second workflow automatically retrains the model on fresh data, validates its performance against the old model, and records the results.
Automated Pull Request: If the new model is better, the workflow creates a Pull Request with the updated model file and validation metrics, ready for a final human review and merge to production.

🛠️ Tech Stack

Language: Python 3.11
Core Libraries:
- scikit-learn: For model training (RandomForestRegressor).
- pandas: For data manipulation.
- Evidently AI: For generating data drift and performance monitoring reports.
Orchestration & CI/CD: GitHub Actions
Dashboard Hosting: GitHub Pages
Alerting & Triggers: GitHub Issues and Labels

⚙️ How It Works

This repository contains two primary workflows that create the closed-loop system:

1. The Monitoring Workflow (`.github/workflows/monitor.yml`)

On a schedule (daily) or manual trigger:
The workflow checks out the repository code.
It runs the scripts/monitor.py script.
This script generates an Evidently AI dashboard and a JSON report.
The HTML dashboard is deployed to GitHub Pages.
The script checks the JSON report for drift.
If drift is detected, an issue is automatically created in the repository.

2. The Retraining Workflow (`.github/workflows/retrain.yml`)

Triggered when an issue is labeled with retrain-model:
The workflow checks out the repository code.
It runs the scripts/retrain.py script.
This script retrains the model and validates its performance against the old one, saving the results.
If the new model is better, the workflow commits the new model to a new branch.
An automated Pull Request is created for a final review and merge.

🔧 Project Setup

To run this project yourself, follow these steps:

Fork this repository.

Create and activate a Python virtual environment:

python -m venv .venv
source .venv/bin/activate  # On Windows, use `.\.venv\Scripts\activate`

Install the required dependencies:
```
pip install -r requirements.txt
```
Enable GitHub Pages: In your repository's settings, go to "Pages" and set the "Source" to "GitHub Actions".
Create the retrain-model label: In your repository's "Issues" tab, go to "Labels" and create a new label named retrain-model.

You can now manually run the "Model Monitoring" workflow from the Actions tab to test the full pipeline.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github		.github
data		data
models		models
reports		reports
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
validation_results.json		validation_results.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Automated Model Monitoring & Retraining Dashboard

🚀 Core Features

🛠️ Tech Stack

⚙️ How It Works

1. The Monitoring Workflow (`.github/workflows/monitor.yml`)

2. The Retraining Workflow (`.github/workflows/retrain.yml`)

🔧 Project Setup

About

Uh oh!

Releases

Packages

Languages

License

harshithluc073/model-monitoring-dashboard

Folders and files

Latest commit

History

Repository files navigation

Automated Model Monitoring & Retraining Dashboard

🚀 Core Features

🛠️ Tech Stack

⚙️ How It Works

1. The Monitoring Workflow (.github/workflows/monitor.yml)

2. The Retraining Workflow (.github/workflows/retrain.yml)

🔧 Project Setup

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

1. The Monitoring Workflow (`.github/workflows/monitor.yml`)

2. The Retraining Workflow (`.github/workflows/retrain.yml`)

Packages