imbalanced-data

A Jupyter notebook that applies machine learning techniques to detect credit card fraud on imbalanced data. It covers data preprocessing, EDA, handling class imbalance, training classifiers (Logistic Regression, Decision Tree, RandomForest), and saving the trained models.

machine-learning logistic-regression decision-tree imbalanced-data random-forest-classifier

Updated Sep 13, 2024
Jupyter Notebook

mdzaheerjk / Feature-Engineering

Sponsor

Star

🔧 Feature Engineering made simple & practical 📊 Handle missing values, imbalance, outliers & encodings 📝 Interactive Jupyter notebooks + reusable scripts 🚀 Supercharge ML models with better features

outlier-detection feature-engineering imbalanced-data smote onehot-encoding

Updated Sep 11, 2025
Jupyter Notebook

ashishrana1501 / Feature-Engineering

Star

This particular notebook consist of all the Feature Engineering technique and Feature Transformation technique

algorithms exploratory-data-analysis ml eda feature-selection feature-engineering imbalanced-data

Updated Jul 5, 2022
Jupyter Notebook

KonNik88 / heart-disease-ml-practice

Star

Practice notebook on heart-disease risk with a small/noisy dataset: EDA → preprocessing → classic ML baselines (scikit-learn). Not for clinical use

machine-learning scikit-learn jupyter-notebook eda healthcare classification reproducibility imbalanced-data model-evaluation heart-disease optuna

Updated Oct 3, 2025
Jupyter Notebook

OMahmoodi / imbalanced_data

Star

This notebook will walk you through the steps for dealing with an imbalanced dataset using an example of a real project that I recently completed.

imbalanced-data oversampling undersampling stratified-sampling smote-oversampler

Updated Mar 8, 2022
Jupyter Notebook

CodeFor2001 / Credit-Card-Fraud-Detection-model

Star

This repo has a notebook that I worked on for making a fraud detection model. The dataset was Highly imbalanced, so i used random undersampling to balance the data.

machine-learning python3 imbalanced-data fraud-detection classification-model fraudulent-transactions

Updated Apr 10, 2022
Jupyter Notebook

marizombie / f1-score-vs-accuracy

Star

This notebook shows how the f1 metric differs accuracy on imbalanced data. The heart disease dataset from kaggle is used (https://www.kaggle.com/datasets/kamilpytlak/personal-key-indicators-of-heart-disease).

comparison accuracy logistic-regression imbalanced-data f1-score accuracy-metrics f1score imbalance-classification

Updated Apr 8, 2022
Jupyter Notebook

mquinlan0824 / Computer-Vision-Master-Thesis-Project

Star

Contained in this repository are the Jupyter notebooks that contain the scripts used in this project. Examples include: exploratory data analysis, creation of training, validation and test data sets, and CNN model development and data extraction.

data binary fog cnn computer vision meteorology classification transfer-learning evaluation-metrics augmentation imbalanced-data imbalance-classification

Updated Jul 7, 2021
Jupyter Notebook

BatthulaVinay / phone-usage-analysis

Star

This project analyzes phone usage patterns in India and predicts the primary use of mobile devices based on various features. The notebook covers data preprocessing, exploratory data analysis (EDA), and model training using multiple classification algorithms.

Updated Feb 14, 2025
Jupyter Notebook

e181337 / data_analysis

Star

In this notebook, I applied statistical methods for imbalanced data analysis. In terms of basics, it starts with null check, data description and handling missing values. There exists right skewness in data for numerical columns. Shapiro-Wilk and Anderson darling tests are applied to prove that data is not distributed normally. Outlier detection…

statistics analysis data-analysis outlier-detection imbalanced-data chi-square-test shapiro-wilk anderson-darling-test

Updated Dec 19, 2021
Jupyter Notebook

daxpatel11 / Fraud-Detection-in-Payments-Anamaly-Detection-

Star

Detect Fraud Transaction from the dataset . The project involves dealing with unbalanced dataset and concept drift. I have implemented 4 machine learning algorithms to predict Fraud Transaction . These are - Logistic Regression ,Support Vector Machine(SVM), Local Outlier Factor(LOF) and isolation Tree.See my python 3 notebook to get more insight…

machine-learning-algorithms jupyter-notebook python3 imbalanced-data anamoly-detection

Updated Jun 8, 2020
Jupyter Notebook

Improve this page

Add a description, image, and links to the imbalanced-data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the imbalanced-data topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

imbalanced-data

Here are 19 public repositories matching this topic...

wangz10 / class_imbalance

nnilayy / Classification-Notebook

acharles7 / data-science-notebooks

Arezoo-Dahesh / My-notebooks-on-Kaggle

aditya11ad / ML-course

pdoup / ATML-notebooks

sharonchoong / ml-notes

jincy-p-janardhanan / imbalanced-fraud-detection

rakibnsajib / Credit-Card-Fraud-Detection-on-Imbalanced-Data-Using-Machine-Learning

mdzaheerjk / Feature-Engineering

ashishrana1501 / Feature-Engineering

KonNik88 / heart-disease-ml-practice

OMahmoodi / imbalanced_data

CodeFor2001 / Credit-Card-Fraud-Detection-model

marizombie / f1-score-vs-accuracy

mquinlan0824 / Computer-Vision-Master-Thesis-Project

BatthulaVinay / phone-usage-analysis

e181337 / data_analysis

daxpatel11 / Fraud-Detection-in-Payments-Anamaly-Detection-

Improve this page

Add this topic to your repo