Jupyter Notebook presentation for class imbalance in binary classification
-
Updated
Aug 11, 2018 - Jupyter Notebook
Jupyter Notebook presentation for class imbalance in binary classification
Data Science Classification General Notebook
Credit card fraud detection, gender classification from name etc.
My kernels on Kaggle
Best for beginners | Well explained ML algorithms | organized Notebooks | Case Studies
Proposed assignment notebooks for Advanced Topics in Machine Learning tasks
Machine learning notebooks
This notebook tries to make fraud/not fraud predictions on a transactions dataset with highly imbalanced data.
A Jupyter notebook that applies machine learning techniques to detect credit card fraud on imbalanced data. It covers data preprocessing, EDA, handling class imbalance, training classifiers (Logistic Regression, Decision Tree, RandomForest), and saving the trained models.
This particular notebook consist of all the Feature Engineering technique and Feature Transformation technique
This notebook will walk you through the steps for dealing with an imbalanced dataset using an example of a real project that I recently completed.
This repo has a notebook that I worked on for making a fraud detection model. The dataset was Highly imbalanced, so i used random undersampling to balance the data.
This notebook shows how the f1 metric differs accuracy on imbalanced data. The heart disease dataset from kaggle is used (https://www.kaggle.com/datasets/kamilpytlak/personal-key-indicators-of-heart-disease).
Contained in this repository are the Jupyter notebooks that contain the scripts used in this project. Examples include: exploratory data analysis, creation of training, validation and test data sets, and CNN model development and data extraction.
This project analyzes phone usage patterns in India and predicts the primary use of mobile devices based on various features. The notebook covers data preprocessing, exploratory data analysis (EDA), and model training using multiple classification algorithms.
In this notebook, I applied statistical methods for imbalanced data analysis. In terms of basics, it starts with null check, data description and handling missing values. There exists right skewness in data for numerical columns. Shapiro-Wilk and Anderson darling tests are applied to prove that data is not distributed normally. Outlier detection…
Detect Fraud Transaction from the dataset . The project involves dealing with unbalanced dataset and concept drift. I have implemented 4 machine learning algorithms to predict Fraud Transaction . These are - Logistic Regression ,Support Vector Machine(SVM), Local Outlier Factor(LOF) and isolation Tree.See my python 3 notebook to get more insight…
Add a description, image, and links to the imbalanced-data topic page so that developers can more easily learn about it.
To associate your repository with the imbalanced-data topic, visit your repo's landing page and select "manage topics."