This project builds a credit card fraud detection system using machine learning on a highly imbalanced dataset. We used Random Forest for classification and SMOTE to handle data imbalance.
The dataset contains anonymized features (V1 to V28), Time, Amount, and a Class label indicating fraud or non-fraud.
- Data Preprocessing (Scaling, Handling Imbalance).
- Model Building with Random Forest Classifier.
- Evaluation with accuracy, precision, recall, F1-score, and ROC-AUC.
- SMOTE was used to balance the dataset.
- Python 3.x
- Pandas, NumPy
- Scikit-learn
- Imbalanced-learn
- Matplotlib, Seaborn