This project built a supervised Extra Trees model to address the classification problem.
- Built a feature-selection function based on the correlation coefficient matrix and data visualization, cutting the feature set by 78% (28 features down to 6) while maintaining overall model performance.
- Improved recall while maintaining precision by applying a customized algorithm together with the precision-recall curve.
- Demonstrated dataset manipulation use cases with NumPy and Pandas.
- Demonstrated data visualization use cases with Matplotlib, Seaborn, and Plotly.
- Demonstrated model evaluation use cases with classification reports, confusion matrices, and precision-recall curves.
- Demonstrated resampling use cases with SMOTE and Random Sampler.
- Demonstrated model building, evaluation, hyperparameter tuning, and pipeline workflows with Sklearn.
- Demonstrated dimension reduction use cases with Autoencoder and UMAP.
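As a rough sketch of the correlation-based feature selection described above (the 0.9 threshold and the toy column names are illustrative assumptions, not the project's actual values):

```python
import numpy as np
import pandas as pd

def drop_correlated_features(df, threshold=0.9):
    """Drop one feature from each pair whose absolute Pearson
    correlation exceeds `threshold` (threshold is illustrative)."""
    corr = df.corr().abs()
    # Inspect only the upper triangle so each pair is checked once.
    upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
    to_drop = [col for col in upper.columns if (upper[col] > threshold).any()]
    return df.drop(columns=to_drop)

# Toy frame: "b" is a scaled copy of "a", so it should be dropped.
df = pd.DataFrame({"a": [1, 2, 3, 4], "b": [2, 4, 6, 8], "c": [4, 1, 3, 2]})
reduced = drop_correlated_features(df, threshold=0.9)
```

The same filter, applied to the project's 28 original features, would be one way to arrive at a much smaller feature set without retraining-based selection.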
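The recall-vs-precision trade-off above can be sketched with Sklearn's `precision_recall_curve`: pick the lowest decision threshold that still meets a precision floor, which maximizes recall under that constraint. The synthetic data and the 0.80 floor are illustrative assumptions, not the project's setup:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier
from sklearn.metrics import precision_recall_curve
from sklearn.model_selection import train_test_split

# Imbalanced synthetic data standing in for the real dataset.
X, y = make_classification(n_samples=600, weights=[0.8, 0.2], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

clf = ExtraTreesClassifier(random_state=0).fit(X_tr, y_tr)
proba = clf.predict_proba(X_te)[:, 1]

precision, recall, thresholds = precision_recall_curve(y_te, proba)
# precision[:-1] and recall[:-1] align with `thresholds`; keep the
# thresholds whose precision is at least 0.80 and take the one with
# the highest recall (0.80 is an illustrative floor).
ok = precision[:-1] >= 0.80
best_threshold = thresholds[ok][np.argmax(recall[:-1][ok])]
y_pred = (proba >= best_threshold).astype(int)
```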
- Data processing
  - NumPy
  - Pandas
- Data visualization
  - Matplotlib
  - Seaborn
  - Plotly
- Sampling
  - Pandas
    - Random sampling
  - Sklearn
    - Train-test split
  - Imblearn
    - SMOTE
    - Random Sampler
- Dimension reduction
  - UMAP
  - Autoencoder
- Model building & evaluation
  - Sklearn
    - Cross-validation
    - Grid search
    - Pipeline
    - Extra Trees model
    - Classification report
    - Confusion matrix
    - Precision-recall curve
- Model selection
  - Sklearn
  - Pycaret
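One way the Sklearn items above (Pipeline, grid search, cross-validation, Extra Trees) can fit together is sketched below; the scaler, the tiny parameter grid, and the recall scoring are illustrative assumptions, not the project's actual configuration:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=300, random_state=0)

# Chaining preprocessing and the model in one Pipeline keeps
# cross-validation folds free of preprocessing leakage.
pipe = Pipeline([
    ("scale", StandardScaler()),
    ("model", ExtraTreesClassifier(random_state=0)),
])

# Small illustrative grid; a real search would sweep wider ranges.
grid = GridSearchCV(
    pipe,
    {"model__n_estimators": [50, 100], "model__max_depth": [None, 5]},
    cv=3,
    scoring="recall",
)
grid.fit(X, y)
```

`grid.best_params_` then reports the winning hyperparameters, and `grid.best_estimator_` is the refit pipeline ready for prediction.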
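A minimal sketch of the SMOTE resampling listed under Sampling, assuming the `imblearn` package is installed; the toy 9:1 imbalance is illustrative:

```python
from collections import Counter

from imblearn.over_sampling import SMOTE
from sklearn.datasets import make_classification

# Imbalanced toy set: roughly a 9:1 majority/minority split.
X, y = make_classification(n_samples=500, weights=[0.9, 0.1], random_state=0)

# SMOTE synthesizes new minority-class samples by interpolating
# between existing minority samples and their nearest neighbors,
# balancing the classes before model training.
X_res, y_res = SMOTE(random_state=0).fit_resample(X, y)
```

Resampling should be applied only to the training split (e.g. inside an `imblearn` pipeline), so evaluation still reflects the true class balance.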