Built a complete loan analysis system from raw data to working web app. Covers everything from SQL data prep to deployed machine learning models.
- SQL Work: Combined 3 messy datasets, created new features, cleaned everything up
- Tableau Dashboards: Made charts showing loan trends and borrower patterns
- Python Models: Built models to predict loan amounts and spot risky loans
- Web App: Deployed working app so people can actually use the models
- Simple Linear Regression: Basic model using just loan amount
- Multiple Linear Regression: Uses loan amount, charges, and duration - gets 99.7% accuracy
- Logistic Regression: Predicts if loans will be good or bad - had to fix class imbalance issues
Amount Prediction: Works great - can predict what someone will owe with 99.7% accuracy.
Risk Classification: Started terrible (missed 99% of bad loans), got much better after balancing the data. Now catches over half the risky loans instead of basically none.
notebooks/: All the Python analysis and model buildingsql/: Queries I used to prep and combine the datamodels/: The actual trained modelsapp/: Web app for making predictionstableau/: Dashboards and visualizations
- SQL for data engineering
- Tableau for business dashboards
- Python for machine learning
- Streamlit for the web app
Real working system that goes from messy data to predictions you can actually use.