Skip to content

Capstone project for my Microsoft Profesional Program in Data Science

License

Notifications You must be signed in to change notification settings

edumunozsala/Predicting_Mortgage_Approvals

Repository files navigation

Predicting Mortgage Approvals

Capstone project for my Microsoft Professional Program in Data Science

Link to my blog post

This repository contains many of the notebooks developed to analyze a dataset of mortgage approvals. This exercise is the capstone project in the Microsoft Professional Program in Data Science and the report included is the result of this analysis. Then as part of the capstone I developed a machine learning model to classify when a mortgage application would be accepted or not by the loan company. The ml model was developed using the Microsoft tool, Azure Machine Learning Studio.

I have all included some Jupyter notebooks used to built some basic classfication model based in decision trees. But they are not intended to be the best classifier, they are just some simple excercises in building a classifier using the Sci-Kit Learn lybrary on Python.

Some of the notebooks are under construction and maybe some new model will be included in the future.

Content

Problem and data description

Predicting if a mortgage application will either be accepted or denied within the USA will be very useful. This report describes the process taken to do exactly that, and notably the prediction is accomplished without industry standard features such as credit score, debt to income ratio, and loan to value ratio. The datasets provided for this project were adapted from the Federal Financial Institutions Examination Council, or better known as the FFIEC. The training dataset included 500,000 observations across 23 features, with a unique row identifier, and a known acceptance outcome. This data was used to train a binary classification model using Supervised Machine Learning (SML). A second dataset with 500,000 additional observations, lacking a known acceptance outcome, was then utilized to ‘test’ the model by creating predictions of the acceptance outcome.

A full description can be found in the Exploratory Data Analysis described in this post.

Machine learning model built in Azure Machine Learning Studio

From Microsoft Doc:

*Microsoft Azure Machine Learning Studio (classic) is a collaborative, drag-and-drop tool you can use to build, test, and deploy predictive analytics solutions on your data. Azure Machine Learning Studio (classic) publishes models as web services that can easily be consumed by custom apps or BI tools such as Excel.

Machine Learning Studio (classic) is where data science, predictive analytics, cloud resources, and your data meet.*

In the second part of the capstone I built a predictive model on Azure ML Studio to predict when an mortgage approval would be accepted or not, here is the link for a description

Important: This repository is still under construction and new files may be included in the coming weeks

License

This repository is under the GNU General Public License v3.0

About

Capstone project for my Microsoft Profesional Program in Data Science

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published