Classification-and-Regularization-Project

Classification

We conduct classification task to predict the body performance of class A,B,C,D. Conclusion: Based on the model fits, a non-linear is more appropriate since the accuracy generated from QDA has the highest accuracy, though the accuracy are pretty close to each other in all classification methods.

Regularization

For this portion of the project, we're going to use a data set containing information on the Australian housing market and apply regularization techniques to make predictions on house prices. The data set will have 81 columns of data.

Overall procedure

Numeric pipeline:

We will impute missing values with medians
After that, we will standardize each vector of data to ensure each feature is given equal weighting by our models

Categorical pipeline:

In each vector of data, we will fill missing values with the most commonly observed sample
After that, we will transform each vector to numeric representation using label encoding or by making them dummy variables, where appropriate

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
bodyPerformance.csv		bodyPerformance.csv
house_data.csv		house_data.csv
proj1.Rmd		proj1.Rmd
proj1.pdf		proj1.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

Classification-and-Regularization-Project

Classification

Regularization

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

claire-yiting-zhang/Classification-and-Regularization-Project

Folders and files

Latest commit

History

Repository files navigation

Classification-and-Regularization-Project

Classification

Regularization

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages