Choose the best algorithm by cross-validation (k-fold) accuracy. For the actual test, the relative class weights were also passed as an argument to the classifier, under the assumption that the dataset reflects the true class distribution of unseen instances.
RandomForest on the White Wine data set (90% training / 10% test):
- False Positive Rate: 0.2469
- False Negative Rate: 0.1098
- Accuracy: 0.8449
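Passing class weights to the classifier, as described above, might be sketched as follows. This is illustrative only: it uses toy data in place of the wine CSV, and `class_weight="balanced"` (weights derived from observed class frequencies) stands in for an explicit weight dictionary.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Toy stand-in for the wine data: 11 features, imbalanced binary label.
rng = np.random.RandomState(0)
X = rng.rand(500, 11)
y = (X[:, 0] + 0.3 * rng.rand(500) > 0.8).astype(int)

# 90% training / 10% test, stratified to preserve class proportions.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.10, stratify=y, random_state=0)

# class_weight="balanced" weights classes by inverse frequency; an
# explicit {label: weight} dict reflecting the assumed distribution
# also works.
clf = RandomForestClassifier(n_estimators=50, class_weight="balanced",
                             random_state=0)
clf.fit(X_train, y_train)
acc = clf.score(X_test, y_test)
```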
- explore.py: how the data was analyzed
- evaluate.py: how the algorithms were evaluated
- train.py: a solution implementing the results of the first two
- WEKA 3.7.1 - used as a sanity check, to rapidly iterate through ideas in a GUI
- WinPython 2.7.6 - basis of the Python files provided; includes the following packages used:
- pandas
- sklearn
- numpy
- matplotlib
- Convert to CSV: replace all ";" with ",".
- On line 2729, replace ",," with "," (an extra empty field).
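The two cleanup steps above can be sketched as a small helper (the function name and flag are illustrative, not from the original scripts):

```python
def to_csv_line(line, fix_double_comma=False):
    """Convert one ';'-separated record to CSV form.

    fix_double_comma handles the single record with an extra
    empty field by collapsing ",," to ",".
    """
    line = line.replace(";", ",")
    if fix_double_comma:
        line = line.replace(",,", ",")
    return line
```

For example, `to_csv_line("7.0;0.27;0.36")` yields `"7.0,0.27,0.36"`.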
- Identify instances with attribute values that may skew learning algorithms (explore.py) - https://onlinecourses.science.psu.edu/stat857/node/223
- Outliers: visualize the data with histograms and box plots.
- Correlation: Pearson and Spearman - values close to |1| suggest a feature should be removed - http://www3.nd.edu/~mclark19/learn/ML.pdf
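The correlation check above might look like the following sketch, using a toy frame in place of the wine attributes (column names and the injected dependence are invented for illustration):

```python
import numpy as np
import pandas as pd

# Toy stand-in for the wine attributes; in the real explore.py these
# would come from the converted CSV.
rng = np.random.RandomState(0)
df = pd.DataFrame({
    "fixed_acidity": rng.normal(7, 1, 200),
    "density": rng.normal(0.99, 0.003, 200),
})
# Deliberately near-collinear column to show what |1|-ish looks like.
df["residual_sugar"] = 50 * (1.0 - df["density"]) + rng.normal(0, 0.05, 200)

# Pearson (linear) and Spearman (rank) correlation matrices;
# entries near |1| flag redundant features for removal.
pearson = df.corr(method="pearson")
spearman = df.corr(method="spearman")

# Visual outlier checks, with matplotlib available:
# df.hist(); df.boxplot()
```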
- Identify possible candidate algorithms and evaluate (evaluate.py), on each dataset variant:
  1. As-is
  2. Feature removal
  3. Outlier removal
- Record accuracy & runtime.
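The evaluation loop above might be sketched as follows. The candidate models and toy data are placeholders for whatever evaluate.py actually compares across the three dataset variants:

```python
import time
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.model_selection import cross_val_score

# Toy data standing in for one dataset variant.
rng = np.random.RandomState(0)
X = rng.rand(300, 11)
y = (X[:, 0] > 0.5).astype(int)

# Record mean 10-fold cross-validation accuracy and wall-clock
# runtime for each candidate algorithm.
results = {}
for name, model in [
    ("RandomForest", RandomForestClassifier(n_estimators=50, random_state=0)),
    ("NaiveBayes", GaussianNB()),
]:
    start = time.time()
    scores = cross_val_score(model, X, y, cv=10, scoring="accuracy")
    results[name] = (scores.mean(), time.time() - start)
```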