SVMs-from-Scratch

Implementing SVMs on the Splice Dataset from UCI’s machine learning data repository. The provided binary classification dataset has 60 input features, and the training and test sets contain 1,000 and 2,175 samples, respectively. The files containing features are called train data.txt and test data.txt, and the files containing labels are called train label.txt and test label.txt.

Data preprocessing

Preprocess the training and test data by

Computing the mean of each feature and subtracting it from all values of this feature.
Dividing each feature by its standard deviation, defined as

for feature k. This type of preprocessing is useful for SVMs, as SVMs attempt to maximize the distance between the separating hyperplane and the support vectors. If one feature (i.e., one dimension in this space) has very large values, it will dominate the other features when calculating this distance. Rescaling the features (e.g. to [0, 1]), will ensure that they all have the same influence on the distance metric.

Implementing a linear SVM

The input of train svm contains training feature vectors and labels, as well as the tradeoff parameter C. The output of train svm contain the SVM parameters (weight vector and bias).

To solve the above quadratic problem, I used the cvxopt.solvers.qp function in the CVXOPT 4 Python package. For test svm, the input contains testing feature vectors and labels, as well as SVM parameters. The output contains the test accuracy.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
1.png		1.png
4.png		4.png
LICENSE		LICENSE
README.md		README.md
svm.py		svm.py
test_data.txt		test_data.txt
test_label.txt		test_label.txt
train_data.txt		train_data.txt
train_label.txt		train_label.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SVMs-from-Scratch

Data preprocessing

Implementing a linear SVM

About

Uh oh!

Releases

Packages

Languages

License

ArvindSubramaniam/SVMs-from-Scratch

Folders and files

Latest commit

History

Repository files navigation

SVMs-from-Scratch

Data preprocessing

Implementing a linear SVM

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages