Skip to content

jinyao-lee/CS3244-SMS-Spam-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CS3244: SMS Spam Project

CS3244 project - working on SMS spam classifier using K-Nearest Neighbors (k-NN) algorithm.

This repo contains the code used for implmenting k-NN algorithm to classify spam and ham SMS messages from the dataset spam.csv.

File contents list: -

File Details
knn_v2.py Contains the source code for k-NN implementation. This version randomly shuffles and obtains 20% of training data to be the validation set.
knn_v3.py Contains the source code for k-NN implementation. This version implements 10-fold cross validation.
Training data Contains the training data used to classify SMS messages.
Experiment results Contains the experimental accuracy results for classifying messages using k-NN. (Note: x-axis denotes k-value, y-axis denotes validation error)

About

CS3244 project - worked on SMS spam classifier using KNN

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages