This project involves classifying BBC News headlines into their respective categories [ Tech, Entertainment, Sports, Politics, and Business ]. The classification is achieved through custom implementations of Naive Bayes and Support Vector Machine (SVM) classifiers.
- Dataset:
- Used the BBC News dataset containing categories and corresponding headlines.
- Preprocessing:
- Applied Natural Language Processing (NLP) technique such as word tokenization.
- Utilized pickling to save processed data for efficient reuse.
- Custom Implementations:
- Naive Bayes classifier developed from scratch.
- Support Vector Machine (SVM) classifier implemented from scratch.
- Model Comparison:
- Generated performance comparison graphs for the implemented classifiers.
- Clone the repository:
https://github.com/KoushikReddy9963/News-Article-Classifier.git
- Enter the directory:
cd News-Article-Classifier
- Install dependencies:
pip install pickle pip install numpy pip install pandas pip install nltk pip install seaborn pip install matplotlib pip install sklearn