Skip to content

Intelligent Data Classifier using KNN and Decision Trees for multi-label classification. Implements algorithms from scratch with hyperparameter tuning to optimize accuracy and handle complex label relationships.

Notifications You must be signed in to change notification settings

AishaniPandey/intelligent-data-classifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

Intelligent Data Classifier

Overview

The Intelligent Data Classifier is a robust machine learning model using K-Nearest Neighbors (KNN) and Decision Trees to perform multi-label classification. This project demonstrates the application of fundamental algorithms developed from scratch, aimed at achieving high accuracy in complex label prediction tasks through meticulous hyperparameter tuning.

Features

  • K-Nearest Neighbors (KNN): Custom implementation of the KNN algorithm, allowing for adjustable parameters such as the number of neighbors and distance metrics.
  • Decision Trees: Utilizes both Powerset and MultiOutput formulations to address complex classification scenarios.
  • Hyperparameter Tuning: Detailed optimization process to enhance model performance.
  • Data Analysis: Extensive exploratory data analysis with visualizations to understand data distributions and relationships.

Technologies Used

  • Python
  • Jupyter Notebook
  • NumPy
  • Matplotlib
  • Bash Scripting

Installation

Clone this repository:

git clone https://github.com/yourusername/intelligent-data-classifier.git

Navigate to the project directory:

cd intelligent-data-classifier

Install the required dependencies:

pip install -r requirements.txt

Usage

Run the Jupyter Notebooks to explore the dataset and model implementation:

jupyter notebook

Execute the bash script to test the model with new data:

bash eval.sh

About

Intelligent Data Classifier using KNN and Decision Trees for multi-label classification. Implements algorithms from scratch with hyperparameter tuning to optimize accuracy and handle complex label relationships.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages