Skip to content

This project aims to evaluate the performance of different clustering algorithms on multiple datasets.

Notifications You must be signed in to change notification settings

hericlesferraz/comparative_study_clustering_techniques

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A Comparative Study of Clustering Techniques

Overview

This project aims to evaluate the performance of different clustering algorithms on multiple datasets. The algorithms compared are K-means, DBSCAN, and HDBSCAN. The datasets used for this analysis include Breast Cancer, Wine Quality, Abalone, and Ecoli datasets. The performance metrics used to evaluate the clustering algorithms are Silhouette Score, Rand Score, and Dunn Index.

Datasets

The datasets used in this project are:

  1. Breast Cancer Dataset: Contains features computed from a digitized image of a fine needle aspirate (FNA) of a breast mass.

  2. Wine Quality Dataset: Contains various chemical tests of wines and their quality ratings.0

  3. Abalone Dataset: Contains physical measurements of abalones and their age.

  4. Ecoli Dataset: Contains features related to E.coli proteins and their cellular localization sites.

Installation

To run this project, ensure you have Python installed on your system and poetry.

Clone the project:

git clone git@github.com:hericlesferraz/comparative_study_clustering_techniques.git

Enter inside the root project:

cd comparative_study_clustering_techniques

Install dependencies

poetry install

Usage

Run each cell on the juyter notebook, located in and enjoy the results:

comparative_study_clustering_techniques/jupyter/comparative_study_clustering_techniques.ipynb

Results

Plot results of metric in algorithms:

Image

Plot elbow methog:

Image

About

This project aims to evaluate the performance of different clustering algorithms on multiple datasets.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published