Skip to content

benchmark analysis of sorting algorithms with multi-level attribute comparisons for resolving duplicate entries in real-world datasets, including cross-platform performance metrics

Notifications You must be signed in to change notification settings

hemanialaparthi/wapbac-sorting

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

72 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Multi-Level Comparison Sorting Algorithms

Python 3.10+

A comparative analysis of sorting algorithms with multi-level attribute comparisons.

🔄 Fork Information

This project was forked from file-sorting-starter and significantly extended with:

  • Multi-level sorting capabilities
  • Expanded dataset support

📌 Overview

This project evaluates three sorting algorithms (Bubble Sort, QuickSort, and Timsort) when handling datasets with duplicate entries in primary attributes. Key features:

  • Hierarchical tie-breaking (country > name > phone_number > job > email)
  • Performance benchmarking across duplicate rates (25-100%)
  • Cross-platform analysis (macOS vs Windows)

🔗 Detailed Blog Post on Key Findings

🛠️ Installation

Requirements

  • Python 3.10+
  • No external dependencies
git clone https://github.com/hemanialaparthi/wapbac-sorting.git
cd wapbac-sorting
poetry install

🚀 Usage

Example Command:

poetry run filesorter --attribute email --approach bubblesort_multilevel --input-file input/people.txt --output-file output/people.txt

About

benchmark analysis of sorting algorithms with multi-level attribute comparisons for resolving duplicate entries in real-world datasets, including cross-platform performance metrics

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.6%
  • Nix 0.4%