Skip to content

Hadoop based implementation for K-Means Algorithm on a thunderstorm dataset

License

Notifications You must be signed in to change notification settings

IT-Department-Projects/DCS-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This is a Python implementation of K-Means clustering algorithm using the Map Reduce paradigm. It is customized for processing a thunders dataset, extracted from STARNET (Sferics Timing And Ranging NETwork)

Under data/ folder is a sample of the thunders that "falled" on February 28, 2014.

Under results/ folder are CSV datasets containing all thunders of that date in a single file and the resulting clusters (for k=10 and k=50).

About

Hadoop based implementation for K-Means Algorithm on a thunderstorm dataset

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published