Skip to content

This project demonstrates how to generate synthetic (marine ecological) data and apply unsupervised machine learning (hierarchical clustering) to explore patterns in policy coverage across marine zones.

License

DolapoSalim/hierarchical-clustering-and-dendrogram

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

5 Commits
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Marine Habitat Clustering Using Hierarchical Analysis

Contained in this repo is a project that demonstrates how to generate synthetic marine ecological data and apply unsupervised machine learning (hierarchical clustering) to explore patterns in policy coverage across marine zones.


Project Overview

  • ๐Ÿ”ง Data Generation: Simulates 20 marine zones with binary presence/absence data for 6 ecological policies.
  • ๐Ÿง  Distance Metric: Jaccard distance โ€” ideal for binary attributes.
  • ๐ŸŒณ Clustering Method: Hierarchical clustering with complete linkage.
  • ๐ŸŒฟ Visualization: Dendrogram to reveal how zones group based on shared protections.
  • ๐Ÿ“Š Output:
    • generated_marine_zones.csv โ€” synthetic raw data
    • clustered_marine_zones.csv โ€” same data with cluster labels
    • dendrogram_marine_zones.png โ€” dendrogram image

Ecological Policies Simulated

Each marine zone is evaluated for the presence (1) or absence (0) of the following protections:

  • Coral Reef Protection
  • Fishing Ban
  • Turtle Nesting Zone
  • Oil Drilling Ban
  • Marine Sanctuary Status
  • Mangrove Forest Protection

๐Ÿ› ๏ธ How to Run

  1. Clone or download this repo
  2. Install dependencies:
    pip install numpy pandas scipy matplotlib

About

This project demonstrates how to generate synthetic (marine ecological) data and apply unsupervised machine learning (hierarchical clustering) to explore patterns in policy coverage across marine zones.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published