Skip to content

Commit e2c28de

Browse files
authored
Update README.md
1 parent 0d8953b commit e2c28de

File tree

1 file changed

+36
-2
lines changed

1 file changed

+36
-2
lines changed

README.md

Lines changed: 36 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,36 @@
1-
# hierarchical-clustering-and-dendrogram
2-
This project demonstrates how to generate synthetic (marine ecological) data and apply **unsupervised machine learning** (hierarchical clustering) to explore patterns in policy coverage across marine zones.
1+
# Marine Habitat Clustering Using Hierarchical Analysis
2+
3+
Contained in this repo is a project that demonstrates how to **generate synthetic marine ecological data** and apply **unsupervised machine learning** (hierarchical clustering) to explore patterns in policy coverage across marine zones.
4+
5+
---
6+
7+
### Project Overview
8+
9+
- 🔧 **Data Generation**: Simulates 20 marine zones with binary presence/absence data for 6 ecological policies.
10+
- 🧠 **Distance Metric**: Jaccard distance — ideal for binary attributes.
11+
- 🌳 **Clustering Method**: Hierarchical clustering with complete linkage.
12+
- 🌿 **Visualization**: Dendrogram to reveal how zones group based on shared protections.
13+
- 📊 **Output**:
14+
- `generated_marine_zones.csv` — synthetic raw data
15+
- `clustered_marine_zones.csv` — same data with cluster labels
16+
- `dendrogram_marine_zones.png` — dendrogram image
17+
---
18+
19+
### Ecological Policies Simulated
20+
21+
Each marine zone is evaluated for the presence (1) or absence (0) of the following protections:
22+
- Coral Reef Protection
23+
- Fishing Ban
24+
- Turtle Nesting Zone
25+
- Oil Drilling Ban
26+
- Marine Sanctuary Status
27+
- Mangrove Forest Protection
28+
29+
---
30+
31+
## 🛠️ How to Run
32+
33+
1. Clone or download this repo
34+
2. Install dependencies:
35+
```bash
36+
pip install numpy pandas scipy matplotlib

0 commit comments

Comments
 (0)