2017/01 ~2020/06 Yellow Taxi (28.4GB)
data rows : 317,547,921 rows
https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page
- 4 PCs
- 32 Cores
- 42 GB Ram
Using mapreduce-base K-Mean clustering Drop off point to find the best position which Waiting for guests
https://github.com/linzino7/Analyzing_NYC_Taxi_Data/blob/main/Report.pdf