Skip to content

tchen0125/rainridepublic

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Rain or Ride

Rain and ride weather data analysis.

Project Overview

Members

Name NetID
Bongjun Jang bj2351
Luka Tragic lt2205
Terrance Chen tc3325

These are the people who I worked on this analysis with they all contributed majorly to this project.

Structure

  1. Ingestion: commands or codes to download data
  2. ETL: transform or clean data and store them in HDFS
  3. Profiling: provides insights for indivial data
  4. Analytics: statistical analysis on all dataset (correlation, etc.)

Public Usage

To run this code outside of a NYU premise you can upload the dataset to hadoop HDFS and to change the username in the code for the directories. You need to add the hard path of which directly for which.

Data should flow where they share folders for input output input output for each portion of the code.

About

Rain and ride weather data analysis.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages