An exploratory data analysis of US road accidents data using Python's data analysis and visualization libraries.
- Introduction
- Data Preparation and Cleaning
- Exploratory Analysis and Visualization
- Insights
- Technologies Used
- Getting Started
- Usage
This project focuses on analyzing and visualizing the US road accidents dataset to gain insights into various aspects of accidents, such as their frequency, severity, time distribution, and geographic distribution. The analysis is performed using Python's data analysis and visualization libraries, including Pandas, Matplotlib, Seaborn, and Plotly.
The dataset is loaded using Pandas and cleaned to handle missing and incorrect values. Exploratory analysis is performed to understand the data's structure and identify potential issues.
The analysis includes:
- Distribution of accidents by city, state, and timezone
- Impact of weather conditions on accidents
- Frequency of accidents by hour, day, month, and year
- Severity of accidents and its impact on traffic
- Geographic distribution of accidents using interactive maps
The analysis yields insights such as:
- 50+ Insights
- Most accident-prone cities and states
- Trends in accidents over the years
- Peak hours and days for accidents
- Python
- Pandas
- Matplotlib
- Seaborn
- Plotly
- Jupyter Notebook
- Clone the repository:
git clone https://github.com/your-username/us-road-accidents-analysis.git
- Install the required packages using pip:
pip install -r requirements.txt
- Run the Jupyter Notebook for detailed analysis:
jupyter notebook Road_Accidents_Analysis.ipynb
Feel free to use the analysis and visualization code as a reference for your own projects or to gain insights from the US road accidents dataset.