
Netflix is known for its work in data science, AI, and ML, particularly for building strong recommendation models and algorithms. This analysis covers a dataset of more than 9,000 movies.


stepforward2023/Netflix-Movie-Data-Analysis-Project


Netflix-Movie-Data-Analysis-Project (Python | Data Visualization)

This project performs an in-depth exploratory data analysis (EDA) on a Netflix movie dataset containing more than 9,000 film records. The goal is to understand trends in genres, popularity, votes, and yearly release patterns through visual insights.

Netflix is known for its work in data science, AI, and ML, particularly for building strong recommendation models and algorithms that understand customer behaviour and patterns.

Features of the Project

  • Cleaned and preprocessed the Netflix dataset using Pandas
  • Handled missing values and standardized categorical features
  • Analyzed distribution of movies across genres
  • Visualized popularity patterns and vote counts
  • Identified most frequent genres on Netflix
  • Determined highest-voted and lowest-voted movies with their genres
  • Explored yearly movie production trends
  • Created multiple visualizations, including bar charts, count plots, and heatmaps
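The cleaning steps listed above might look roughly like the sketch below. It is a minimal illustration, not the notebook's actual code: the sample rows, the column names (`Title`, `Release_Date`, `Genre`, `Popularity`, `Vote_Count`), the `%Y-%m-%d` date format, the comma-separated genre layout, and the helper `clean_movies` are all assumptions made for the example.

```python
import numpy as np
import pandas as pd

# Hypothetical sample rows; the real dataset's columns and values may differ.
raw = pd.DataFrame({
    "Title": ["Movie A", "Movie B", "Movie C", "Movie C", "Movie D", "Movie E"],
    "Release_Date": ["2021-03-01", "2019-07-15", "not a date", "not a date",
                     "2021-11-20", "2021-06-10"],
    "Genre": ["Action, Drama", " comedy ", "Drama", "Drama", None, "drama"],
    "Popularity": [120.5, np.nan, 33.1, 33.1, 8.4, 20.5],
    "Vote_Count": [900, 150, 40, 40, 12, 300],
})


def clean_movies(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: deduplicate, fix types, and normalize genres."""
    out = df.drop_duplicates().copy()

    # Parse dates; values that do not match the assumed format become NaT.
    out["Release_Date"] = pd.to_datetime(
        out["Release_Date"], format="%Y-%m-%d", errors="coerce"
    )

    # Drop rows that lack the fields the analysis depends on.
    out = out.dropna(subset=["Release_Date", "Genre"])
    out["Release_Year"] = out["Release_Date"].dt.year

    # Fill remaining numeric gaps with the median of the surviving rows.
    out["Popularity"] = out["Popularity"].fillna(out["Popularity"].median())

    # One row per (movie, genre) pair, with consistent spacing and casing.
    out["Genre"] = out["Genre"].str.split(",")
    out = out.explode("Genre")
    out["Genre"] = out["Genre"].str.strip().str.title()

    return out.reset_index(drop=True)


clean = clean_movies(raw)
print(clean[["Title", "Genre", "Release_Year", "Popularity"]])
```

Exploding the genre column keeps later genre counts simple, at the cost of repeating a movie's numeric values once per genre, so per-movie statistics should deduplicate on the title first.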

Key Questions Answered

  • Which genre appears most frequently on Netflix?

  • Which genres have the highest votes?

  • Which movies have the highest popularity and what are their genres?

  • Which movies have the lowest popularity and what are their genres?

  • Which year had the most movies released?
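As a rough sketch, the questions above map onto a handful of Pandas aggregations. The data frame below is made-up sample data with assumed column names, laid out as one row per (movie, genre) pair. "Highest votes" is read here as the total vote count per genre, which is only one possible interpretation.

```python
import pandas as pd

# Hypothetical cleaned data: one row per (movie, genre) pair.
movies = pd.DataFrame({
    "Title": ["Movie A", "Movie A", "Movie B", "Movie E", "Movie F"],
    "Genre": ["Action", "Drama", "Comedy", "Drama", "Drama"],
    "Release_Year": [2021, 2021, 2019, 2021, 2020],
    "Popularity": [120.5, 120.5, 70.5, 20.5, 5.0],
    "Vote_Count": [900, 900, 150, 300, 50],
})

# Which genre appears most frequently?
most_frequent_genre = movies["Genre"].value_counts().idxmax()

# Which genres have the highest votes? (assumption: total votes per genre)
votes_by_genre = (
    movies.groupby("Genre")["Vote_Count"].sum().sort_values(ascending=False)
)

# Which movies have the highest and lowest popularity, and in which genres?
cols = ["Title", "Genre", "Popularity"]
most_popular = movies.loc[movies["Popularity"] == movies["Popularity"].max(), cols]
least_popular = movies.loc[movies["Popularity"] == movies["Popularity"].min(), cols]

# Which year had the most movies? Count each title once, not once per genre.
movies_per_year = (
    movies.drop_duplicates("Title").groupby("Release_Year")["Title"].count()
)
busiest_year = movies_per_year.idxmax()

print(most_frequent_genre, busiest_year)
print(votes_by_genre)
print(most_popular)
print(least_popular)
```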

Tech Stack

  • Python
  • Pandas
  • NumPy
  • Matplotlib
  • Seaborn
  • Jupyter Notebook

Visualizations

  • Genre frequency bar graphs
  • Popularity comparison graphs
  • Vote distribution plots
  • Release-year bar charts
  • Heatmaps for correlations
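Two of the chart types listed above, a genre count plot and a correlation heatmap, could be produced along these lines with Seaborn. This is a sketch on invented sample data. The column names, including `Vote_Average`, are assumptions and may not match the notebook.

```python
import matplotlib
matplotlib.use("Agg")  # headless backend for scripts; omit this line in Jupyter
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Hypothetical sample data with assumed column names.
movies = pd.DataFrame({
    "Genre": ["Drama", "Drama", "Drama", "Action", "Action", "Comedy"],
    "Popularity": [20.5, 5.0, 48.2, 120.5, 88.0, 70.5],
    "Vote_Count": [300, 50, 410, 900, 640, 150],
    "Vote_Average": [7.1, 5.4, 6.8, 6.5, 7.9, 6.0],
})

fig, (ax_genre, ax_corr) = plt.subplots(1, 2, figsize=(12, 4))

# Genre frequency: horizontal bars, most common genre first.
genre_order = movies["Genre"].value_counts().index
sns.countplot(data=movies, y="Genre", order=genre_order, ax=ax_genre)
ax_genre.set_title("Movies per genre")

# Pairwise correlation between the numeric columns.
corr = movies[["Popularity", "Vote_Count", "Vote_Average"]].corr()
sns.heatmap(corr, annot=True, cmap="coolwarm", vmin=-1, vmax=1, ax=ax_corr)
ax_corr.set_title("Correlation of numeric features")

fig.tight_layout()
```

In a notebook the figure renders inline once the cell finishes. In a script, `fig.savefig(...)` writes it to a file.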

How to Run the Project

  • Install dependencies: "pip install pandas numpy matplotlib seaborn"
  • Run the notebook: jupyter notebook

Conclusion

This analysis reveals Netflix's most popular genres, content trends over the years, and voting patterns. The project demonstrates strong skills in data cleaning, visualization, and pattern extraction.
