An end-to-end Exploratory Data Analysis of the Steam gaming platform dataset, uncovering market trends, pricing strategies, genre dynamics, and what makes a game succeed.
- Which genres dominate the Steam marketplace?
- Does price correlate with review scores?
- Are free games rated better or worse than paid games?
- How has Steam's release volume changed over time?
- What release timing strategies do publishers use?
- Does language support affect game success?
- Steam's release volume peaked in [year] and has been [trend]
- The median price for paid games is far lower than most people expect
- Indie games dominate by volume but [genre] dominates by rating
- Free games receive [better/worse] reviews than paid counterparts
- Q4 (Oct-Dec) sees significantly more releases — holiday season strategy
- Python — Pandas, NumPy, Matplotlib, Seaborn
- Dataset — Steam Games Dataset (Kaggle)
- Visualizations — 5 multi-panel analysis charts
- Clone the repo
python -m venv venvthen activatepip install -r requirements.txt- Place
games.csvin thedata/folder python analysis.py- Charts saved to
outputs/folder
pandas numpy matplotlib seaborn plotly
- Overview — Dataset size, price distribution, platform support, review sentiment
- Genre Analysis — Top genres by count and avg review score
- Pricing Intelligence — Price distribution, free vs paid, price tier analysis
- Market Trends — Release volume, genre trends, price trends over time
- Deep Insights — Publishers, achievements vs reviews, seasonal patterns