This project aims to provide insights into the Apple Store's app ecosystem by analyzing a dataset that includes details about various apps available on the platform. Key questions addressed include identifying the most popular app categories, determining the ideal pricing for apps, and strategies for maximizing user ratings.
The objective of this project is to extract actionable insights from the Apple Store app data to help stakeholders make informed decisions. The analysis covers various aspects such as popularity of app categories, app pricing strategies, and factors influencing user ratings.
The dataset used in this project is sourced from the Apple Store and includes the following two key tables:
- AppleStore
- applestore_description_combined
Please note that the data is used for educational purposes and may have limitations regarding its completeness and accuracy.
- SQL Lite
- MS SQL Server
The analysis consists of several steps including data cleaning, exploratory data analysis (EDA), and specific analysis answering the stakeholder questions. The main insights derived from the project are:
- Most Popular App Categories: Identification of categories with the highest number of downloads and active users.
- Pricing Strategies: Analysis on how app pricing correlates with popularity and revenue generation.
- Maximizing User Ratings: Factors that influence user ratings and recommendations to improve them.
Exploratory Data Analysis (EDA):
- Checking the Unique Apps in both the Tables
- Checking for any missing values in main key fields
- Finding the number of apps per genre
- Find the overview of the apps ratings
Data Analysis:
- Understaning whether paid apps have higher rating than free apps: CASE Statement
- Check if apps supporting more languages have higher ratings: CASE Statement
- Check genre with low ratings: CASE Statement
- Check if there is a correlation between the length of the app desc and user ratings: CASE Statement
- Check the top-rated apps for each genre: WINDOW Function
- Paid Apps have better ratings than Free Apps due to various reasons
- Apps that support more languagues i.e between 10-30 languages have better ratings
- Finance and Books applications have the lowest ratings; can be improved for better ratigs
- Applications that have longer descriptions have better ratings than others
- If a new app is getting launched on the app store, aiming the app to get ratings above 3.5
- Games and Entertainment applications have more competition on the app store
- Download the CSV File (Dataset taken from Kaggle)
- Combine all the 4 description files into one table using UNION ALL
- Perform EDA using SQL Queries - Basic SQL, Aggregate Functions
- Perform Data Analysis using SQL Queries - CASE Statements and WINDOW Functions