Exploratory Data Analysis (EDA) is a fundamental aspect of Data Science, involving the thorough examination of a dataset to unveil underlying patterns, trends, and associations. This process enhances our comprehension of the dataset's content and aids in making informed choices and devising effective approaches for addressing tangible business challenges. In this notebook, I will lead you through a practical application of Exploratory Data Analysis utilizing the Python programming language.
To begin with the EDA process, I will use the dataset about Instagram Reach and this dataset could be found HERE.
Overall, the dataset consists of a total of 119 rows and 13 columns with the column details as the following:
- Impressions: Count of post views (Reach)
- Home Reach: Reach originating from the home feed
- Hashtag Reach: Reach resulting from hashtags
- Explore Reach: Reach generated by appearing on the Explore page
- Other Reach: Reach from sources other than the aforementioned
- Saves: Total saves of the post
- Comments: Total comment count
- Shares: Total number of shares
- Likes: Total number of likes received
- Profile Visits: Count of visits to the profile via the post
- Follows: Number of new followers gained through the post
- Caption: Text accompanying the post
- Utilized Hashtags: Hashtags employed in the post
Check this repository in Kaggle
Disclaimer: This notebook is for my learning and practice purpose only, and the source code I run here is referenced from THIS GUIDE