This project analyzes internet usage data from an ISP log file (isp.csv
) to uncover session behavior, peak usage times, daily/hourly trends, anomalies, and more. It includes data preprocessing, insightful analytics, and visualizations.
Input File: isp.csv
Each row in the dataset contains:
MAC Id
: Device identifierLogin Time
: Session start timeSession Time
: Duration of the session (HH:MM:SS)Download
: Data downloaded in MBUpload
: Data uploaded in MB
- Load and clean the dataset
- Engineer useful time-based features
- Analyze total download/upload, sessions, and durations
- Identify top usage days and active hours
- Compare weekday vs weekend usage
- Detect anomalies (long sessions, rapid relogins, high data usage)
- Visualize trends and insights
- Generate a summary report
- π¦ Total Download: ~935 GB
- π Total Upload: ~68 GB
- π Total Sessions: 1163
- π Average Session Duration: ~29.27 minutes
- π‘ Peak Download Day: 2025-06-17 (~102.78 GB)
- πΌ Upload/Download Ratio: ~0.07
- β± Longest Session: 61.48 minutes
- β‘ Shortest Session: 0 minutes (rapid relogin)
- π Line plot: Daily download/upload trends
- π Bar chart: Top 5 usage days
- π§ Heatmap: Hourly login distribution by weekday
- π Histogram: Session time distribution
- π₯§ Pie chart: Upload vs Download share
- π Cumulative usage plot: Over time
- π Daily session duration: Trend over the month
pandas
β data processingmatplotlib
/seaborn
β data visualizationnumpy
β numerical computationsjupyter notebook
β analysis interface
- Clone the repository
git clone https://github.com/your-username/internet-usage-analysis.git cd internet-usage-analysis