Skip to content

Aspiring Data Scientist | Skilled in Python, Pandas, NumPy, Matplotlib, and Data Analysis. Passionate about turning raw data into actionable insights and developing data-driven solutions.

Notifications You must be signed in to change notification settings

Ishuz-data-Git/pandas-student-performance-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 
Β 
Β 

Repository files navigation

🧠 Pandas Student Performance Analysis πŸ“‹ Project Overview

This project analyzes student exam performance using Pandas, NumPy, and Matplotlib. It demonstrates how data can be cleaned, transformed, analyzed, and visualized to extract key insights about student outcomes such as grades, pass/fail ratio, and preparation effectiveness.

🧹 Step 1–2: Data Cleaning & Preparation

Loaded dataset using Pandas

Checked for missing values and duplicates

Renamed inconsistent column names for clarity

Converted categorical columns (Gender, Ethnicity, etc.) to category type

βš™οΈ Step 3: Feature Engineering

Created new columns:

Total_Score β†’ Sum of all subjects

Average_Score β†’ Mean of scores per student

Result β†’ Pass/Fail classification (based on average β‰₯ 33)

Grades β†’ Assigned grade letters (A, B, C, D, E, F) using conditional logic

Used NumPy operations for efficient calculations

πŸ” Step 4: Exploratory Data Analysis (EDA)

Average Score by Gender

Average Score by Ethnicity

Effect of Test Preparation on performance

Correlation analysis between numerical features

πŸ“Š Step 5: Visualization Dashboard

Created an interactive data visualization dashboard using Matplotlib and Seaborn:

Visualization Purpose Histogram Distribution of average scores Barplots Comparison by Gender, Ethnicity, and Test Prep Boxplot Spread of scores by Gender Heatmap Correlation between numerical variables Countplot Pass/Fail distribution

πŸ’Ύ Step 6: Export Cleaned Data

Exported the final cleaned dataset for reuse or ML modeling:

df.to_csv('StudentsPerformance_Cleaned.csv', index=False)

File saved as β†’ StudentsPerformance_Cleaned.csv

πŸ“ˆ Insight Highlights:

Students who completed Test Preparation scored significantly higher.

Female students slightly outperformed males on average.

Group E ethnicity performed best overall.

Strong positive correlation between Math, Reading, and Writing scores.

About

Aspiring Data Scientist | Skilled in Python, Pandas, NumPy, Matplotlib, and Data Analysis. Passionate about turning raw data into actionable insights and developing data-driven solutions.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published