This project illustrates practical SQL-based data cleaning using a publicly available Global Layoffs dataset from Kaggle. The focus is on transforming raw data into a standardized, consistent, and high-quality dataset suitable for analysis or reporting.
- Preserved the raw dataset and cleaned using staging tables
- Removed duplicate records using
ROW_NUMBER() - Standardized company, industry, country, and date fields
- Handled NULL and missing values logically
- Filtered out rows lacking usable layoff information
- SQL (MySQL-compatible)
- Window functions (
ROW_NUMBER) - Data standardization
- Date transformation
- Data quality validation
A clean and consistent dataset that is ready for exploratory data analysis or further reporting tasks.