Skip to content

Comprehensive exploratory data analysis of Kaggle's House Prices dataset using Python, pandas, seaborn, and matplotlib. Uncovers pricing patterns, feature relationships, and data insights through visualizations.

Notifications You must be signed in to change notification settings

Rivu5555/House-Regression

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

9 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🏑 House Prices Exploratory Data Analysis (EDA)

Python Pandas Seaborn License: MIT


πŸ“Š Overview

Discover the hidden patterns behind what drives house prices!
This project performs an in-depth exploratory data analysis (EDA) on the Kaggle House Prices - Advanced Regression Techniques dataset using Python and popular data science libraries.


πŸ“ Dataset

  • Rows: 1,460 Β  | Β  Columns: 81
  • Source: Kaggle Competition
  • Target Variable: SalePrice

πŸš€ Quick Start

  1. git clone https://github.com/Rivu5555/House-Regression.git cd house-prices-eda

  2. Install dependencies: pip install -r requirements.txt

  3. Run the Notebook: jupyter notebook house_prices_eda.ipynb

    If the data file is not present, download it here and place it in the input/ folder.


🧐 EDA Highlights

  • Target Distribution:
    SalePrice Distribution

  • Missing Value Analysis:
    Bar plots to visualize missing data columns.

  • Key Feature Relationships:

  • GarageArea vs SalePrice

  • OverallQual vs SalePrice

  • SaleType impact

  • Visual Insights:

  • Histograms, KDE plots, Boxplots

  • Scatter plots for feature relationships

  • Major Findings:

  • Higher OverallQual and GarageArea often predict higher SalePrice

  • Certain SaleTypes are linked to price outliers

  • Notable missing data in some categorical features


πŸ“¦ Requirements

  • Python 3.8+

  • pandas

  • numpy

  • matplotlib

  • seaborn

  • 🀝 Contribution

Pull requests are welcome! For major changes, open an issue first to discuss what you would like to change.


πŸ“š References


Happy Analyzing! πŸ πŸ“ˆ

About

Comprehensive exploratory data analysis of Kaggle's House Prices dataset using Python, pandas, seaborn, and matplotlib. Uncovers pricing patterns, feature relationships, and data insights through visualizations.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published