Skip to content

Predicting Housing Market Conditions using PySpark and Machine Learning

Notifications You must be signed in to change notification settings

Kapil1917T/housing-market-prediction-with-zillow

Repository files navigation

Housing Market Prediction

Predicting Demand and Supply of Housing Markets in the United States using PySpark and Machine Learning.

Project Overview

This project aims to predict housing market conditions (buyers' market, sellers' market, or neutral market) using machine learning techniques and PySpark. We analyze multiple factors including:

  • Housing market metrics from Zillow
  • Building permits data
  • Treasury bill rates
  • Bank failure information

Data Sources

Project Structure

housing-market-prediction/
├── data/               # Data files
├── notebooks/         # Jupyter notebooks
├── src/               # Source code
├── docs/             # Documentation
└── requirements.txt  # Project dependencies

Key Features

  • Multi-source data integration using PySpark
  • Machine learning models:
    • Logistic Regression
    • Random Forest
    • Naive Bayes
  • Handling class imbalance
  • Seasonal trend analysis
  • Geographic market variation analysis

Results

  • Achieved over 80% accuracy in market condition prediction
  • Logistic Regression performed best with:
    • 87% precision for sellers' market
    • 85% precision for buyers' market
  • Successfully identified seasonal patterns in market behavior

Setup and Installation

  1. Clone the repository
git clone https://github.com/YOUR_USERNAME/housing-market-prediction.git
cd housing-market-prediction
  1. Install dependencies
pip install -r requirements.txt

Contributors

  • Aadit Malikayil
  • Bhavika Karale
  • Kapil Tare
  • Shreyas Kashyap

About

Predicting Housing Market Conditions using PySpark and Machine Learning

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published