Skip to content
#

hiveql

Here are 19 public repositories matching this topic...

This project demonstrates real-world big data engineering practices using Apache Spark (PySpark). It covers the entire data pipeline — from ingestion, transformation, and validation to exploration and reporting. Ideal for data engineers and analysts looking to gain practical experience with Spark, Airflow, and data lake design.

  • Updated May 28, 2025
  • Python

Improve this page

Add a description, image, and links to the hiveql topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hiveql topic, visit your repo's landing page and select "manage topics."

Learn more