gcp-dataproc

Here are 5 public repositories matching this topic...

prakashdontaraju / google-cloud-ecommerce

E-commerce GCP streaming pipeline: Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau. GCP batch pipeline: Cloud Storage, Dataproc, PySpark, Cloud Spanner and Tableau.

Updated Mar 9, 2022
Python

emanuelegiona / CC2019

Project for Cloud Computing course (A.Y. 2018/2019)

streaming apache-spark gcp python3 cloud-computing word-count sapienza-university gcp-dataproc

Updated Jan 28, 2020
Python

RickLeite / Hadoop-Google-DataProc-DIOstudy

Hadoop Google DataProc DIO study

hadoop google-cloud-platform gcp-cloud-functions gcp-dataproc digital-innovation-one

Updated Sep 4, 2021
Python

NaveedMohiuddin / real-time-stream-processing-kafka-spark-gcp

Real-time stream processing project using Apache Kafka and Spark Streaming on Google Cloud Dataproc. Includes Python producers/consumers, Spark DStream word count, and full deployment with screenshots.

python big-data pyspark spark-streaming apache-kafka real-time-data gcp-dataproc

Updated Apr 3, 2025
Python

ElhNour / large-scale-data-management-spark

Process large amounts of data and implement complex data analyses using Spark. The dataset, made available by Google, describes a cluster of 12,500 machines and the activity on that cluster over 29 days.

spark gcp-dataproc large-scale-data-analytics

Updated Jan 13, 2023
Python
