Pinned
-
EL_gcs-to-bigquery (Public)
An EL pipeline built with Apache Airflow that downloads a file from the web, uploads it to Google Cloud Storage, and creates an external table in BigQuery for storage and analysis.
Python
-
ETL_spark-on-dataproc (Public)
A PySpark project that performs ETL on a Dataproc cluster and writes the results to Google Cloud Storage and BigQuery.
Jupyter Notebook