BigQuery
Google BigQuery enables companies to handle large amounts of data without having to manage infrastructure. Google’s documentation describes it as a « serverless architecture (that) lets you use SQL queries to answer your organization’s biggest questions with zero infrastructure management. BigQuery’s scalable, distributed analysis engine lets you query terabytes in seconds and petabytes in minutes. » Its client libraries allow the use of widely known languages such as Python, Java, JavaScript, and Go. Federated queries are also supported, making it flexible to read data from external sources.
📖 A highly rated canonical book on it is « Google BigQuery: The Definitive Guide », a comprehensive reference.
Another enriching read on the subject is the inside story told in the article by the founding product manager of BigQuery celebrating its 10th anniversary.
Here are 83 public repositories matching this topic...
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
-
Updated
Nov 3, 2025 - Java
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
-
Updated
Oct 22, 2025 - Java
Firehose is an extensible, no-code, and cloud-native service to load real-time streaming data from Kafka to data stores, data lakes, and analytical storage systems.
-
Updated
Sep 12, 2024 - Java
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
-
Updated
Oct 31, 2025 - Java
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
-
Updated
Jun 3, 2024 - Java
CATA.Search. Blockchain database, cata metadata query
-
Updated
Aug 19, 2021 - Java
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
-
Updated
Mar 5, 2024 - Java
Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data types and format characters) using Java.
-
Updated
Nov 2, 2025 - Java
Official repository of SquashQL, the SQL query engine for multi-dimensional and hierarchical analysis that empowers your SQL database
-
Updated
Oct 25, 2025 - Java
Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
-
Updated
Oct 12, 2020 - Java
Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub
-
Updated
Feb 13, 2018 - Java
Convenient Dataflow pipelines for transforming data between cloud data sources
-
Updated
Feb 14, 2024 - Java
Replicates any database (CDC events) to Bigquery in real time
-
Updated
Oct 27, 2025 - Java
Use Remote Functions to tokenize data with DLP in BigQuery using SQL
-
Updated
May 29, 2025 - Java
Released May 19, 2010
- Followers
- 67 followers
- Repository
- GoogleCloudPlatform/bigquery-utils
- Website
- github.com/topics/bigquery
- Wikipedia
- Wikipedia