BigQuery
Google BigQuery lets companies analyze large volumes of data without managing infrastructure. Google’s documentation describes it as a "serverless architecture [that] lets you use SQL queries to answer your organization's biggest questions with zero infrastructure management. BigQuery's scalable, distributed analysis engine lets you query terabytes in seconds and petabytes in minutes." Client libraries are available for widely used languages such as Python, Java, JavaScript, and Go. Federated queries are also supported, so data held in external sources can be queried directly.
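As a rough illustration of the SQL-first access and federated queries mentioned above, here is a minimal Java sketch that builds (but does not send) a request to BigQuery's REST `jobs.query` endpoint using only the JDK's `java.net.http` package. The official Java client library is the more usual route; this version avoids external dependencies. The class name, project ID, connection ID, table name, and `ACCESS_TOKEN` placeholder are all hypothetical, and the escaping helper is deliberately minimal.

```java
import java.net.URI;
import java.net.http.HttpRequest;

public class BigQueryRestSketch {

    // Minimal escaping for a double-quoted string literal. Handles only
    // backslash, double quote, and newline; a real JSON library is safer.
    public static String escape(String s) {
        return s.replace("\\", "\\\\").replace("\"", "\\\"").replace("\n", "\\n");
    }

    // Wraps SQL meant for an external database in GoogleSQL's EXTERNAL_QUERY
    // function. The connection ID is assumed to name an existing connection.
    public static String federatedQuery(String connectionId, String externalSql) {
        return "SELECT * FROM EXTERNAL_QUERY(\"" + escape(connectionId)
                + "\", \"" + escape(externalSql) + "\")";
    }

    // JSON body for jobs.query; useLegacySql=false selects GoogleSQL.
    public static String queryRequestBody(String sql) {
        return "{\"query\": \"" + escape(sql) + "\", \"useLegacySql\": false}";
    }

    // Builds the POST request. Sending it would additionally need an
    // HttpClient and a valid OAuth 2.0 access token (assumed here).
    public static HttpRequest queryRequest(String projectId, String accessToken, String sql) {
        URI uri = URI.create("https://bigquery.googleapis.com/bigquery/v2/projects/"
                + projectId + "/queries");
        return HttpRequest.newBuilder(uri)
                .header("Authorization", "Bearer " + accessToken)
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(queryRequestBody(sql)))
                .build();
    }

    public static void main(String[] args) {
        // Hypothetical identifiers, for illustration only.
        String sql = federatedQuery("my-project.us.my-connection",
                "SELECT id, name FROM customers");
        HttpRequest request = queryRequest("my-project", "ACCESS_TOKEN", sql);
        System.out.println(request.method() + " " + request.uri());
        System.out.println(queryRequestBody(sql));
    }
}
```

The request is only constructed here, so the sketch can be run without credentials; passing it to `HttpClient.send` with a real token would execute the query.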
📖 A highly rated, comprehensive reference on the subject is the book "Google BigQuery: The Definitive Guide".
Another worthwhile read is the inside story of BigQuery, told by its founding product manager in an article marking the product's 10th anniversary.
Here are 81 public repositories matching this topic...
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
Updated Oct 14, 2024 - Java

Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Updated Oct 8, 2024 - Java

BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Updated Oct 8, 2024 - Java

Firehose is an extensible, no-code, and cloud-native service to load real-time streaming data from Kafka to data stores, data lakes, and analytical storage systems.
Updated Sep 12, 2024 - Java

Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Updated Oct 14, 2024 - Java

CATA.Search. Blockchain database, cata metadata query
Updated Aug 19, 2021 - Java

Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Updated Jun 3, 2024 - Java

Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
Updated Mar 5, 2024 - Java

Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
Updated Oct 12, 2020 - Java

Official repository of SquashQL, the SQL query engine for multi-dimensional and hierarchical analysis that empowers your SQL database
Updated Jun 20, 2024 - Java

Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub
Updated Feb 13, 2018 - Java

Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data types and format characters) using Java.
Updated Oct 7, 2024 - Java

Convenient Dataflow pipelines for transforming data between cloud data sources
Updated Feb 14, 2024 - Java

Use Remote Functions to tokenize data with DLP in BigQuery using SQL
Updated Oct 1, 2024 - Java
Released May 19, 2010
- Followers: 54
- Repository: GoogleCloudPlatform/bigquery-utils
- Website: cloud.google.com/bigquery
- Wikipedia