Toy data platform for a company that provides web analytics
Updated Mar 31, 2021 - HCL
Terraform module which creates a cluster of Apache Kafka brokers along with Yahoo CMAK and LinkedIn Cruise Control.
E2E Spark data pipelines with engineering fundamentals
Migrating a database from a third-party Apache Kafka® cluster to Yandex Managed Service for Apache Kafka®.
This project demonstrates how to ensure data security and compliance with industry regulations. This includes the use of GCP IAM for access control, GCP KMS for data encryption, GCP SCC for sensitive data discovery and classification, and GCP Audit Logs for logging and auditing.
Delivering data from an Apache Kafka® queue to MongoDB using Yandex Data Transfer.
Creating an Apache Kafka cluster with ZooKeeper, configured via Ansible to install dependencies
Configuring Kafka Connect to work with a Yandex Managed Service for Apache Kafka® cluster.
Terraform module which creates Amazon Machine Images (AMI) for Camellia using HashiCorp Packer and AWS CodeBuild.
Connecting to a Managed Service for Apache Kafka® cluster using kafka-ui.
Mirroring Apache Kafka® clusters using Yandex Data Transfer.
This project focuses on maintaining data quality and consistency across different data sources. It features Google Cloud Dataflow for data cataloging, Apache Airflow for ETL, Google Cloud Data Catalog for visual data preparation, and Snowflake for high-quality data storage and analysis.
Delivering data from an Apache Kafka® queue to Elasticsearch using Yandex Data Transfer.
This project focuses on scalable data processing and query performance optimisation. It uses Snowflake for data warehousing, GCP Cloud Functions for serverless compute, and Apache Kafka for real-time data streaming. It leverages the serverless capabilities of the systems for scalability and performance.
This project illustrates real-time data processing and analytics. It uses Apache Kafka for capturing and streaming real-time data, GCP Cloud Functions for processing data in real time, GCP PubSub for real-time notifications, and GCP Looker Studio for real-time data visualization.
Delivering data from an Apache Kafka® queue to Greenplum® using Yandex Data Transfer.
Instructions, code snippets, and more for the Devnexus 2023 Building Streaming Data Pipelines workshop.
Delivering data from an Apache Kafka® queue to ClickHouse® using Yandex Data Transfer.
Capturing changes from YDB and delivering them to Apache Kafka® using Yandex Data Transfer.