Replicate data from MySQL, Postgres and MongoDB to ClickHouse®
Updated Mar 13, 2025 - Python
CDC noticeboard scraper
Automated Pipeline to Generate FTP Files and Manage Submission of Sequence Data to Public Repositories
Built a real-time streaming pipeline to extract stock data using Apache NiFi, Debezium, Kafka, and Spark Streaming. Loaded the transformed data into a Glue database and created real-time dashboards using Power BI and Tableau with Athena. The pipeline is orchestrated using Airflow.
A data and analytics engineering platform designed for real-time sports betting analytics.
An acquisition and processing toolkit for open access phenology data.
Epidemiological weeks calculation based on the US CDC (MMWR) and ISO week numbering systems
This project helps IIT KGP students who are sitting for placements and internships get alerts on time.
A Python package for the National Syndromic Surveillance Program (NSSP) and its Community of Practice. A collection of classes and methods to advance the practice of Syndromic Surveillance.
Keeps an RDB table in sync with a Hive structured store, with Kafka added as a buffer between the two.
This project tries to give a glimpse of the different variants of SARS-CoV-2 in the world.
Code to set up CDC applications for the blog post "Enable change data capture on RDS for MySQL applications that are using XA transactions".
This is a tryout I prepared to demonstrate CDC (change data capture) using MySQL, Maxwell and Kafka.
An ensemble of BERTs for classifying injury narratives
How to build a complete Data Platform