Stars
Resources for the preparation course for the Databricks Data Engineer Associate certification exam
Omeka S is a web publication system for universities, galleries, libraries, archives, and museums. It consists of a local network of independently curated exhibits sharing a collaboratively built p…
Examples of using Terraform to deploy Databricks resources
A curated list of awesome big data frameworks, resources and other awesomeness.
A curated list of analytics frameworks, software and other tools.
List of data-hoarding related tools
Architecture decision record (ADR) examples for software planning, IT leadership, and template documentation
The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platfor…
Flink CDC is a streaming data integration tool
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
Learn how to scrape websites with Python, Selenium, Requests HTML, Celery, FastAPI, & NoSQL with Cassandra via AstraDB.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Web-based SQL editor. Legacy project in maintenance mode.
Definitions of DDD and fundamental concepts to reduce the learning curve and confusion
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
A solution to help you build automation and gitops in your Apache Kafka deployments. The Kafka gitops!
Samples and documentation for using the Amazon Neptune graph database service
Serverless application that demonstrates how to use AWS AppSync and Amazon Neptune to build a real-time, data-driven application.
An Open Standard for lineage metadata collection
NeoDash - a Dashboard Builder for Neo4j
A web UI for Debezium. Please log issues at https://issues.redhat.com/browse/DBZ.
Examples for running Debezium (configuration, Docker Compose files, etc.)
This repository contains a functional example of an order delivery service similar to UberEats, DoorDash, and Instacart.
transform-to-json-string is a Single Message Transformation (SMT) for Apache Kafka® Connect to convert a given Connect Record to a single JSON String. It's an UNOFFICIAL community project.
A repository of sample code to accompany our blog post on Airflow and dbt.
A self-contained dbt project for testing purposes