
Starred repositories
Lab for testing different Flink job latency optimization techniques covered in a Flink Forward 2021 talk
Apache RocketMQ is a cloud native messaging and streaming platform, making it simple to build event-driven applications.
Learn the basics of Apache Kafka® from leaders in the Kafka community with these video courses covering the Kafka ecosystem and hands-on exercises.
Flink CDC is a streaming data integration tool
Scripts and samples to support Confluent Demos, Talks, and Blogs. Not all of the examples in this repository are kept up to date. For automated tutorials and QA'd code, see https://github.com/confl…
A data generator source connector for Flink SQL based on data-faker.
😎 A curated list of amazingly awesome Flink and Flink ecosystem resources
This is a repo with links to everything you'd ever want to learn about data engineering
Apache Doris is an easy-to-use, high performance and unified analytics database.
Composable building blocks to build Llama Apps
Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end-to-end problems using Llama mode…
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
Reproduction study of the paper "Detoxifying Text with MaRCo: Controllable Revision with Experts and Anti-Experts"
⭐Build a stunning portfolio website with Next.js, Tailwind CSS and Framer Motion. If you want to learn how to create this, you can follow the tutorial link given in the README file.
🔥 A Complete List of GitHub Profile Badges and Achievements 🔥
Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.
The Athena adapter plugin for dbt (https://getdbt.com)
AI tool to build charts based on text input
This AWS SNS client library allows publishing messages to SNS that exceed the 256 KB message size limit.
Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift, Databricks) in real-time.
Code and documentation for the MariTalk API