Stars
This is a repo with links to everything you'd ever want to learn about data engineering
The official repository for the Rock the JVM Spark Essentials with Scala course
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…
Apache Superset is a Data Visualization and Data Exploration Platform
Visualize dependencies between Airflow DAGs
Chromium Binary for AWS Lambda and Google Cloud Functions
reCAPTCHA solver for Puppeteer.
An Amazon Athena driver for Metabase 0.32 and later
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Custom Jest Assertions for Serverless integration testing.
GitHub profile README.md with dynamic images generated from React.js components. Inspired by natemoo-re
😎 A curated list of awesome GitHub profiles, updated in real time
A Data Engineering & Machine Learning Knowledge Hub
A guide for using AWS Batch jobs with Fargate from CloudFormation
Import and export tools for Elasticsearch & OpenSearch
Step Functions Data Science SDK for building machine learning (ML) workflows and pipelines on AWS
A plugin to sync local directories and S3 prefixes for Serverless Framework ⚡
AWS Step Functions plugin for Serverless Framework ⚡️
Curated list of resources about Apache Airflow
This repo provides a managed SageMaker Jupyter notebook instance with a number of notebooks for hands-on workshops in data lakes, AI/ML, Batch, IoT, and Genomics.
An end-to-end serverless application that extracts thumbnails from video files using AWS Fargate, AWS Lambda and the Serverless Framework.
Configuration of AWS Step Functions and Lambda functions that initiates processing from an activity state
🍕 Repository for gathering information on study materials for data analysis and related fields, companies that work with data, and a dictionary of concepts
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
News program manager system (Node.js, React.js, PostgreSQL, Docker)
Comparing stand-up comedians using natural language processing