PauloCarneiro99
  • Madrid, Spain

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 26,559 5,434 Updated Jan 6, 2025

The official repository for the Rock the JVM Spark Essentials with Scala course

Scala 267 355 Updated Feb 10, 2025

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…

Scala 726 147 Updated Feb 7, 2025

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 64,434 14,463 Updated Feb 14, 2025

Visualize dependencies between Airflow DAGs

HTML 49 7 Updated May 7, 2021

Chromium Binary for AWS Lambda and Google Cloud Functions

TypeScript 3,247 296 Updated Sep 3, 2024

reCAPTCHA solver for Puppeteer.

JavaScript 594 109 Updated Apr 23, 2024

An Amazon Athena driver for Metabase 0.32 and later

Clojure 224 32 Updated Dec 8, 2022

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 38,740 14,642 Updated Feb 14, 2025

Custom Jest Assertions for Serverless integration testing.

TypeScript 195 20 Updated Feb 12, 2023

GitHub profile README.md with dynamic images generated from React.js components. Inspired by natemoo-re

TypeScript 225 167 Updated Apr 14, 2023

😎 A curated list of awesome GitHub profiles that updates in real time

25,599 3,858 Updated Aug 19, 2024

A Data Engineering & Machine Learning Knowledge Hub

Python 1,119 225 Updated Feb 2, 2024

A guide for using AWS Batch jobs with Fargate from CloudFormation

8 Updated Dec 11, 2020

Import and export tools for Elasticsearch & OpenSearch

JavaScript 7,640 861 Updated Feb 14, 2025

An IaaS Implementation of ELK Stack 🎓

16 72 Updated Apr 7, 2021

Step Functions Data Science SDK for building machine learning (ML) workflows and pipelines on AWS

Python 290 89 Updated May 19, 2023

A plugin to sync local directories and S3 prefixes for Serverless Framework ⚡

JavaScript 1 1 Updated Nov 16, 2021

AWS Step Functions plugin for Serverless Framework ⚡️

JavaScript 1,037 214 Updated Feb 14, 2025

Curated list of resources about Apache Airflow

Shell 3,737 496 Updated Aug 20, 2024

Consuming GitHub API :)

TypeScript 4 Updated Sep 4, 2020

This repo provides a managed SageMaker jupyter notebook with a number of notebooks for hands on workshops in data lakes, AI/ML, Batch, IoT, and Genomics.

Jupyter Notebook 123 60 Updated Jan 20, 2025

An end-to-end serverless application that extracts thumbnails from video files using AWS Fargate, AWS Lambda and the Serverless Framework.

JavaScript 50 11 Updated Mar 30, 2019

Configuration with AWS Step Functions and Lambda functions that initiates processing from an activity state

Python 120 30 Updated Nov 30, 2024

🍕 Repository gathering information on study materials for data analysis and related fields, companies that work with data, and a dictionary of concepts

2,383 484 Updated Apr 5, 2024

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

C++ 13,491 1,183 Updated Jul 29, 2024

News program manager system (Node.js, React.js, PostgreSQL, Docker)

JavaScript 2 1 Updated Dec 12, 2022

Comparing stand-up comedians using natural language processing

Jupyter Notebook 1,720 1,360 Updated Dec 31, 2022