Highlights
- Pro
Starred repositories
Learn Low Level Design (LLD) and prepare for interviews using free resources.
Systems design is the process of defining the architecture, modules, interfaces, and data for a system to satisfy specified requirements. Systems design could be seen as the application of systems …
120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.
🕷️ Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
Python 3 script to dump/scrape/extract company employees from XING API
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Data, Benchmarks, and methods submitted to the M5 forecasting competition
LlamaIndex is the leading framework for building LLM-powered agents over your data.
ClickHouse® is a real-time analytics database management system
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Apache Superset is a Data Visualization and Data Exploration Platform
A list of useful resources to learn Data Engineering from scratch
⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
An example CI leveraging Argo Workflows
The cross-platform open-source app built for handwriting
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Build cross-platform desktop apps with JavaScript, HTML, and CSS
Cross platform GUI toolkit in Go inspired by Material Design
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
A distributed SQL database with replication, fault-tolerance, tunable consistency and leader election.