
Starred repositories
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
An extremely fast Python package and project manager, written in Rust.
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
An MCP server to search for accurate academic articles.
Train your AI self, amplify you, bridge the world
🔥 Open Source No Code Web Data Extraction Platform. Turn Websites To APIs & Spreadsheets With No-Code Robots In Minutes 🔥
FastOpenAPI is a library for generating and integrating OpenAPI schemas using Pydantic v2 and various frameworks (AioHttp, Falcon, Flask, Quart, Sanic, Starlette, Tornado).
Create web-based user interfaces with Python. The nice way.
A self-hosted dashboard that puts all your feeds in one place
DuckDB is an analytical in-process SQL database management system
Weave your codebase into a single, navigable Markdown document
Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Integrate the DeepSeek API into popular software
The Metadata Platform for your Data and AI Stack
Python tool for converting files and office documents to Markdown.
Kyanos is a networking analysis tool using eBPF. It can visualize the time packets spend in the kernel and capture requests/responses, making troubleshooting more efficient.
The official Python SDK for Model Context Protocol servers and clients
Notes on the design and implementation of Apache Spark
Self-hosted game stream host for Moonlight.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
📊 Cube’s universal semantic layer platform is the next evolution of OLAP technology for AI, BI, spreadsheets, and embedded analytics
Share a single keyboard and mouse between multiple computers.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN