
- nanjing
- https://xxzuo.github.io/
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
A lightweight data processing framework built on DuckDB and 3FS.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Open source RabbitMQ: core server and tier 1 (built-in) plugins
⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 600+ plugins. Alternative to Airflow, VMware vRealize Automation, Rundeck...
Always know what to expect from your data.
A curated list of open source tools used in analytics platforms and data engineering ecosystem
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
The official home of the Presto distributed SQL query engine for big data
Fluss is a streaming storage built for real-time analytics.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
alibabacloud-maxcompute-tool-migrate
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
Open, Multi-modal Catalog for Data & AI
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data ever…
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.