OpenDCAI

Define the future of Data-centric AI together

We are dedicated to advancing research and open-source tools in Data-Centric Artificial Intelligence (DCAI).

Our goal is to develop effective and efficient DCAI systems and algorithms that support and enhance the performance of AI models and applications.

Newly Released Works

🔥 2025/6/29 Our DCAI system, DataFlow, has been released!

Pinned

  1. DataFlow Public

    Easy data preparation with the latest LLM-based operators and pipelines.

    Python 1.4k 89

  2. MyScaleDB Public

    Forked from OriginHubAI/MyScaleDB

    AI Database for unified, scalable SQL + vector data management, search and analytics

    C++ 37

Repositories

Showing 10 of 17 repositories
  • DataFlow Public

    Easy data preparation with the latest LLM-based operators and pipelines.

    Python · 1,362 stars · Apache-2.0 · 89 forks · 10 open issues · 2 open pull requests · Updated Sep 30, 2025
  • DataFlow-MM Public

    DataFlow-MM provides multimedia operators for DataFlow, aimed at preparing data for multimedia use cases.

    Python · 5 stars · Apache-2.0 · 10 forks · 1 open issue · 2 open pull requests · Updated Sep 30, 2025
  • RayOrch Public

    A flexible framework for orchestrating deep learning models with Ray. It dynamically schedules and serves multiple models, from NLP (e.g., FastText) to CV (e.g., YOLO, SAM), enabling scalable, distributed, and efficient multi-model inference.

    0 stars · 0 forks · 0 open issues · 0 open pull requests · Updated Sep 29, 2025
  • DataFlex Public

    DataFlex is a data-centric training framework that enhances model performance by either selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.

    Python · 23 stars · 5 forks · 0 open issues · 0 open pull requests · Updated Sep 29, 2025
  • DataFlex-Doc Public

    DataFlex is a data-centric training framework that enhances model performance by either selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.

    Python · 1 star · 3 forks · 0 open issues · 0 open pull requests · Updated Sep 29, 2025
  • DataFlow-Doc Public

    Documentation for DataFlow, a data-centric AI system for LLMs.

    Python · 8 stars · 23 forks · 4 open issues · 3 open pull requests · Updated Sep 26, 2025
  • joyagent-jdgenie Public template Forked from jd-opensource/joyagent-jdgenie

    An open-source, end-to-end, product-grade general-purpose agent.

    Java · 0 stars · Apache-2.0 · 1,133 forks · 0 open issues · 0 open pull requests · Updated Sep 26, 2025
  • Awesome_MLLMs_Reasoning Public
    108 stars · 5 forks · 0 open issues · 0 open pull requests · Updated Sep 11, 2025
  • SciReasoner Public
    Python · 4 stars · GPL-3.0 · 0 forks · 0 open issues · 0 open pull requests · Updated Aug 26, 2025
  • vts-v Public
    Python · 10 stars · 0 forks · 0 open issues · 0 open pull requests · Updated Aug 11, 2025
