Stars
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Robust Speech Recognition via Large-Scale Weak Supervision
Scrapy, a fast high-level web crawling & scraping framework for Python.
A Gradio web UI for Large Language Models with support for multiple inference backends.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
State-of-the-art 2D and 3D Face Analysis Project
☁️ Build multimodal AI applications with cloud-native stack
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
Open standard for machine learning interoperability
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero
OCR, layout analysis, reading order, table recognition in 90+ languages
Letta (formerly MemGPT) is a framework for creating LLM services with memory.
Faster Whisper transcription with CTranslate2
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.