Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Interact with your documents using the power of GPT, 100% privately, no data leaks
🙌 OpenHands: Code Less, Make More
real time face swap and one-click video deepfake with only a single image
A Gradio web UI for Large Language Models with support for multiple inference backends.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
A community-maintained Python framework for creating mathematical animations.
A modular graph-based Retrieval-Augmented Generation (RAG) system
Documentation that simply works
DSPy: The framework for programming—not prompting—language models
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
An open-source RAG-based tool for chatting with your documents.
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
Convert PDF to markdown + JSON quickly with high accuracy
Fully open reproduction of DeepSeek-R1
Official inference repo for FLUX.1 models
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
Stable Diffusion with Core ML on Apple Silicon
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
OCR, layout analysis, reading order, table recognition in 90+ languages
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
ChatterBot is a machine learning, conversational dialog engine for creating chat bots
pix2tex: Using a ViT to convert images of equations into LaTeX code.
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks.This framework enables Claud…
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"