AI
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Dead simple FLUX LoRA training UI with LOW VRAM support
Add Automatic Captions to YouTube Shorts with AI
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Automate browser-based workflows with LLMs and Computer Vision
A natural language interface for computers
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
The official gpt4free repository | a collection of various powerful language models
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Official release of InternLM2.5 base and chat models. 1M context support
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This framework enables Claud…
o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalitie…
📃 A better UX for chat, writing content, and coding with LLMs.
AirLLM: 70B inference on a single 4GB GPU
Prompt, run, edit, and deploy full-stack web applications
An open-source RAG-based tool for chatting with your documents.
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
🚀🤖 Crawl4AI: Open-source LLM-Friendly Web Crawler & Scraper
A full-featured, hackable Next.js AI chatbot built by Vercel
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Prompt, run, edit, and deploy full-stack web applications using any LLM you want!