Lists (2)
Sort Name ascending (A-Z)
Stars
Stable Diffusion web UI
Command-line program to download videos from YouTube.com and other video sites
A feature-rich command-line audio/video downloader
Robust Speech Recognition via Large-Scale Weak Supervision
A natural language interface for computers
Clone a voice in 5 seconds to generate arbitrary speech in real-time
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training andβ¦
A Gradio web UI for Large Language Models.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
π OpenHands: Code Less, Make More
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Instant voice cloning by MIT and MyShell.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
π° Desktop utility to download images/videos/music/text from various websites, and more.
Open-Sora: Democratizing Efficient Video Production for All
π Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Google Chromium, sans integration with Google
get things from one computer to another, safely
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
The official source code repository for the calibre ebook manager
We write your reusable computer vision tools. π
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objectiveβ¦
Industry leading face manipulation platform
GUI for a Vocal Remover that uses Deep Neural Networks.
Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).