AI
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Robust Speech Recognition via Large-Scale Weak Supervision
A browser interface based on the Gradio library for OpenAI's Whisper model.
Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration
OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.
Using OpenAI's Whisper to automatically generate YouTube subtitles
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Official Code for DragGAN (SIGGRAPH 2023)
GPT 3.5/4 with a Chat Web UI. No API key required.
The official gpt4free repository | various collection of powerful language models | o3 and deepseek r1, gpt-4.5
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.