Stars
Official repo for the paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
Open-Sora: Democratizing Efficient Video Production for All
[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
🔊 Text-Prompted Generative Audio Model
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
🧠+🎧 Build your music algorithms and AI models with the next-gen DAW 🔥
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…
RaveForce - An OpenAI Gym style toolkit for music generation experiments.
The "Hands-On Music Generation with Magenta" book code repository and info resource
Projects from the Deep Learning Specialization from deeplearning.ai provided by Coursera
Generates music (MIDI files) using a TensorFlow RNN
Awesome music generation model: MG²
SOTA Google's Perceiver-AR Music Transformer Implementation and Model
Music remixer based on MusicGen-Chord
This is a cog implementation of the fine-tuner for Meta's MusicGen
GPT3-based Multi-Instrumental MIDI Music AI Implementation
Text prompt steered synthetic audio generators
cRNN-GAN to generate music by training on instrumental music (MIDI)
Genius - A Modern Next.js 14 SaaS AI Platform.
Melody of Life is a step sequencer using cellular automata
Implementation of Music Generation in PyTorch
Generating Music and Lyrics using Deep Learning via Long Short-Term Recurrent Networks (LSTMs). Implements a Char-RNN in Python using TensorFlow.
Open source implementation of OpenAI's MuseNet