Stable Audio Open is an open-source model optimized for generating short audio samples, sound effects, and production elements using text prompts. The model was trained on data from Freesound and the Free Music Archive, respecting creator rights.
- Stable Audio Open is an open-source text-to-audio model that generates samples and sound effects up to 47 seconds long.
- Users can create drum beats, instrument riffs, ambient sounds, foley and production elements.
- The model enables audio variations and style transfer of audio samples.
The model is intended to be used with the stable-audio-tools library for inference.
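As a rough illustration of where the 47-second limit comes from and how a text prompt is typically packaged for inference, here is a minimal sketch. The sample rate and sample size are assumptions taken from the published Stable Audio Open 1.0 configuration and should be checked against the `model_config.json` you actually load. `build_conditioning` is a hypothetical helper, not part of stable-audio-tools; its dictionary keys follow the usage example on the model card.

```python
from typing import Dict, List, Union

# Assumed values from the Stable Audio Open 1.0 model config (verify locally).
SAMPLE_RATE = 44_100      # audio samples per second
SAMPLE_SIZE = 2_097_152   # samples generated per clip (2**21)

# Longest clip the model can produce, in seconds (about 47.55).
MAX_SECONDS = SAMPLE_SIZE / SAMPLE_RATE


def build_conditioning(
    prompt: str, seconds_total: float, seconds_start: float = 0.0
) -> List[Dict[str, Union[str, float]]]:
    """Hypothetical helper: build a one-item conditioning list for a prompt.

    The requested duration is clamped to the whole-second limit (47 s).
    """
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    limit = float(int(MAX_SECONDS))
    clamped = min(max(float(seconds_total), 0.0), limit)
    return [
        {
            "prompt": prompt,
            "seconds_start": float(seconds_start),
            "seconds_total": clamped,
        }
    ]


conditioning = build_conditioning("128 BPM tech house drum loop", seconds_total=30)
print(f"max duration: {MAX_SECONDS:.2f} s")
print(conditioning)
```

A list of this shape is what the model card passes as the conditioning argument to the generation function in stable-audio-tools. Requests longer than the limit are clamped here rather than rejected, which is a design choice of this sketch only.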
This tutorial consists of the following steps:
- Prerequisites
- Load the original model and run inference
- Convert the model to OpenVINO IR
- Compile the models and run inference
- Interactive inference
This is a self-contained example that relies solely on its own code.
We recommend running the notebook in a virtual environment. You only need a Jupyter server to start.
For details, please refer to the Installation Guide.