Basic Audio Support, Vendor Builds
This update focuses on splitting the UI builds per vendor as some vendors support more feature than others
What’s New
Supertonic & Whisper
- Supertonic TTS model added (ONNX)
- Whisper STT models added (ONNX)
Vendor Builds
- Diffuse will now release builds per vendor.
Bug Fixes & Stability Improvements
Various fixes and internal cleanups to improve reliability during model loading and inference.
Full Changelog: v0.4.8...v0.5.1
Installation
1. Installer Version
- Uninstall Diffuse vX.X.X
- Download and run Diffuse_v0.5.1.exe
- Follow the on-screen instructions
2. Standalone Version
-
Download and extract Diffuse_v0.5.1.zip
A fast SSD with plenty of free space is recommended, as model downloads can be large. -
Run Diffuse.exe
Diffuse will automatically:
- Install an isolated portable Python runtime
- Create the required virtual environment
- Download the selected model from Hugging Face
First-run notice
On first launch or when loading a model for the first time, setup may take several minutes while Python, dependencies, and model files are downloaded and initialized. This is expected behavior.
No manual Python setup is required.
⚠️ Important (Beta Notice)
Diffuse is still in Beta, and during this phase some releases include large internal changes.
For this version, a full uninstall and reinstall is recommended to avoid upgrade issues.
Your downloaded models are not affected and can be kept.
Settings.json will be replaced during installation
If you have added custom environments, paths, or models, please back up your Settings.json file before upgrading.
These upgrade steps are temporary and expected during Beta. The goal is to stabilize updates and avoid full reinstalls for future releases.
Device Support
Download the version that matches your GPU, CUDA for Nvidia devices and ROCM for AMD devices
Note: Not all releases will contain builds for each vendor (This is a CUDA only release)