Upscale compressed audio with AI, in the spirit of Sony DSEE or Samsung UHQ
- AI-Powered Upscaling - Reconstructs high frequencies lost to compression (MP3 → lossless-like)
- Cross-Platform - Works on Windows/macOS/Linux
- Two Interfaces
  - CLI for power users.
  - Streamlit GUI because it's prettier.
- Python 3.8+
- FFmpeg (for MP3 support)
# Debian/Ubuntu
sudo apt install ffmpeg
# macOS (Homebrew)
brew install ffmpeg
# Windows (Chocolatey)
choco install ffmpeg
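Before installing the Python dependencies, it can help to confirm that both prerequisites are in place. The following is a minimal sketch, not part of the project: `check_prerequisites` is a hypothetical helper that uses only the standard library to compare the interpreter version against 3.8 and look for `ffmpeg` on `PATH`.

```python
import shutil
import sys


def check_prerequisites(min_python=(3, 8)):
    """Report whether the interpreter and FFmpeg meet the stated requirements."""
    ffmpeg_path = shutil.which("ffmpeg")  # None when ffmpeg is not on PATH
    return {
        "python_ok": sys.version_info >= min_python,
        "ffmpeg_path": ffmpeg_path,
        "ffmpeg_ok": ffmpeg_path is not None,
    }


if __name__ == "__main__":
    # Prints a small status report; nothing is installed or modified.
    print(check_prerequisites())
```

If `ffmpeg_ok` comes back `False`, rerun the FFmpeg install command for your platform and open a new terminal so the updated `PATH` is picked up.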
git clone https://github.com/yourusername/audio_upscaler.git
cd audio_upscaler
pip install -r requirements.txt
python -c "from demucs.pretrained import get_model; get_model(name='htdemucs')"
# Single file
python main.py input.mp3 output.wav
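To process a whole folder, one option is to call the single-file command once per input. The sketch below assumes the CLI takes exactly the two positional arguments shown above; `plan_outputs` and `run_batch` are hypothetical helpers, and the `script` path is an assumption (adjust it if `main.py` lives under `src/` in your checkout).

```python
import subprocess
import sys
from pathlib import Path


def plan_outputs(inputs, output_dir):
    """Pair each input file with a .wav path of the same stem in output_dir."""
    out = Path(output_dir)
    return [(Path(p), out / (Path(p).stem + ".wav")) for p in inputs]


def run_batch(input_dir="input", output_dir="output", script="main.py"):
    """Run the CLI once per MP3 in input_dir; stops at the first failure."""
    Path(output_dir).mkdir(parents=True, exist_ok=True)
    mp3_files = sorted(Path(input_dir).glob("*.mp3"))
    for src, dst in plan_outputs(mp3_files, output_dir):
        # Equivalent to: python main.py <src> <dst>
        subprocess.run([sys.executable, script, str(src), str(dst)], check=True)
```

Calling `run_batch()` from the repository root would process every MP3 in `input/` in sorted order, which keeps reruns predictable.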
streamlit run app.py
Opens http://localhost:8501 in your browser.
Fix: Add Python scripts folder to PATH
echo 'export PATH="$HOME/.local/bin:$PATH"' >> ~/.bashrc && source ~/.bashrc
C:\Users\<YOU>\AppData\Roaming\Python\Python3XX\Scripts
Edit enhance.py:
device = torch.device("cpu") # Change from "cuda"
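Instead of editing the file by hand on each machine, the device could be chosen at runtime. This is a sketch under the assumption that `enhance.py` only needs a device name; `pick_device_name` is a hypothetical helper, not something the project ships. It relies on `torch.cuda.is_available()` and falls back to the CPU when PyTorch or a GPU is missing.

```python
def pick_device_name(prefer_gpu=True):
    """Return "cuda" only when PyTorch reports a usable GPU, else "cpu"."""
    try:
        import torch
    except ImportError:
        return "cpu"  # PyTorch not installed, so there is no GPU path
    if prefer_gpu and torch.cuda.is_available():
        return "cuda"
    return "cpu"


# Hypothetical usage inside enhance.py:
# device = torch.device(pick_device_name())
print(pick_device_name(prefer_gpu=False))  # cpu
```

Passing `prefer_gpu=False` forces the CPU even on a machine with a GPU, which is handy when reproducing a CUDA problem.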
- Convert stereo files to mono first using Audacity/FFmpeg
- Trim files to <10 minutes if processing fails
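Both workarounds above can be done in one FFmpeg pass. The sketch below builds the command rather than hard-coding it; `preprocess_command` and `run_preprocess` are hypothetical helpers, and the 599-second default is only an assumption chosen to stay under the 10-minute mark. It uses FFmpeg's `-ac 1` (one output channel), `-t` (output duration limit), and `-y` (overwrite without asking) options.

```python
import subprocess


def preprocess_command(src, dst, mono=True, max_seconds=599):
    """Build an ffmpeg command that downmixes to mono and/or trims the input."""
    cmd = ["ffmpeg", "-y", "-i", str(src)]
    if mono:
        cmd += ["-ac", "1"]  # downmix to a single channel
    if max_seconds is not None:
        cmd += ["-t", str(max_seconds)]  # keep only the first max_seconds
    cmd.append(str(dst))
    return cmd


def run_preprocess(src, dst, **options):
    """Run the command; raises CalledProcessError if ffmpeg fails."""
    subprocess.run(preprocess_command(src, dst, **options), check=True)
```

For example, `run_preprocess("input/sample.mp3", "input/sample_mono.wav")` would write a mono copy that can then be passed to the upscaler.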
audio_upscaler/
├── input/ # Default input folder
│ └── sample.mp3 # Example file
├── output/ # Enhanced audio saved here
├── src/
│ ├── enhance.py # Core AI processing
│ ├── main.py # CLI interface
│ └── app.py # Streamlit GUI
├── requirements.txt # Dependencies
└── README.md # This file
A: Typical results:
- MP3 128 kbps → sounds like ~256 kbps
- MP3 320 kbps → near-CD quality
A: WAV is uncompressed. Use FFmpeg to convert to FLAC, which is lossless, for smaller files:
ffmpeg -i output.wav -compression_level 12 output.flac
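The same conversion can be scripted when many outputs need compressing. This is a sketch only: `flac_command` is a hypothetical helper that mirrors the command above and checks the level against the 0 to 12 range accepted by FFmpeg's FLAC encoder.

```python
from pathlib import Path


def flac_command(wav_path, level=12):
    """Build the ffmpeg command that converts a WAV file to FLAC next to it."""
    if not 0 <= level <= 12:
        raise ValueError("FLAC compression_level must be between 0 and 12")
    flac_path = Path(wav_path).with_suffix(".flac")  # output.wav -> output.flac
    return ["ffmpeg", "-i", str(wav_path),
            "-compression_level", str(level), str(flac_path)]


print(" ".join(flac_command("output.wav")))
# ffmpeg -i output.wav -compression_level 12 output.flac
```

Lower levels encode faster at the cost of slightly larger files; the audio itself is identical at every level because FLAC is lossless.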
A: Yes! The MIT License allows any use, provided the copyright notice and license text stay with copies of the software (further credit is appreciated but not required).