A lightweight and powerful audio processing tool for Dify, enabling seamless merging of multiple audio files into a single output file.
Audio Merge is a specialized tool designed to combine multiple audio clips in sequence within the Dify environment. Whether you're stitching voice recordings, podcast segments, or sound effects, this plugin handles the merge reliably—supporting WAV natively and other popular formats (MP3, OGG, FLAC, etc.) when FFmpeg is available.
- Merge multiple audio files in upload order
- Native support for WAV format (no external dependencies)
- Support for MP3, OGG, FLAC, AAC, and more (requires FFmpeg)
- Automatic format detection from file extensions
- Graceful fallback: blocks non-WAV uploads if FFmpeg is missing with clear error messaging
- Default output format matches the first input file (e.g., MP3 + WAV → MP3)
Combines a list of audio files into one continuous audio file, preserving the sequence in which files were provided.
Parameters:
- audio_files: An array of audio files (required)
- Files are merged in the exact order they appear in this list (i.e., the order of upload)
- Supported formats: WAV (always), plus MP3/OGG/FLAC/AAC/etc. if FFmpeg is installed
- Files must share compatible audio properties (sample rate, channels); the plugin auto-normalizes them to the first file’s settings
- output_filename (optional): Custom filename for the merged audio without extension name (e.g.,
podcast_final).- If not provided, defaults to
merged_audio.<output_format>.
- If not provided, defaults to
- output_format (optional): Desired output audio format (e.g.,
wav,mp3,ogg,flac).- If not provided, defaults to the format of the first input file.
- Non-WAV output requires FFmpeg.
pydub- Optional: FFmpeg — required only for non-WAV formats (input or output)
💡 Note: If FFmpeg is not installed, the plugin will only accept WAV files and output WAV.
Created by aopstudio