A Fish Speech fork with enhanced batch inference for efficient speech generation.
Batch inference lets you generate multiple audio clips at once instead of one by one, which makes generation significantly faster. To use it, set up your reference audio file and texts, then configure the generation settings as shown in the usage section below.
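Conceptually, the difference looks like the sketch below. The `generate` and `generate_batch` functions are hypothetical stand-ins for the model's inference call, not this repo's actual API:

```python
def generate(text: str, reference: str) -> bytes:
    """Stand-in for a single-text TTS call (hypothetical, for illustration)."""
    return f"<audio for: {text}>".encode()

def generate_batch(texts: list[str], reference: str) -> list[bytes]:
    """Stand-in for a batched TTS call: one forward pass covers all texts."""
    return [f"<audio for: {t}>".encode() for t in texts]

texts = [
    "First sentence to synthesize.",
    "Second sentence to synthesize.",
    "Third sentence to synthesize.",
]

# Sequential: the model is invoked once per text, paying the full
# per-call overhead (weight loads, cache setup) N times.
audios = [generate(t, reference="fake.npy") for t in texts]

# Batched: all texts go through one padded forward pass, so the
# per-call overhead is paid once for the whole batch.
audios = generate_batch(texts, reference="fake.npy")
```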
- Batch Processing: Handles multiple texts at once for faster inference
- Stable & Efficient: No empty results, no redundant calculations, correct attention masking
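Correct attention masking matters because batched texts have different lengths and are padded to a common length; padded positions must contribute nothing to attention. A minimal PyTorch sketch of building a padding mask from sequence lengths (illustrative only, not code from this repo):

```python
import torch

# Token lengths of three texts padded into one batch.
lengths = torch.tensor([5, 3, 7])
max_len = int(lengths.max())

# Boolean mask: True at real tokens, False at padding. Shape (batch, max_len).
mask = torch.arange(max_len)[None, :] < lengths[:, None]

# Additive attention bias: 0 at real keys, -inf at padded keys, so padded
# positions get zero weight after softmax. Shape (batch, 1, 1, max_len)
# broadcasts over heads and query positions.
attn_bias = torch.zeros(len(lengths), 1, 1, max_len)
attn_bias.masked_fill_(~mask[:, None, None, :], float("-inf"))

scores = torch.randn(len(lengths), 1, max_len, max_len)  # dummy scores
probs = torch.softmax(scores + attn_bias, dim=-1)

# Batch item 1 has length 3, so its padded keys receive zero probability.
assert torch.allclose(probs[1, 0, :, 3:].sum(), torch.tensor(0.0))
```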
- Download the codec model.
- Create the `fake.npy` file from your reference audio, pointing at the checkpoint path:

  ```bash
  python fish_speech/models/dac/inference.py \
      -i "ref_audio_name.wav" \
      --checkpoint-path "checkpoints/fish-speech-1.5/"
  ```

  This command will generate `fake.npy` (specify the output path if needed).
- Set the path to `fake.npy` in `fish_batch_inference.py`.
- Run batch inference:

  ```bash
  python fish_batch_inference.py
  ```
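`fake.npy` holds the encoded reference audio (the prompt tokens for voice cloning). Before running batch inference, you can sanity-check that it loads; the shape noted in the comment is an assumption about typical codec output, not a guarantee:

```python
import numpy as np

codes = np.load("fake.npy")
# Expect an integer array of codec token indices, e.g. (num_codebooks, num_frames);
# the exact shape depends on the codec configuration.
print(codes.shape, codes.dtype)
```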
Coming soon:
- VQ-GAN parallelization for even faster inference
- Gradio Web UI for easy batch processing
- Speed: Up to 3-4x faster than sequential processing
- Quality: More diverse and robust audio results
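To reproduce the speedup measurement on your own texts, a simple wall-clock harness is enough. The `time.sleep` calls below are placeholders that keep the sketch runnable; replace them with the actual sequential and batched inference calls:

```python
import time
from contextlib import contextmanager

@contextmanager
def timed(label: str, results: dict):
    """Record wall-clock time for the enclosed block."""
    start = time.perf_counter()
    yield
    results[label] = time.perf_counter() - start

results = {}
with timed("sequential", results):
    for _ in range(4):   # replace with one single-text generation per text
        time.sleep(0.05)
with timed("batched", results):
    time.sleep(0.08)     # replace with one batched generation over all texts

print(f"speedup: {results['sequential'] / results['batched']:.1f}x")
```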
Repository: https://github.com/mkgs210/batch_fish_speech
Fish Speech fork with true batch inference. VQ-GAN and Gradio support coming soon!