Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix README #16

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
fix README
there is no folder script instead there is folder scripts, also changed tortoise-tts-fast to tortoise-tts-fastest
  • Loading branch information
pixol20 committed Apr 21, 2024
commit 298aacfcaa7cc22d045326df119b34bf0ab401ec
22 changes: 11 additions & 11 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -57,8 +57,8 @@ There are two methods for installation.
The installation process is identical to the original tortoise-tts repo.

```shell
git clone https://github.com/152334H/tortoise-tts-fast
cd tortoise-tts-fast
git clone https://github.com/manmay-nakhashi/tortoise-tts-fastest.git
cd tortoise-tts-fastest
pip3 install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu117
python3 -m pip install -e .
pip3 install git+https://github.com/152334H/BigVGAN.git
Expand Down Expand Up @@ -88,41 +88,41 @@ If you are experiencing errors related to GPU usage (or lackthereof), please see
For maximum speed (and worst quality), you can try:

```sh
./script/tortoise-tts.py --half --no_cond_free --preset ultra_fast #...
./scripts/tortoise-tts.py --half --no_cond_free --preset ultra_fast #...
# or, to only generate 1 sample:
./script/tortoise-tts.py --half --no_cond_free --preset single_sample --candidates 1 #...
./scripts/tortoise-tts.py --half --no_cond_free --preset single_sample --candidates 1 #...
```

But in most cases, these settings should perform decently && fast:

```sh
./script/tortoise-tts.py --preset ultra_fast # ...
./scripts/tortoise-tts.py --preset ultra_fast # ...
```

For better quality, you might want the `very_fast` preset:

```sh
./script/tortoise-tts.py --preset very_fast # ...
./scripts/tortoise-tts.py --preset very_fast # ...
```

You can obtain outputs 100% identical to the original tortoise repo with the following command:

```sh
./script/tortoise-tts.py --preset ultra_fast_old --original_tortoise #...
./scripts/tortoise-tts.py --preset ultra_fast_old --original_tortoise #...
```

If you want to load a [fine-tuned autoregressive model](https://github.com/152334H/DL-Art-School), use the `--ar-checkpoint` argument:

```sh
./script/tortoise-tts.py --preset very_fast --ar-checkpoint /path/to/checkpoint.pth #...
./scripts/tortoise-tts.py --preset very_fast --ar-checkpoint /path/to/checkpoint.pth #...
```

## Webui

An experimental [Streamlit](https://streamlit.io/) web UI is now available. To access, run:

```bash
$ streamlit run script/app.py
$ streamlit run scripts/app.py
```

![](./static/webui_simple.png)
Expand Down Expand Up @@ -237,13 +237,13 @@ If you are on windows, you will also need to install pysoundfile: `conda install
This script allows you to speak a single phrase with one or more voices.

```shell
./script/tortoise-tts.py --text "I'm going to speak this" --voice random --preset fast
./scripts/tortoise-tts.py --text "I'm going to speak this" --voice random --preset fast
```

For reading large amounts of text:

```shell
./script/tortoise-tts.py --voice random --preset fast < textfile.txt
./scripts/tortoise-tts.py --voice random --preset fast < textfile.txt
```

This will break up the textfile into sentences, and then convert them to speech one at a time. It will output a series
Expand Down