f5-tts-mlx/examples at 0.1.1 · lucasnewman/f5-tts-mlx

History

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
generate.py		generate.py

README.md

Usage

To run the script, use the following format:

python generate.py --text "Your input text here"

Required Parameters

--text

string

Provide the text that you want to generate.

Optional Parameters

--duration

float

Specify the length of the generated audio in seconds.

--speed

float, default: 1.0

Speaking speed modifier, used when an exact duration is not specified.

--model

string, default: "lucasnewman/f5-tts-mlx"

Specify a custom model to use for generation. If not provided, the script will use the default model.

--ref-audio

string, default: "tests/test_en_1_ref_short.wav"

Provide a reference audio file path to help guide the generation.

--ref-text

string, default: "Some call me nature, others call me mother nature."

Provide a caption for the reference audio.

--output

string, default: "output.wav"

Specify the output path where the generated audio will be saved. If not specified, the script will save the output to a default location.

--cfg

float, default: 2.0

Specifies the strength used for classifier free guidance

--steps

int, default: 32

Specify the number of steps used to sample the neural ODE. Lower steps trade off quality for latency.

--sway-coef

float, default: -1.0

Set the sway sampling coefficient. The best values according to the paper are in the range of [-1.0...1.0].

--seed

int, default: None (random)

Set a random seed for reproducible results.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

examples

examples

README.md

Usage

Required Parameters

Optional Parameters

Files

examples

Directory actions

More options

Directory actions

More options

Latest commit

History

examples

Folders and files

parent directory

README.md

Usage

Required Parameters

Optional Parameters