AlexTTS

AlexTTS is a passion project— a text-to-speech model built from the ground up. I wanted to understand, at a low level, what an end-to-end deep learning project really means. While I've had some experience with autoregressive text generation models, it was primarily limited to fine-tuning pre-existing architectures. I had the privilege of speaking with Eli, an ML researcher at Cartesia. Our conversation left me with a single, compelling thought:

Why not build my own unique text-to-speech model?

See my blog for more information! My files are stored in apps/tts.

Credits

This repository is forked from Meta Lingua, a minimal and fast LLM training and inference library designed for research. A huge thanks to:

Mathurin Videau*, Badr Youbi Idrissi*, Daniel Haziza, Luca Wehrstedt, Jade Copet, Olivier Teytaud, David Lopez-Paz. *Equal and main contribution

License

Meta Lingua and AlexTTS are licensed under BSD-3-Clause license. Refer to the LICENSE file in the top level directory.

Name		Name	Last commit message	Last commit date
Latest commit History 113 Commits
apps		apps
docs		docs
lingua		lingua
setup		setup
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
mypy.ini		mypy.ini
pytest.ini		pytest.ini
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AlexTTS

Credits

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 13

Uh oh!

Languages

License

Itisalex2/AlexTTS

Folders and files

Latest commit

History

Repository files navigation

AlexTTS

Credits

License

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 13

Uh oh!

Languages

Packages