
🐸 TTS roadmap #378

Closed
@erogol

Description

These are the main dev plans for 🐸 TTS.

If you want to contribute to 🐸 TTS and don't know where to start, you can pick one of the items here and start with our Contribution Guideline. We're also always here to help.

Feel free to pick one or suggest a new one.

Contributions are always welcome 💪.

v0.1.0 Milestones

  • Better model config handling (see "[Discussion] Ideas for better model config management", #21)
  • TTS recipes for public datasets.
  • TTS trainer API to unify all the model training scripts.
  • TTS, Vocoder and SpeakerEncoder model abstractions and APIs.
  • Documentation for:
    • Implementing a new model using 🐸 TTS.
    • Training a model on a new dataset from gecko.
    • Using Synthesizer interface on CLI or Server.
    • Extracting Spectrograms for Vocoder training.
    • Contributing a new pre-trained 🐸 TTS model.
    • Explanation of model config parameters.

v0.2.0 Milestones

  • In-house grapheme-to-phoneme conversion. (Thx to gruut 👍)
  • Implement VITS model.

v0.3.0 Milestones

  • Implement generic ForwardTTS API.
  • Implement FastSpeech model.
  • Implement FastPitch model.

v0.4.0 Milestones

v0.5.0 Milestones

  • Support for multi-lingual models
  • YourTTS release 🚀

v0.6.0 Milestones

v0.7.0 Milestones

v0.8.0 Milestones

  • Separate numpy transforms
  • Better data sampling for VITS
  • New Thorsten DE models 👑 @thorstenMueller

🏃‍♀️ Milestones along the way

🤖 New TTS models

Labels: TODOs, wontfix (This will not be worked on but feel free to help.)
