[MMS] Update docs with HF TTS implementation #25907

sanchit-gandhi · 2023-09-01T10:44:29Z

What does this PR do?

Following #24085, Transformers now contains an implementation of MMS-TTS. This PR updates the MMS docs to include a code-snippet using this implementation.

cc @osanseviero since this was something you flagged before, and also @Vaibhavs10 for viz

osanseviero

Very cool! 🔥

HuggingFaceDocBuilderDev · 2023-09-01T11:05:43Z

The documentation is not available anymore as the PR was closed or merged.

ArthurZucker

Thanks! Left a few doc nits 😉

docs/source/en/model_doc/mms.md

ArthurZucker · 2023-09-01T13:51:18Z

docs/source/en/model_doc/mms.md

+Audio(waveform, rate=model.config.sampling_rate)
+```
+
+For certain languages with non-Roman alphabets, such as Arabic, Mandarin or Hindi, the [`uroman`](https://github.com/isi-nlp/uroman) 


Suggested change

For certain languages with non-Roman alphabets, such as Arabic, Mandarin or Hindi, the [`uroman`](https://github.com/isi-nlp/uroman)

- For certain languages with non-Roman alphabets, such as Arabic, Mandarin or Hindi, the [`uroman`](https://github.com/isi-nlp/uroman)

Why add a bullet point here?

ArthurZucker · 2023-09-01T13:52:27Z

docs/source/en/model_doc/mms.md

+If required, you should apply the uroman package to your text inputs **prior** to passing them to the `VitsTokenizer`, 
+since currently the tokenizer does not support performing the pre-processing itself.


Do you think we should add this? With a if self.is_uroman: requires_backend ? Makes sense no?

Can you also add an example of how to do this pre-processing?

It's a perl package, not a Python package, so we can't import it. The only way of running uroman is through the command line, see discussion here: #24085 (comment)

Added an example for pre-processing: d264692

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* [MMS] Update docs with HF TTS implementation * Update docs/source/en/model_doc/mms.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * add uromanise to docs --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

[MMS] Update docs with HF TTS implementation

cc5f0a0

osanseviero approved these changes Sep 1, 2023

View reviewed changes

sanchit-gandhi requested a review from ArthurZucker September 1, 2023 11:32

ArthurZucker approved these changes Sep 1, 2023

View reviewed changes

sanchit-gandhi and others added 2 commits September 1, 2023 15:01

Update docs/source/en/model_doc/mms.md

daa20ff

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

add uromanise to docs

d264692

sanchit-gandhi merged commit 1fa2d89 into huggingface:main Sep 1, 2023

sanchit-gandhi deleted the mms-docs branch September 1, 2023 15:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[MMS] Update docs with HF TTS implementation #25907

[MMS] Update docs with HF TTS implementation #25907

Uh oh!

sanchit-gandhi commented Sep 1, 2023

Uh oh!

osanseviero left a comment

Uh oh!

HuggingFaceDocBuilderDev commented Sep 1, 2023 •

edited

Loading

Uh oh!

ArthurZucker left a comment

Uh oh!

Uh oh!

ArthurZucker Sep 1, 2023

Uh oh!

sanchit-gandhi Sep 1, 2023

Uh oh!

ArthurZucker Sep 1, 2023

Uh oh!

ArthurZucker Sep 1, 2023

Uh oh!

sanchit-gandhi Sep 1, 2023

Uh oh!

sanchit-gandhi Sep 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	For certain languages with non-Roman alphabets, such as Arabic, Mandarin or Hindi, the [`uroman`](https://github.com/isi-nlp/uroman)
	- For certain languages with non-Roman alphabets, such as Arabic, Mandarin or Hindi, the [`uroman`](https://github.com/isi-nlp/uroman)

		If required, you should apply the uroman package to your text inputs prior to passing them to the `VitsTokenizer`,
		since currently the tokenizer does not support performing the pre-processing itself.

Uh oh!

[MMS] Update docs with HF TTS implementation #25907

[MMS] Update docs with HF TTS implementation #25907

Uh oh!

Conversation

sanchit-gandhi commented Sep 1, 2023

What does this PR do?

Uh oh!

osanseviero left a comment

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Sep 1, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ArthurZucker Sep 1, 2023

Choose a reason for hiding this comment

Uh oh!

sanchit-gandhi Sep 1, 2023

Choose a reason for hiding this comment

Uh oh!

ArthurZucker Sep 1, 2023

Choose a reason for hiding this comment

Uh oh!

ArthurZucker Sep 1, 2023

Choose a reason for hiding this comment

Uh oh!

sanchit-gandhi Sep 1, 2023

Choose a reason for hiding this comment

Uh oh!

sanchit-gandhi Sep 1, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

HuggingFaceDocBuilderDev commented Sep 1, 2023 •

edited

Loading