Training with libri-small data #106

Open
alexgain opened this issue Jun 9, 2023 · 1 comment
Comments

alexgain commented Jun 9, 2023

I downloaded the data from here and am trying to use the small.tar dataset, which I have un-tarred.

However, the extracted files don't seem to have the file extensions needed to continue with training. For example, these commands don't output anything:

python -m vall_e.emb.qnt data/your_data
python -m vall_e.emb.g2p data/your_data

What additional data preparation steps do I need to do here? I'd appreciate any help on this!

Btw, the directory structure looks like this:
[image: directory structure]

leona commented Jun 23, 2023

I would use this dataset instead: https://openslr.org/60/. It seems to be the same one the author used, since it follows the same .wav and .normalized.txt pattern, and there is a config for it in this repo.
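
A quick way to check whether an extracted dataset matches that pattern is to count its files by suffix. The sketch below is a hypothetical helper (`count_expected` is not part of this repo), and it assumes the preprocessing commands pick up files by the .wav and .normalized.txt suffixes mentioned above; that is an inference from this thread, not something confirmed against the repo's code.

```python
from collections import Counter
from pathlib import Path

# Assumption: these are the suffixes the preprocessing steps expect,
# taken from the file pattern described in this thread.
EXPECTED_SUFFIXES = (".wav", ".normalized.txt")


def count_expected(root, suffixes=EXPECTED_SUFFIXES):
    """Walk `root` recursively and count files per expected suffix.

    Returns (matched, other): `matched` maps each expected suffix to
    its file count, `other` counts all remaining files by their last
    extension so unexpected formats are easy to spot.
    """
    matched = {suffix: 0 for suffix in suffixes}
    other = Counter()
    for path in Path(root).rglob("*"):
        if not path.is_file():
            continue
        for suffix in suffixes:
            if path.name.endswith(suffix):
                matched[suffix] += 1
                break
        else:
            # No expected suffix matched this file.
            other[path.suffix or "(no extension)"] += 1
    return matched, other
```

Calling `count_expected("data/your_data")` and printing both results shows at a glance whether the directory holds any matching files. If every expected suffix has a count of zero while `other` is full of different extensions, the data likely needs converting or swapping for a dataset in the expected layout.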
