diff --git a/README.md b/README.md
index 15284ca..901a5a6 100644
--- a/README.md
+++ b/README.md
@@ -1,6 +1,6 @@
 # Czech Deepspeech Model
 
-This is an experimental deepspeech model for the Czech language. The model is under the CC-BY-NC license, mainly because it has been trained on some CC-BY-NC datasets. All the datasets are:
+This is an experimental deepspeech model for the Czech language. The model is under the CC-BY-NC license. Datasets used are:
 
 - [Vystadial 2016 – Czech data](https://lindat.cz/repository/xmlui/handle/11234/1-1740) by Plátek, Ondřej ; Dušek, Ondřej ; Jurčíček, Filip (CC-BY-SA 4.0)
 - [OVM – Otázky Václava Moravce](https://lindat.mff.cuni.cz/repository/xmlui/handle/11858/00-097C-0000-000D-EC98-3) by Šmídl, Luboš ; Pražák, Aleš (CC-BY-NC 3.0)
@@ -9,3 +9,7 @@ This is an experimental deepspeech model for the Czech language. The model is un
 - [Common Voice Czech](https://commonvoice.mozilla.org/en/datasets) by Mozilla (CC0)
 - Some private recordings and parts of audioboooks
 
+The model has been originally transfer-learned from the [English Deepspeech/Coqui model](https://github.com/coqui-ai/STT/releases/tag/v0.9.3) version 0.9.3.
+
+Released scorers have been created using the [CWC 2011 Corpus](https://lindat.mff.cuni.cz/repository/xmlui/handle/11858/00-097C-0000-0006-B847-6) by Spoustová, Johanka and Spousta, Miroslav (CC-BY 3.0)
+