ukrainian-nlp

Code for the WECHSEL models transferred to Ukrainian:

Dictionaries

  • extra_dicts/ukrainian_wiktionary.txt: an updated dictionary parsed from Wiktionary as of 20.05.2023. Used by the configs/experimental/gpt2/gpt2-small.oscar.nofilter.wechsel.mediumdict.json config.
  • extra_dicts/ukrainian_stardict.txt: the "English - Ukrainian dictionary for Android • NerdCats" dictionary, converted with pyglossary and the auxiliary script aux/convert_after_pygloss.py. Used by the configs/experimental/gpt2/gpt2-small.oscar.nofilter.wechsel.largedict.json config.
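
To inspect one of these dictionaries before pointing a config at it, a small loader can help. The sketch below is only an illustration: it assumes each line holds one tab-separated pair (source word, then translation), which this README does not confirm, and `parse_dictionary` is a hypothetical helper name, not part of this repository.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def parse_dictionary(lines: Iterable[str], sep: str = "\t") -> Dict[str, List[str]]:
    """Group translations by source word.

    Assumption: one `source<sep>target` pair per line. Adjust `sep`
    if the actual files use a different layout.
    """
    pairs: Dict[str, List[str]] = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line or sep not in line:
            continue  # skip blank or malformed lines
        source, target = line.split(sep, 1)
        source, target = source.strip(), target.strip()
        # Keep each translation once, in first-seen order.
        if source and target and target not in pairs[source]:
            pairs[source].append(target)
    return dict(pairs)


if __name__ == "__main__":
    sample = ["cat\tкіт", "cat\tкішка", "dog\tпес", "", "malformed line"]
    for word, translations in parse_dictionary(sample).items():
        print(word, translations)
```

To run it on a real file, pass an open file handle, for example `parse_dictionary(open("extra_dicts/ukrainian_wiktionary.txt", encoding="utf-8"))`, and check a few entries by eye to confirm the assumed layout.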

Credits

Part of the work in this study was done on the hardware of the Ukrainian cluster of excellence of the Ukrainian Catholic University. The bigger models were trained with support from the Google TRC program.