    Repositories list

    • Python
      MIT License
      0001 Updated Oct 3, 2024
    • Rhestr o ataleiriau Cymraeg | Welsh Stopwords List
      Creative Commons Zero v1.0 Universal
      0100 Updated Sep 11, 2024
    • MIT License
      0000 Updated Aug 28, 2024
    • TypeScript
      MIT No Attribution
      1000 Updated Aug 8, 2024
    • Casgliad cychwynnol o URLs sy'n cynnwys testun Cymraeg / An initial collection of URLs containing Welsh-language texts
      Creative Commons Zero v1.0 Universal
      0000 Updated Jul 31, 2024
    • Corpws o sgyrsiau cymorth Cysgliad | A Corpus of support chat messages for the Cysgliad software
      Creative Commons Zero v1.0 Universal
      0000 Updated Jul 29, 2024
    • Cod gwefan Trawsgrifiwr Ar-lein gan Uned Technolegau Iaith, Prifysgol Bangor // The code for the Trawsgrifiwr Ar-lein website by the Language Technologies Unit, Bangor University
      JavaScript
      MIT License
      1210 Updated Jul 25, 2024
    • Lecsicon cynhwysfawr o eirffurfiau'r Gymraeg yn seiliedig ar ddata gwirydd sillafu a gramadeg Cysill | A comprehensive lexicon of Welsh-language wordforms based on data from the Cysill spelling and grammar checker
      Creative Commons Zero v1.0 Universal
      2711 Updated Jun 14, 2024
    • Gweinydd syml ar gyfer ddarparu gwasanaeth API at modelau adnabod lleferydd DeepSpeech // Simple server for providing API access to DeepSpeech speech recognition models.
      Python
      MIT License
      2001 Updated Apr 16, 2024
    • Parsiwr dibyniaethau sy'n ceisio gwahaniaethu rhwng defnydd enwol a berfol o'r berfenw // A dependency parser which attempts to differentiate between nominal and verbal verbnouns
      Creative Commons Attribution Share Alike 4.0 International
      0000 Updated Apr 11, 2024
    • Tagiwr arbrofol dwieithog ar gyfer testunau Cymraeg a Saesneg | An experimental bilingual tagger for English and Welsh texts
      0000 Updated Apr 11, 2024
    • Corpws o frawddegau CC0 mewn fformat jsonl, gyda rhannau ymadrodd y tocynnau (geiriau etc.) wedi'u tagio â thagiau Universal Dependencies. | A Corpus of CC0 sentences in the jsonl format, tagged with Universal Dependency part-of-speech tags.
      Creative Commons Zero v1.0 Universal
      0300 Updated Apr 11, 2024
    • piper-cy

      Public
      Lleisiau all-lein Cymraeg || Welsh offline voices
      Python
      MIT License
      0100 Updated Apr 3, 2024
    • Corpws o frawddegau o destun Cymraeg wedi'u trwyddedu o dan drwydded CC0 | A corpus of Welsh texts licensed under the CC0 licence
      Creative Commons Zero v1.0 Universal
      0100 Updated Mar 31, 2024
    • Fersiwn wedi'i becynnu o spacy-lookups-data gyda data lemateiddio Cymraeg | A packaged version of spacy-lookups-data including Welsh lemmatization data
      MIT License
      0000 Updated Mar 31, 2024
    • deffro

      Public
      Python
      0000 Updated Mar 26, 2024
    • Anonymeiddiwr Beta ar gyfer testunau dwyieithog Saesneg-Cymraeg a thestunau Cymraeg uniaith. | A beta anonymizer for bilingual English-Welsh texts and monolingual Welsh texts.
      Python
      MIT License
      0000 Updated Mar 21, 2024
    • Trawsgrifio ar gael drwy’r eicon microffon o fewn bysellfwrdd arferol ffôn symudol | Transcription available via the microphone icon in a mobile phone's standard keyboard
      Java
      MIT License
      0000 Updated Mar 21, 2024
    • Rhedeg modelau adnabod lleferydd Cymraeg Whisper all-lein gyda C/C++ | Run Welsh Whisper speech recognition models offline with C/C++
      C
      MIT License
      3.5k000 Updated Mar 21, 2024
    • Gweinydd gwasanaeth atgyweirio priflythrennau ac atalnodi o fewn testunau Cymraeg // Capitalization and Punctuation restoration for Welsh language texts
      Python
      MIT License
      0000 Updated Mar 21, 2024
    • sense2vec

      Public
      🦆 Contextually-keyed word vectors
      Python
      MIT License
      238000 Updated Mar 17, 2024
    • Meddalwedd ac offer docker i weithio gyda Marian NMT | Software and tools for working with Marian NMT
      Python
      MIT License
      1200 Updated Feb 28, 2024
    • Demo o fodelu pwnc | A topic modelling demo
      Python
      MIT License
      0000 Updated Jan 12, 2024
    • Casgliad o brofion Cymraeg ar gyfer modelau iaith mawr (llm) // A collection of Welsh language evals for large language models
      Shell
      MIT License
      0000 Updated Nov 30, 2023
    • Fersiwn wedi'i ddiweddaru o'r fersiwn Cymraeg o wirydd sillafu Hunspell. | An updated version of the Welsh version of the Hunspell spellchecker.
      Other
      0510 Updated Nov 7, 2023
    • Model Iaith Fectorau Word2vec ar sail corpora ymchwil yr Uned Technolegau Iaith a gasglwyd o ffynonellau amrywiol at ddibenion ymchwil fel cynhyrchu modelau iaith. | A Word2vec Language Model based on the Language Technologies Unit's research corpora.
      Python
      Apache License 2.0
      0100 Updated Oct 31, 2023
    • Fersiwn Cymraeg llafar o wirydd sillafu Hunspell. | Spoken Welsh version of the Hunspell spellchecker.
      Other
      0100 Updated Oct 31, 2023
    • C#
      MIT License
      0000 Updated Oct 31, 2023
    • spacy

      Public
      Python
      MIT License
      0000 Updated Sep 11, 2023
    • Enghraifft o god ar gyfer cyfrifo tebygrwydd brawddegau Cymraeg gan ddefnyddio spaCy / Code examples for calculating Welsh sentence similarity for spaCy
      Python
      MIT License
      0000 Updated Aug 29, 2023