Odia Wikipedia monolingual corpus extraction
Updated Nov 15, 2021 - Python
Replication package for SO processing for bitext
ParTy2OPUS converts documents from the ParTy corpus to the OPUS format
Framework for parallel corpora
Creating (parallel) corpora from scratch using Uplug tooling
A simple and efficient tool for mining and aligning sentences with pre-trained models.
Extracting present perfects (and related forms) from parallel corpora
Parallel corpus and multilingual machine translation system for the Pali Buddhist scriptures of 15 countries
Parallel corpus annotation and visualization
Code to extract a multilingual parallel corpus from the Press Information Bureau (PIB) website.
Pali Buddhist scriptures of 15 countries and their parallel corpus
Python application that generates parallel corpora for any language pair; can be used to train neural machine translation (NMT) systems
🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as part of the winning `en-ru` submission to the WMT20 Biomedical Translation Task.
OPUS (opus.nlpl.eu) Python3 API
An easy-to-use library for linguistically comparing one sentence and its words with another, in the same language or a different one. Useful, for instance, for comparing a translation with the original text, finding differences and similarities between two translations, or seeing how a machine translation differs from a reference translation.
Multilingual and monolingual corpora focused on Caucasus languages, for Natural Language Processing (NLP)
Neural Machine Translation on the Nepali-English language pair
Leeds University and King Saud University (LK) Hadith Corpus
Multilingual sentence alignment using sentence embeddings