ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with diverse types of annotation.
-
Updated
May 22, 2025 - Java
ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with diverse types of annotation.
Yet another search platform for linguistic corpora.
computer tools for thai language
This is a new backend implementation of the ANNIS linguistic search and visualization system.
Your Friendly ANNIS Match Exporter
This repository contains the CEO ontology, the evaluation corpus and the CEO vocabulary.
Text corpus the of Tlingit language for linguistic research.
Our ML model calculates the biasness of a political article based on linguistic features and classifies them as biased towards the ruling government, bias towards the opposition, or neutral.
Linguistic Field Data Management and Analysis System [LiFE]
Supporting code for big-data analysis in linguistics
A Simple DOM Parser and Translation Tool using PHP, HTML, and MySQL. The translation model is supported for English to Odia language. There is a built in dictionary to support the translation.
DataLad superdataset including all the datasets currently managed by the LAAC/LSCP team
Prosodic analysis on NCSLGR corpus data
Sense Tagged Instances For Finnish
Linguistic corpora in Manx Gaelic
Word2Vec Model for Koine Greek Categorisation
Investigations into Evolutionary Linguistics using the Google Ngrams corpus. Sub-projects include Birth and Death of English Lexemes in Closed Lexical Classes | Lexicon Size
Natural Language Decorators - A collection of decorators to implement NLTK preprocessing steps.
Add a description, image, and links to the linguistic-corpora topic page so that developers can more easily learn about it.
To associate your repository with the linguistic-corpora topic, visit your repo's landing page and select "manage topics."