corpora
Here are 62 public repositories matching this topic...
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
-
Updated
Jun 12, 2025 - Python
Data repository for pretrained NLP models and NLP corpora.
-
Updated
Mar 16, 2018 - Python
Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).
-
Updated
Jul 27, 2023 - Python
CrossNER: Evaluating Cross-Domain Named Entity Recognition (AAAI-2021)
-
Updated
Jan 5, 2021 - Python
Unannotated Spanish 3 Billion Words Corpora
-
Updated
Oct 20, 2022 - Python
Automatic categorization of documents, consists in assigning a category to a text based on the information it contains. We'll follow different approach of Supervised Machine Learning.
-
Updated
Jan 1, 2019 - Python
[NLPCC 2023] CCAE: A Corpus of Chinese-based Asian Englishes
-
Updated
Dec 6, 2023 - Python
Named Entity Recognition for biomedical entities
-
Updated
Jan 11, 2023 - Python
Official source for Spanish pretrained biomedical and clinical language models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).
-
Updated
Nov 16, 2022 - Python
repo for Tibetan corpora
-
Updated
Apr 10, 2023 - Python
An unofficial Python API that allows users to create a corpus of lyrical text from their favorite artists and billboard charts
-
Updated
Jul 2, 2018 - Python
The Potsdam Twitter Sentiment Corpus
-
Updated
Jan 15, 2020 - Python
OPUS (opus.nlpl.eu) Python3 API
-
Updated
Nov 23, 2024 - Python
Measure the similarity of text corpora for 74 languages
-
Updated
Jan 26, 2024 - Python
Multilingual text corpus designed to study multilingual and cross-lingual natural language understanding (NLU) models and the strategies of localization of virtual assistants
-
Updated
Jun 15, 2025 - Python
Scripts for building a geo-located web corpus using Common Crawl data
-
Updated
Nov 3, 2025 - Python
Improve this page
Add a description, image, and links to the corpora topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the corpora topic, visit your repo's landing page and select "manage topics."