General information about the WorldModelers repos related to reading and assembly

WorldModelers/CorpusIngestionAndAssembly

Corpus Ingestion and Assembly

General information about the WorldModelers repos related to reading and assembly

Additional documentation is available at the website for this repo.

Reading and Assembly Software Components

This section describes the machine reading and assembly repositories.

Eidos

Eidos is the machine reading system developed by the CLU lab at the University of Arizona. This repository includes the reading software as well as code for integrating with DART.

Concept Discovery

This is the concept identification component, which is used by the ontology-in-a-day (OIAD) system.

Hume

Hume is BBN's machine reading system, which extracts causal analysis graphs (CAGs) and supports OIAD clustering. It leverages the following software:

Text-Open contains Java and Python APIs for reading and writing BBN's SerifXML format, which is BBN's internal representation of documents and information.

LearnIt is a tool for customizing machine readers (a.k.a. information extraction algorithms) with a human in the loop. Within WM, we also use it as a pattern-based extractor for event extraction.

NLPLingo is BBN's deep learning toolkit for event extraction and causal relation extraction.

CSerif contains C/C++ code for pre-reading texts into BBN's SerifXML format. We are working on open-sourcing this.

Sofia

INDRA World

Ontology

The reading and assembly software components developed under the World Modelers program share the same underlying ontology.
