OPIEC is an Open Information Extraction (OIE) corpus, consisted of more than 341M triples extracted from the entire English Wikipedia. Each triple is consisted of rich meta-data: each token from the subj/obj/rel along with NLP annotations (POS tag, NER tag, ...), provenance sentence along with the dependency parse, original (golden) links from Wikipedia, ... For more detailed explanation of the meta-data, see here.
-
Notifications
You must be signed in to change notification settings - Fork 6
Reading the data from OPIEC - an Open Information Extraction corpus
License
uma-pi1/OPIEC
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Reading the data from OPIEC - an Open Information Extraction corpus
Topics
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published