Skip to content

Lexico-syntactic pattern extractor. Uses GATE Embedded and OpenNLP pipeline.

Notifications You must be signed in to change notification settings

vborovin/lspExtractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Lexico-Syntactic Pattern Extractor

Tool to automatically extract the most generic lexico-syntactic patterns from scientific texts on English.

Uses GATE Embedded, expects GATE's OpenNLP pipeline as .gapp with OpenNLP Parser step included.

Extraction process:

  • Document preprocessing (normalization, brackets removal, etc.)
  • Pipeline execution
  • Syntactic trees rebuilding from GATE annotations
  • Syntactic trees generalization
  • Extracted phrases distribution calculation
  • (TODO) Phrase filtering
  • (TODO) JAPE pattern generation

About

Lexico-syntactic pattern extractor. Uses GATE Embedded and OpenNLP pipeline.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published