Skip to content

A hub of third-party providers and tutorials to help you instantly apply various NLP techniques.

License

Notifications You must be signed in to change notification settings

nicolay-r/nlp-thirdgate

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

95 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

nlp-thirdgate twitter

twitter

A hub of third-party NLP providers and tutorials to help you instantly handle your data iterator with no-string dependency apps.

The purpose is of this project is to share Third-party providers that could be combined into a single pipeline.

Third-Party Providers

Data Iterators

In this project we consider that each provider represent a wrapper over third-party app to handle iterator of data. We consider dict python type for representing each record of the data.

Pipeline Formation

If you wish to use several third-party providers all together for a data-iterators, it is recommented to adopt AREkit framework as a no-string solution for deploying pipeline that support batching mode.

No-string Application

  • bulk-chain -- framework for reasoning over your tabular data rows with any provided LLM
  • bulk-ner -- framework for a quick third-party models binding for entities extraction from cells of long tabular data
  • bulk-translate -- framework for translation of a massive stream of texts with native support of pre-annotated fixed-spans that are invariant for translator.
  • AREkit pipelines -- toolkit for handling your textual data iterators with various NLP providers