Find the longest non-taboo sequence

Find the longest sequence of tokens in a text without any taboo n-grams.

Usage

Given a set of taboo n-grams and a text, find the longest sequence of tokens without any of the taboo n-grams. An n-gram is represented as a string of space-separated tokens.

$ longestnontaboo.py [--len] <filenames...>

Specify taboo n-grams with one n-gram per line in taboofile; multiple filenames can be specified. By default the output is the longest non-taboo sequence. With --len, only the length of this sequence is reported.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
LICENSE		LICENSE
README.md		README.md
longest non-taboo sequence.ipynb		longest non-taboo sequence.ipynb
longestnontaboo.py		longestnontaboo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Find the longest non-taboo sequence

Usage

About

Releases

Packages

Languages

License

andreasvc/longestnontabooseq

Folders and files

Latest commit

History

Repository files navigation

Find the longest non-taboo sequence

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages