-
Notifications
You must be signed in to change notification settings - Fork 25
/
DESCRIPTION
50 lines (50 loc) · 1.94 KB
/
DESCRIPTION
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
Package: tokenizers
Type: Package
Title: Fast, Consistent Tokenization of Natural Language Text
Version: 0.3.1
Date: 2024-03-27
Description: Convert natural language text into tokens. Includes tokenizers for
shingled n-grams, skip n-grams, words, word stems, sentences, paragraphs,
characters, shingled characters, lines, Penn Treebank, regular
expressions, as well as functions for counting characters, words, and sentences,
and a function for splitting longer texts into separate documents, each with
the same number of words. The tokenizers have a consistent interface, and
the package is built on the 'stringi' and 'Rcpp' packages for fast
yet correct tokenization in 'UTF-8'.
License: MIT + file LICENSE
LazyData: yes
Authors@R: c(person("Thomas", "Charlon", role = c("aut", "cre"),
email = "charlon@protonmail.com",
comment = c(ORCID = "0000-0001-7497-0470")),
person("Lincoln", "Mullen", role = c("aut"),
email = "lincoln@lincolnmullen.com",
comment = c(ORCID = "0000-0001-5103-6917")),
person("Os", "Keyes", role = c("ctb"),
email = "ironholds@gmail.com",
comment = c(ORCID = "0000-0001-5196-609X")),
person("Dmitriy", "Selivanov", role = c("ctb"),
email = "selivanov.dmitriy@gmail.com"),
person("Jeffrey", "Arnold", role = c("ctb"),
email = "jeffrey.arnold@gmail.com",
comment = c(ORCID = "0000-0001-9953-3904")),
person("Kenneth", "Benoit", role = c("ctb"),
email = "kbenoit@lse.ac.uk",
comment = c(ORCID = "0000-0002-0797-564X")))
URL: https://docs.ropensci.org/tokenizers/, https://github.com/ropensci/tokenizers
BugReports: https://github.com/ropensci/tokenizers/issues
RoxygenNote: 7.3.1
Depends:
R (>= 3.1.3)
Imports:
stringi (>= 1.0.1),
Rcpp (>= 0.12.3),
SnowballC (>= 0.5.1)
LinkingTo: Rcpp
Encoding: UTF-8
Suggests:
covr,
knitr,
rmarkdown,
stopwords (>= 0.9.0),
testthat
VignetteBuilder: knitr