11
2- [ ![ DOI] ( https://zenodo.org/badge/DOI/10.5281/zenodo.5939368 .svg )] ( https://doi.org/10.5281/zenodo.5939368 )
2+ [ ![ DOI] ( https://zenodo.org/badge/DOI/10.5281/zenodo.7040475 .svg )] ( https://doi.org/10.5281/zenodo.7040475 )
33[ ![ License: GPL
44v3] ( http://img.shields.io/badge/License-GPLv3-blue.svg )] ( https://www.gnu.org/licenses/gpl-3.0 )
55[ ![ CRAN_Status_Badge] ( http://www.r-pkg.org/badges/version/RcppCWB )] ( https://cran.r-project.org/package=RcppCWB )
@@ -124,11 +124,7 @@ with a temporary registry.
124124
125125``` r
126126library(RcppCWB )
127- if (! check_pkg_registry_files()){
128- registry <- use_tmp_registry()
129- } else {
130- registry <- get_pkg_registry()
131- }
127+ registry <- use_tmp_registry()
132128```
133129
134130To start with, we get the number of tokens of the corpus.
@@ -157,21 +153,21 @@ To get the corpus positions of a token.
157153
158154``` r
159155token_to_get <- " oil"
160- id_oil <- cl_str2id(corpus = " REUTERS" , p_attribute = " word" , str = token_to_get )
161- cpos_oil <- cl_id2cpos <- cl_id2cpos(corpus = " REUTERS" , p_attribute = " word" , id = id_oil )
156+ id_oil <- cl_str2id(corpus = " REUTERS" , p_attribute = " word" , str = token_to_get , registry = registry )
157+ cpos_oil <- cl_id2cpos <- cl_id2cpos(corpus = " REUTERS" , p_attribute = " word" , id = id_oil , registry = registry )
162158```
163159
164160Get the frequency of token.
165161
166162``` r
167- oil_freq <- cl_id2freq(corpus = " REUTERS" , p_attribute = " word" , id = id_oil )
163+ oil_freq <- cl_id2freq(corpus = " REUTERS" , p_attribute = " word" , id = id_oil , registry = registry )
168164```
169165
170166Using regular expressions.
171167
172168``` r
173- ids <- cl_regex2id(corpus = " REUTERS" , p_attribute = " word" , regex = " M.*" )
174- m_words <- cl_id2str(corpus = " REUTERS" , p_attribute = " word" , id = ids )
169+ ids <- cl_regex2id(corpus = " REUTERS" , p_attribute = " word" , regex = " M.*" , registry = registry )
170+ m_words <- cl_id2str(corpus = " REUTERS" , p_attribute = " word" , id = ids , registry = registry )
175171```
176172
177173To use the CQP syntax, we need to initialize CQP first.
@@ -189,7 +185,7 @@ cqp_initialize(registry = registry)
189185cqp_query(corpus = " REUTERS" , query = ' "crude" "oil"' )
190186```
191187
192- ## NULL
188+ ## <pointer: 0x600001c743c0>
193189
194190``` r
195191cpos <- cqp_dump_subcorpus(corpus = " REUTERS" )
0 commit comments