Skip to content

Commit

Permalink
bug fix related to accented chars and 'iconv()', issue #9
Browse files Browse the repository at this point in the history
  • Loading branch information
ChrisMuir committed Apr 1, 2018
1 parent f961b8f commit 9b5e108
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion R/get_fingerprint.R
Original file line number Diff line number Diff line change
Expand Up @@ -64,8 +64,8 @@ get_fingerprint_ngram <- function(vect, numgram = 2, bus_suffix = TRUE,
vect <- gsub(regex, "", vect, perl = TRUE)
# Rest of the transformations. For each value in vect: get ngrams, filter by
# unique, sort alphabetically, paste back together, and normalize encoding.
vect <- iconv(vect, to = "ASCII//TRANSLIT")
vect <- strsplit(vect, "", fixed = TRUE)
vect <- cpp_get_char_ngrams(vect, numgram = numgram)
vect <- iconv(vect, to = "ASCII//TRANSLIT")
return(vect)
}

0 comments on commit 9b5e108

Please sign in to comment.