Closed
Description
The find_substring compute function uses the MatchSubstringOptions options class. However, when I try to set ignore_case to TRUE, I get the following error:
Error: NotImplemented: find_substring with ignore_case
R code to replicate the error is below, though it depends on a currently unmerged branch:
library(arrow)
library(dplyr)

df <- tibble(x = c("Foo and Bar", "baz and qux and quux"))
df %>%
  Table$create() %>%
  mutate(x = arrow_find_substring(x, options = list(pattern = "b", ignore_case = TRUE))) %>%
  collect()

Since case-insensitive search will be implemented using RE2, this is also an opportunity to add a find_substring_regex compute function.
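For illustration, here is a rough base-R sketch of the results one might expect from a case-insensitive find_substring and from a find_substring_regex. It is not the Arrow implementation and does not use RE2. The helper names to_zero_based, find_substring_ci and find_substring_regex_ci are hypothetical, and the sketch assumes the kernels report a 0-based index of the first match, or -1 when there is none. The input is ASCII, so character and byte offsets coincide.

```r
# Hypothetical helpers in base R only; NOT the Arrow kernels.
# Assumption: results are 0-based indices, with -1 meaning "no match".

# Convert regexpr()'s 1-based positions (-1 for no match) to 0-based.
to_zero_based <- function(pos) {
  pos <- as.integer(pos)  # drop the match.length attributes
  ifelse(pos < 0L, -1L, pos - 1L)
}

# Literal, case-insensitive search: lower-case both sides, match as fixed text.
find_substring_ci <- function(x, pattern) {
  to_zero_based(regexpr(tolower(pattern), tolower(x), fixed = TRUE))
}

# Regex, case-insensitive search using base R's regex engine.
find_substring_regex_ci <- function(x, pattern) {
  to_zero_based(regexpr(pattern, x, ignore.case = TRUE))
}

x <- c("Foo and Bar", "baz and qux and quux")
find_substring_ci(x, "b")           # 8 0
find_substring_regex_ci(x, "qu+x")  # -1 8
```

With ignore_case = FALSE, the literal pattern "b" would not match "Foo and Bar" at all, which is why the option changes the first result from -1 to 8.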
Reporter: Nicola Crane / @thisisnic
Assignee: David Li / @lidavidm
Related issues:
- [C++] String algorithm library for StringArray/BinaryArray (is a child of)
- [C++] Substring find position kernel (relates to)
- [C++] Implement case insensitive match in match_substring(_regex) and match_like (relates to)
- [R] Bindings for find_substring and find_substring_regex (is depended upon by)
Note: This issue was originally created as ARROW-13157. Please see the migration documentation for further details.