Skip to content

[C++] Add find_substring_regex kernel and implement ignore_case for find_substring #28857

@asfimport

Description

@asfimport

The find_substring compute function uses the MatchSubstringOptions Options class.  However, when I try to set ignore_case to TRUE, I get the following error:

 Error: NotImplemented: find_substring with ignore_case

R code to replicate the error is below, though depends on a currently unmerged branch:

df <- tibble(x = c("Foo and Bar", "baz and qux and quux"))

df %>%
      Table$create() %>%
      mutate(x = arrow_find_substring(x, options = list(pattern = "b", ignore_case = TRUE))) %>%
      collect()

Since case-insensitive search will be implemented using RE2, this is also an opportunity to add a find_substring_regex compute function.

Reporter: Nicola Crane / @thisisnic
Assignee: David Li / @lidavidm

Related issues:

PRs and other links:

Note: This issue was originally created as ARROW-13157. Please see the migration documentation for further details.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions