Add support for regex queries on new wildcard field #54725

Closed
@markharwood

Description

This feature is largely about building good approximation queries on the ngram index, so that as few documents as possible need verification against an automaton built from the regex.
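A minimal sketch of that two-phase idea, under stated assumptions: the in-memory postings map, the class name `TwoPhaseRegexSketch`, and the hand-supplied list of required trigrams are all hypothetical, and `java.util.regex.Pattern` stands in for the Lucene automaton (its syntax differs from Lucene's regex syntax). It only illustrates how a cheap ngram filter can shrink the set of values that the expensive verification step has to examine.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;
import java.util.regex.Pattern;

class TwoPhaseRegexSketch {
    // Toy inverted index (hypothetical): trigram -> ids of values containing it.
    private final Map<String, Set<Integer>> postings = new HashMap<>();
    private final List<String> docs = new ArrayList<>();

    void add(String value) {
        int id = docs.size();
        docs.add(value);
        for (int i = 0; i + 3 <= value.length(); i++) {
            postings.computeIfAbsent(value.substring(i, i + 3), k -> new TreeSet<>()).add(id);
        }
    }

    // Phase 1: approximation. Keep only values that contain every required trigram.
    Set<Integer> candidates(List<String> requiredTrigrams) {
        Set<Integer> result = new TreeSet<>();
        for (int id = 0; id < docs.size(); id++) {
            result.add(id);
        }
        for (String gram : requiredTrigrams) {
            result.retainAll(postings.getOrDefault(gram, Collections.<Integer>emptySet()));
        }
        return result;
    }

    // Phase 2: verification. Run the full pattern only on the surviving candidates.
    List<String> search(String regex, List<String> requiredTrigrams) {
        Pattern pattern = Pattern.compile(regex); // stand-in for the automaton
        List<String> hits = new ArrayList<>();
        for (int id : candidates(requiredTrigrams)) {
            if (pattern.matcher(docs.get(id)).matches()) {
                hits.add(docs.get(id));
            }
        }
        return hits;
    }
}
```

In this sketch the approximation may return false positives (values that contain the trigrams but fail the pattern), which phase 2 removes. It must never drop a value that the pattern would accept.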

Lucene's RegExp.toStringTree() method gives a good template for walking a parsed regex query's logic. Rather than building a string, we can do something similar that builds an approximation BooleanQuery on the 3-gram index. This logic will have to walk a line between:

  1. Being selective enough to efficiently narrow the set of documents considered and
  2. Not being so restrictive that it introduces false negatives (ignoring docs that should match).
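The trade-off above can be sketched with a toy tree walk. Everything here is an assumption made for illustration: the `Node` type is a hypothetical miniature regex tree rather than Lucene's RegExp, and the result is a plain string in an ad hoc AND/OR notation rather than a real BooleanQuery. The rule it follows is that any sub-expression that cannot be expressed in 3-grams degrades to match-all instead of risking a false negative.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class ApproximationSketch {
    // Hypothetical miniature regex tree; not Lucene's RegExp class.
    enum Kind { LITERAL, CONCAT, UNION, REPEAT, ANY }

    static final class Node {
        final Kind kind;
        final String text;
        final List<Node> children;

        Node(Kind kind, String text, Node... children) {
            this.kind = kind;
            this.text = text;
            this.children = Arrays.asList(children);
        }
    }

    static Node lit(String s) { return new Node(Kind.LITERAL, s); }
    static Node concat(Node... c) { return new Node(Kind.CONCAT, null, c); }
    static Node union(Node... c) { return new Node(Kind.UNION, null, c); }
    static Node repeat(Node c) { return new Node(Kind.REPEAT, null, c); }
    static Node any() { return new Node(Kind.ANY, null); }

    // Marker for "no constraint": every document remains a candidate.
    static final String MATCH_ALL = "*";

    static String join(String op, List<String> clauses) {
        if (clauses.size() == 1) {
            return clauses.get(0);
        }
        return "(" + String.join(" " + op + " ", clauses) + ")";
    }

    static String approximate(Node n) {
        switch (n.kind) {
            case LITERAL: {
                // Every 3-gram of a literal must be present in a matching value.
                List<String> grams = new ArrayList<>();
                for (int i = 0; i + 3 <= n.text.length(); i++) {
                    grams.add(n.text.substring(i, i + 3));
                }
                // Literals shorter than 3 characters yield no constraint in this sketch.
                return grams.isEmpty() ? MATCH_ALL : join("AND", grams);
            }
            case CONCAT: {
                // All parts are required; unconstrained parts are simply dropped.
                List<String> required = new ArrayList<>();
                for (Node child : n.children) {
                    String q = approximate(child);
                    if (!q.equals(MATCH_ALL)) {
                        required.add(q);
                    }
                }
                return required.isEmpty() ? MATCH_ALL : join("AND", required);
            }
            case UNION: {
                // One unconstrained branch makes the whole union unconstrained;
                // anything stricter would introduce false negatives.
                List<String> options = new ArrayList<>();
                for (Node child : n.children) {
                    String q = approximate(child);
                    if (q.equals(MATCH_ALL)) {
                        return MATCH_ALL;
                    }
                    options.add(q);
                }
                return join("OR", options);
            }
            default:
                // REPEAT may match zero times and ANY matches any character.
                return MATCH_ALL;
        }
    }
}
```

For a tree equivalent to `foob(ar|az).*`, this sketch keeps only the constraints from the `foob` prefix, because the two-character alternatives and the trailing wildcard contribute nothing expressible in 3-grams. It also ignores 3-grams that span adjacent nodes, which costs selectivity but never correctness.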

Labels

:Search/Search (Search-related issues that do not fall into other categories)
>enhancement
Team:Search (Meta label for search team)