Add LeadingStrings benchmarks for binary and non-ASCII regex patterns#6

Closed
danmoseley wants to merge 1 commit into main from regex-redux/benchmarks

@danmoseley
Owner

Adds two new benchmark classes to exercise the LeadingStrings vs FixedDistanceSets heuristic in the regex engine:

  • Perf_Regex_LeadingStrings_BinaryData: 1 MB binary corpus (PE-header-like seed duplicated), alternation of binary patterns. Validates no regression on non-text input.
  • Perf_Regex_LeadingStrings_NonAscii: ~100 KB Russian text (Anna Karenina opening), alternation of Russian words. Validates no regression on non-ASCII text where the frequency heuristic bails out.
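
For readers unfamiliar with the shape of such a benchmark, here is a minimal sketch of how the binary-data case might look with BenchmarkDotNet. It is not the code from this PR: the class name, the seed bytes, and the alternation pattern are all hypothetical, chosen only to show a PE-header-like seed duplicated to roughly 1 MB and searched with an alternation of binary literals.

```csharp
using System;
using System.Text;
using System.Text.RegularExpressions;
using BenchmarkDotNet.Attributes;

// Hypothetical sketch: the class name, seed bytes, and pattern are
// illustrative assumptions, not the benchmark added by this PR.
public class LeadingStringsBinaryDataSketch
{
    private const int CorpusLength = 1024 * 1024; // about 1 MB worth of chars

    private string _input = string.Empty;
    private Regex _regex = new Regex(string.Empty);

    [Params(RegexOptions.None, RegexOptions.Compiled)]
    public RegexOptions Options { get; set; }

    [GlobalSetup]
    public void Setup()
    {
        // Assumed seed: "MZ" magic, a few header-like bytes, then "PE\0\0".
        byte[] seedBytes =
        {
            0x4D, 0x5A, 0x90, 0x00, 0x03, 0x00, 0x00, 0x00,
            0x04, 0x00, 0x00, 0x00, 0xFF, 0xFF, 0x00, 0x00,
            0x50, 0x45, 0x00, 0x00,
        };

        // Map each byte to the char with the same code point so the
        // regex engine sees non-text input.
        string seed = new string(Array.ConvertAll(seedBytes, b => (char)b));

        // Duplicate the seed until the corpus reaches the target length.
        var sb = new StringBuilder(CorpusLength + seed.Length);
        while (sb.Length < CorpusLength)
        {
            sb.Append(seed);
        }
        _input = sb.ToString(0, CorpusLength);

        // Alternation of binary literals, written with regex escapes.
        _regex = new Regex(@"MZ\x90|PE\x00\x00|\xFF\xFF", Options);
    }

    [Benchmark]
    public int Count() => _regex.Count(_input);
}
```

A non-ASCII variant would follow the same structure, swapping the seed for Russian text and the pattern for an alternation of Russian words. `Regex.Count` requires .NET 7 or later.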

Companion to dotnet/runtime change: danmoseley/runtime#31

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>