ref(grouping): Cache enhancements in split form #92000

lobsterkatie · 2025-05-20T23:52:46Z

Right now, even when we're using split enhancements, the version we cache isn't split, which means that every time we pull them out of the cache, the work to split the rules has to be done all over again. This is obviously not ideal, so this PR changes the way we compute (and parse) the cached string so that it now includes the split rules.

Since the Rust enhancer can only parse base64 strings containing a single set of rules, this is done by computing a separate string for each of the three kinds of rules (rules in their original/pre-split form, classifier rules, and contributes rules) and then concatenating them, separated by a character which can never appear in base64. Older base64 strings containing only the original rules and no delimiter can still be parsed, because for them, splitting on the delimiter is a no-op and yields a single string.
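To illustrate the concatenation scheme described above, here is a minimal sketch, not the code from this PR. The delimiter `#`, the function names `encode_rule_sets` and `decode_rule_sets`, and the use of raw bytes as a stand-in for serialized rules are all assumptions made for illustration; the real enhancer does its own serialization before base64-encoding.

```python
import base64

# Assumption: "#" is used here only because it is outside the base64 alphabet
# (A-Z, a-z, 0-9, "+", "/", "-", "_", "="). The PR does not name the delimiter.
DELIMITER = "#"


def encode_rule_sets(rule_sets: list[bytes]) -> str:
    """Base64-encode each set of rules separately, then join with the delimiter."""
    return DELIMITER.join(
        base64.urlsafe_b64encode(rules).decode("ascii") for rules in rule_sets
    )


def decode_rule_sets(cached: str) -> list[bytes]:
    """Split on the delimiter and decode each part.

    A legacy string with no delimiter splits into a one-element list,
    so it decodes to just the original rules.
    """
    return [base64.urlsafe_b64decode(part) for part in cached.split(DELIMITER)]


# Hypothetical payloads standing in for the three serialized rule sets.
combined = encode_rule_sets([b"original", b"classifier", b"contributes"])
legacy = encode_rule_sets([b"original"])
```

With this shape, a caller can tell the two cache formats apart by the length of the decoded list: three entries for the split form, one for an older entry that predates the change.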

codecov · 2025-05-21T00:12:26Z

Codecov Report

All modified and coverable lines are covered by tests ✅

⚠️ Parser warning

The parser emitted a warning. Please review your JUnit XML file:

Warning while parsing testcase attributes: Limit of string is 1000 chars, for name, we got 2083 at 1:157235 in /home/runner/work/sentry/sentry/.artifacts/pytest.junit.xml

Additional details and impacted files

@@             Coverage Diff             @@
##           master   #92000       +/-   ##
===========================================
+ Coverage   36.78%   87.63%   +50.84%     
===========================================
  Files        9662    10359      +697     
  Lines      540758   587563    +46805     
  Branches    22604    22604               
===========================================
+ Hits       198930   514885   +315955     
+ Misses     341407    72257   -269150     
  Partials      421      421

yuvmen

looks good 👍

lobsterkatie added 6 commits May 20, 2025 15:54

iterate over a list in from_base64_string

9b09ac7

create the list by splitting

6dcf310

iterate over a list in base64_string

635b2da

handle concatenated base64 strings

912e326

include split rules in base64 string

fa9a585

fix test

7bc597e

github-actions bot added the Scope: Backend label May 20, 2025

lobsterkatie marked this pull request as ready for review May 21, 2025 01:30

lobsterkatie requested a review from a team as a code owner May 21, 2025 01:30

yuvmen approved these changes May 22, 2025

lobsterkatie merged commit 6a4e926 into master May 22, 2025
62 checks passed

lobsterkatie deleted the kmclb-cache-enhancements-in-split-form branch May 22, 2025 18:53

github-actions bot locked and limited conversation to collaborators Jun 7, 2025
