chore: Split explorer index projects over 23h #106563

shruthilayaj · 2026-01-20T14:06:50Z

Flagpole evaluations take about 10ms, which lets us evaluate about 60k projects
in 10 min. This is not fast enough. This PR is doing a couple things:

checks has_transaction flag since indexer only runs when transactions are enabled
splits up the projects over 23 hours (same logic as the seer job last hour is used for cleanup)

src/sentry/tasks/seer_explorer_index.py

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.}

cursor · 2026-01-20T14:22:26Z

src/sentry/tasks/seer_explorer_index.py

+            continue
+
+        if project.id % 23 != django_timezone.now().hour:
+            continue


Hour sharding uses modulo 23 but hours range 0-23

Medium Severity

The sharding logic project.id % 23 != django_timezone.now().hour has an off-by-one mismatch. project.id % 23 produces values 0-22 (23 distinct values), but django_timezone.now().hour produces values 0-23 (24 distinct values). When the current hour is 23, no projects will ever match because no project ID modulo 23 can equal 23. This means projects are never indexed during hour 23 (11pm) each day, creating a gap in the indexing schedule.

cursor · 2026-01-20T14:22:26Z

src/sentry/tasks/seer_explorer_index.py

+            continue
+
+        if project.id % 23 != django_timezone.now().hour:
+            continue


Hour check evaluated repeatedly can cause race condition

Medium Severity

The django_timezone.now().hour is called fresh for each project inside the RangeQuerySetWrapper loop. If the iteration takes time and crosses an hour boundary, projects evaluated before the boundary are checked against hour N, while projects evaluated after are checked against hour N+1. This causes some shard-N projects to be skipped (not processed for another 24 hours) and some shard-N+1 projects to be processed early (then again at the next scheduled run). The hour value needs to be captured once before the loop begins.

Seems unlikely to happen in prod but probably good practice in case it cases instability in tests.

sentry · 2026-01-20T14:51:55Z

src/sentry/tasks/seer_explorer_index.py

        Tuple of (project_id, organization_id)
    """
    projects = Project.objects.filter(status=ObjectStatus.ACTIVE).select_related("organization")
+    current_hour = django_timezone.now().hour

    for project in RangeQuerySetWrapper(
        projects,


Bug: The sharding logic project.id % 23 will never equal current_hour when the hour is 23, causing projects for that hour to be skipped.
_{Severity: MEDIUM}

Suggested Fix

To ensure all 24 hours are utilized, change the modulo divisor from 23 to 24. The condition should be updated to project.id % 24 == current_hour.

Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid. Location: src/sentry/tasks/seer_explorer_index.py#L40-L46 Potential issue: The project filtering logic uses the expression `project.id % 23 == current_hour` to distribute the indexing workload across 23 hours of the day. However, when the `current_hour` is 23, this condition can never be met because the result of a modulo 23 operation is always an integer between 0 and 22, inclusive. As a result, any projects that would have been scheduled for the 23rd hour will be skipped, and their data will not be indexed by the Seer service. This is a silent failure that affects a subset of projects.

_{Did we get this right? 👍 / 👎 to inform future reviews.}

Zylphrex · 2026-01-20T15:23:40Z

src/sentry/tasks/seer_explorer_index.py

+            continue
+
+        if project.id % 23 != django_timezone.now().hour:
+            continue


Seems unlikely to happen in prod but probably good practice in case it cases instability in tests.

shruthilayaj added 2 commits January 20, 2026 08:54

chore: Split explorer index projects over 23h

6a725bf

tests

fd590b3

github-actions bot added the Scope: Backend Automatically applied to PRs that change backend components label Jan 20, 2026

vercel bot deployed to Preview January 20, 2026 14:09 View deployment

tests

727979d

vercel bot deployed to Preview January 20, 2026 14:16 View deployment

shruthilayaj marked this pull request as ready for review January 20, 2026 14:17

sentry bot reviewed Jan 20, 2026

View reviewed changes

src/sentry/tasks/seer_explorer_index.py Outdated Show resolved Hide resolved

cursor bot reviewed Jan 20, 2026

View reviewed changes

move hour check out of the loop

55fb4c8

shruthilayaj requested a review from a team January 20, 2026 14:49

vercel bot deployed to Preview January 20, 2026 14:51 View deployment

sentry bot reviewed Jan 20, 2026

View reviewed changes

Zylphrex approved these changes Jan 20, 2026

View reviewed changes

shruthilayaj merged commit e4603b8 into master Jan 20, 2026
66 checks passed

shruthilayaj deleted the shruthi/chore/split-explorer-index-load-over-23-h branch January 20, 2026 16:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

chore: Split explorer index projects over 23h #106563

chore: Split explorer index projects over 23h #106563

shruthilayaj commented Jan 20, 2026 •

edited

Loading

Uh oh!

Uh oh!

cursor bot left a comment

Uh oh!

cursor bot Jan 20, 2026

Uh oh!

cursor bot Jan 20, 2026

Uh oh!

Zylphrex Jan 20, 2026

Uh oh!

sentry bot Jan 20, 2026

Uh oh!

Zylphrex Jan 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

chore: Split explorer index projects over 23h #106563

chore: Split explorer index projects over 23h #106563

Conversation

shruthilayaj commented Jan 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor bot Jan 20, 2026

Choose a reason for hiding this comment

Hour sharding uses modulo 23 but hours range 0-23

Uh oh!

cursor bot Jan 20, 2026

Choose a reason for hiding this comment

Hour check evaluated repeatedly can cause race condition

Uh oh!

Zylphrex Jan 20, 2026

Choose a reason for hiding this comment

Uh oh!

sentry bot Jan 20, 2026

Choose a reason for hiding this comment

Uh oh!

Zylphrex Jan 20, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

shruthilayaj commented Jan 20, 2026 •

edited

Loading