Add global cache for pushed times #612

bluekeyes · 2023-07-24T00:28:51Z

This might be a premature optimization, but after testing, I'm worried that listing statuses on busy repositories will end up being too expensive, even with the HTTP caching. Since these values are safe to cache forever, add a simple in-memory LRU cache in each server instance to minimize API calls.

Also add a test for the PushedAt context function that verifies the caching behavior.

bluekeyes · 2023-07-24T23:31:21Z

pull/github.go

 	ghc.pushedAt[sha] = pushedAt
+	if gc := ghc.globalCache; gc != nil {
+		gc.SetPushedAt(repoID, sha, pushedAt)


I need to fix this so that it doesn't cache the zero time, since that indicates "no pushed date" and is not globally cacheable.

Note that we might end up with the same commit having two pushed times as a result: the first evaluation will use the evaluation time, and future evaluations will use the time of the first status. I'd like to cache the evaluation time instead, but I'm still trying to figure out how to do that in a way that makes sense, since it interacts with the commit filtering implemented at a different level.

bluekeyes · 2023-07-25T05:36:09Z

Note for myself on how to fix the caching when there are no statuses:

1. Remove search and commit sorting logic from the approval evaluator
2. Change PushedAt to always return a valid time
3. In PushedAt, if the commit does not have any statuses, list all commits and then work towards the head until we find a status. If we hit the head commit with no status, use the evaluation timestamp.
4. Record the discovered time as the push time for all SHAs evaluated in (3)

This scanning will only happen when there are ignored commits (otherwise, we're always starting with the head commit), so should be pretty rare and the global cache will also help reduce the expense.

This will also fix a bug when evaluating PRs that have a batch push ending in an ignored commit followed directly by another push containing only ignored commits.

This required a bigger change than expected because we want to cache the evaluation timestamp when we use it. This meant moving the batch checking into the PushedAt function instead of handling it in the approval logic. The new version is closer to how things used to work when we loaded the pushedDate field.

bluekeyes mentioned this pull request Jul 24, 2023

Implement invalidate_on_push without pushedDate field #602

Merged

asvoboda approved these changes Jul 24, 2023

Base automatically changed from bkeyes/remove-pushed-date to develop July 24, 2023 21:42

bluekeyes and others added 2 commits July 24, 2023 14:43

Run builds on stacked PRs

0605729

bluekeyes force-pushed the bkeyes/cache-pushed-date branch from d1dc5db to 0605729 Compare July 24, 2023 21:43

bluekeyes added 2 commits July 25, 2023 14:43

Merge branch 'develop' into bkeyes/cache-pushed-date

81d031e

asvoboda approved these changes Jul 26, 2023

bluekeyes merged commit 2655fb0 into develop Jul 26, 2023
5 checks passed

bluekeyes deleted the bkeyes/cache-pushed-date branch July 26, 2023 00:58

rallydan mentioned this pull request Jul 27, 2023

Policy bot stuck on Commit hash does not have a pushed date #598

Closed
