Pin CacheEntryStatsCollector to fix performance bug #8385

Closed

Conversation

@pdillinger (Contributor) commented Jun 10, 2021

Summary: If the block Cache is full with strict_capacity_limit=false,
then our CacheEntryStatsCollector could be immediately evicted on
release, so iterating through column families with shared block cache
could trigger re-scan for each CF. This change fixes that problem by
pinning the CacheEntryStatsCollector from InternalStats so that it's not
evicted.

I had originally thought that this object could participate in LRU like
everything else, but even though a re-load+re-scan only touches memory,
it can be orders of magnitude more expensive than other cache misses.
One service at Facebook has scans that take ~20s over a 100GB block cache
that is mostly 4KB entries. (The up-side of this bug and #8369 is that
we had a natural experiment on the effect on some service metrics even
with block cache scans running continuously in the background--a kind
of worst case scenario. Metrics like latency were not affected enough
to trigger warnings.)
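
For illustration, the core of the fix is simply to keep a long-lived strong reference to the collector inside InternalStats, so that releasing the corresponding cache handle can no longer destroy the collector and force a re-scan. The sketch below is a minimal, hypothetical version of that pattern; StatsCollector and PinnedCollectorHolder are illustrative stand-ins, not the actual RocksDB classes.

```cpp
#include <memory>
#include <mutex>

// Illustrative stand-in for CacheEntryStatsCollector: re-scans the block
// cache only when its saved results are older than some max age.
struct StatsCollector {
  void CollectIfStale() { /* re-scan the cache if saved stats are too old */ }
};

// Hypothetical holder mirroring the role of InternalStats: as long as this
// object holds the shared_ptr, the collector stays alive, so iterating over
// column families that share a block cache reuses one collector instead of
// re-creating (and re-scanning with) a new one per column family.
class PinnedCollectorHolder {
 public:
  std::shared_ptr<StatsCollector> GetOrCreateCollector() {
    std::lock_guard<std::mutex> lock(mu_);
    if (!collector_) {
      collector_ = std::make_shared<StatsCollector>();
    }
    return collector_;  // pinned for the lifetime of the holder
  }

 private:
  std::mutex mu_;
  std::shared_ptr<StatsCollector> collector_;
};
```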

Other smaller fixes:

20s is already a sizable portion of the 600s stats dump period, or the 180s
default max age to force re-scan, so this change adds logic to ensure that
(for each block cache) we don't spend more than 0.2% of our background thread
time scanning it (see the sketch below). Nevertheless, "foreground" requests
for cache entry stats (calls to
db->GetMapProperty(DB::Properties::kBlockCacheEntryStats)) are permitted to
consume more CPU.

Renamed field to cache_entry_stats_ to match code style.
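
To make the 0.2% budget concrete: if one scan takes roughly 20 seconds, background-triggered collection has to wait on the order of 500x the scan duration (about 10,000s) before scanning again, which dominates both the 600s dump period and the 180s default max age. The sketch below only illustrates that interval math; the function name and the exact foreground factor are assumptions, not the actual RocksDB code.

```cpp
#include <algorithm>
#include <cstdint>

// Illustrative sketch: cap background scans at ~0.2% of background thread
// time by stretching the effective "max age" to at least 500x (== 1 / 0.002)
// the duration of the last scan. Foreground callers are assumed to tolerate
// a tighter factor; the exact numbers here are illustrative.
uint64_t EffectiveMaxAgeMicros(uint64_t requested_max_age_micros,
                               uint64_t last_scan_duration_micros,
                               bool foreground) {
  const uint64_t min_interval_factor = foreground ? 10 : 500;
  return std::max(requested_max_age_micros,
                  min_interval_factor * last_scan_duration_micros);
}
```

For example, with a 20s scan and a 180s requested max age, a background caller would not re-scan for std::max(180s, 500 * 20s) = 10,000s, while a foreground db->GetMapProperty(DB::Properties::kBlockCacheEntryStats) call would wait only std::max(180s, 10 * 20s) = 200s under these assumed factors.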

This change is intended for patching into the 6.21 release.

Test Plan: unit test expanded to cover new logic (detect regression),
some manual testing with db_bench

@facebook-github-bot

@pdillinger has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot

@pdillinger has updated the pull request. You must reimport the pull request before landing.

@facebook-github-bot

@pdillinger has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@ajkr (Contributor) left a comment

LGTM; adding Cache::Priority::HIGH and the percentage-based gap between scans both seem like improvements.

I didn't quite understand whether the goal is to entirely prevent edge cases of high collection frequency or just to make them less likely. My current understanding is that it does the latter.

  max_age_micros = std::max(
      max_age_micros, min_interval_factor * (last_end_time_micros_ -
                                             last_start_time_micros_));
}
Nice, this is easier to understand than the original. I guess that's due to only using std::max().

  // difficult to access that setting from here with just cfd_
  collector->GetStats(&cache_entry_stats);

Status InternalStats::CollectCacheEntryStats(bool foreground) {
  // Lazy initialize/reference the collector. It is pinned in cache (through

Assuming you mean pinned in memory, please correct if I'm mistaken...

I wonder if different InternalStatses could have different collectors forever if the initial collector each one creates is quickly ejected.

@ajkr (Contributor) commented Jun 12, 2021

Damn, right after submitting my review I remembered about MakeSharedCacheHandleGuard. Ignore my doubts, I agree it should entirely prevent edge cases of high collection frequency.
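
For context, the handle-guard idea referenced here can be sketched as follows: wrap a cache handle in a shared_ptr whose deleter releases the handle, so the entry stays pinned (and every caller that looks it up gets the same collector object) for as long as any copy of the shared_ptr survives. The SimpleCache interface and MakeSharedHandleGuard name below are simplified stand-ins for illustration, not the actual RocksDB Cache API.

```cpp
#include <memory>
#include <string>

// Simplified stand-in for a cache that pins entries while a handle is open.
struct SimpleCache {
  struct Handle;                      // opaque pinned reference to an entry
  Handle* Lookup(const std::string& key);
  void* Value(Handle* handle);        // object stored in the entry
  void Release(Handle* handle);       // unpin; entry becomes evictable
};

// Hedged sketch of a "shared cache handle guard": the returned shared_ptr
// points at the cached object, and its deleter releases the cache handle,
// so the entry cannot be evicted while any copy of the shared_ptr is alive.
template <typename T>
std::shared_ptr<T> MakeSharedHandleGuard(SimpleCache* cache,
                                         const std::string& key) {
  SimpleCache::Handle* handle = cache->Lookup(key);
  if (handle == nullptr) {
    return nullptr;  // a real implementation would insert and retry here
  }
  T* value = static_cast<T*>(cache->Value(handle));
  // The deleter releases the cache handle rather than deleting the object;
  // the cache itself owns the object's lifetime.
  return std::shared_ptr<T>(value, [cache, handle](T*) {
    cache->Release(handle);
  });
}
```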

@facebook-github-bot

@pdillinger merged this pull request in d5a46c4.

pdillinger added a commit that referenced this pull request Jun 14, 2021
Pull Request resolved: #8385

Reviewed By: ajkr

Differential Revision: D29042759

Pulled By: pdillinger

fbshipit-source-id: 236faa902397f50038c618f50fbc8cf3f277308c