ARROW-10655: [C++] Add cache and memoization facility #8716
Conversation
Force-pushed from 6893477 to 312b406.
Rebased.
Force-pushed from 312b406 to a8fff45.
@nealrichardson I'll probably remove the two-level replacement cache, which doesn't seem useful after all.
Force-pushed from 1a3a5c9 to f24b8d6.
I've updated this PR with a thread-unsafe memoizer and revamped benchmarks. I'll let @bkietz take a look.
Force-pushed from d4608a9 to 356c300.
```cpp
static void MemoizeLRUCached(benchmark::State& state) {
  const auto keys = MakeStrings(kCacheSize, state.range(0));
  const auto values = MakeStrings(kCacheSize, state.range(1));
```
I'd also be interested to see benchmarks where the size of the string set is kCacheSize * [0.5, 0.9, 1.1, 2]
In this benchmark I'm mostly interested in measuring the overhead of the memoize pattern rather than the LRU cache itself. I don't think varying the size of the string set would vary the measured overhead.
If the string set is exactly as large as the cache, then the only overhead you're measuring is promotion inside a static set. By contrast, if the string set is larger than the cache, you will also measure the cost of replacement. The latter seems useful to know when deciding how large to make a cache, since it represents the penalty for guessing too low.
Force-pushed from c283ba7 to aedc40e.
@bkietz Any other concerns?
Same naming style nit I've brought up before: LruCacheLookup? Maybe we should just make this another exception to the style guide?
+1 for renaming
LGTM, no blocking concerns
Implement an LRU cache and associated memoization factories.

The simple thread-safe memoization ends up 2x to 3x slower than a thread-unsafe memoization, in part because the thread-safe memoization has to copy the return value. Therefore, it can be more desirable to use the thread-unsafe memoization with a thread_local specifier.

Benchmarks (arg 1: key size, arg 2: value size):
```
LRUCacheLookup/8/16                    1877 ns  1877 ns  359452  items_per_second=53.2795M/s
LRUCacheLookup/8/1024                  1924 ns  1924 ns  367765  items_per_second=51.9831M/s
LRUCacheLookup/64/16                   2629 ns  2628 ns  264012  items_per_second=38.0487M/s
LRUCacheLookup/64/1024                 2671 ns  2671 ns  264135  items_per_second=37.4432M/s
MemoizeLRUCached/8/16                  4922 ns  4921 ns  141748  items_per_second=20.3205M/s
MemoizeLRUCached/8/1024                5688 ns  5687 ns  119853  items_per_second=17.5833M/s
MemoizeLRUCached/64/16                 5347 ns  5347 ns  129134  items_per_second=18.7031M/s
MemoizeLRUCached/64/1024               6318 ns  6318 ns  110366  items_per_second=15.829M/s
MemoizeLRUCachedThreadUnsafe/8/16      2156 ns  2156 ns  321327  items_per_second=46.3885M/s
MemoizeLRUCachedThreadUnsafe/8/1024    2165 ns  2165 ns  322995  items_per_second=46.1982M/s
MemoizeLRUCachedThreadUnsafe/64/16     3110 ns  3110 ns  225295  items_per_second=32.1552M/s
MemoizeLRUCachedThreadUnsafe/64/1024   3084 ns  3084 ns  225891  items_per_second=32.4268M/s
```
Force-pushed from aedc40e to 43ae445.
Done.