-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Description
Describe the bug
- While working on Upgrade to arrow 56.1.0 datafusion#17275
I am trying to show that when we disable the cache by setting the max size to zero, the cache doesn't pull get used. To do this I was using the records_read_from_cache metric
To my surprise, it reported rows being read from the cache even when the cache was disabled
I found that the metric reports rows that were read from the "local" cache in addition to the actual global cache:
Specifically, in this code:
https://github.com/apache/arrow-rs/blob/main/parquet/src/arrow/array_reader/cached_array_reader.rs#L202-L235
To Reproduce
I set the max_predicate_cache_size size to zero (disable the cache), and the looked at this metric:
Expected behavior
I expect only rows read from the "global cache" (that may consume memory) to be included in the metrics
Additional context