Skip to content

Conversation

@ffranr
Copy link
Contributor

@ffranr ffranr commented Nov 7, 2025

Issue: #1865


This PR introduces a size-based proof cache and supporting deep-size logic.

Summary

  • Replace proof-count cache limits with a byte-size limit.
  • Add human-readable size parsing in configuration.
  • Introduce deep size estimation (LowerBoundByteSize) and apply it to proof cache entries.
  • Update proof cache keys for finer-grained eviction.
  • Improve error handling and logging around cache initialization.

Note

It's pretty trivial to get from here to logging cache size, but we need lightninglabs/neutrino#330 for that.

@gemini-code-assist
Copy link

Summary of Changes

Hello @ffranr, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly refactors the tapdb proof caching mechanism by transitioning from a simple count-based eviction strategy to a more sophisticated size-based approach. This change aims to provide more efficient and predictable memory usage for the proof cache. It introduces a robust deep size estimation utility for Go objects and updates the configuration to be more user-friendly with human-readable size inputs. Additionally, cache keying and management logic have been refined for better performance and error handling.

Highlights

  • Size-Based Proof Cache: The proof cache now uses a byte-size limit instead of a count-based limit, allowing for more accurate memory management. The default size is set to 32MB.
  • Deep Size Estimation: A new utility function, LowerBoundByteSize, has been introduced in the fn package to provide a conservative deep-size estimate of Go objects, including handling pointers and cycles. This is crucial for the new size-based cache.
  • Human-Readable Configuration: The configuration for the maximum proof cache size now accepts human-readable values like '32MB' or '1GB', improving user experience.
  • Finer-Grained Cache Eviction: The proof cache keys have been updated to UniverseProofKey, which combines the universe ID and leaf key, enabling more granular eviction of cached proofs.
  • Improved Error Handling and Logging: Error handling around MultiverseStore initialization has been enhanced, and logging messages for cache operations have been made more descriptive.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Copy link

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces a significant improvement by switching the proof cache from count-based to size-based limits, incorporating deep size estimation for more accurate memory management. The changes also include finer-grained eviction strategies for cache entries and enhanced error handling and logging around cache initialization. The new LowerBoundByteSize function, utilizing reflect and unsafe, provides a conservative estimate of an object's memory footprint, which is crucial for the new cache mechanism. Comprehensive unit tests have been added for this new functionality, covering various data types and edge cases like pointer cycles. Configuration options have been updated to support human-readable size values for the cache limit. Overall, the changes align well with the stated objective of improving cache efficiency and control.

Introduce `LowerBoundByteSize` for deep-size estimation of various data
types, handling primitives, structs, slices, maps, and pointer cycles.
Add unit tests to ensure correctness and coverage.
@ffranr ffranr force-pushed the wip/limit-proof-cache-size branch from f273430 to e2c0aad Compare November 7, 2025 16:57
@ffranr ffranr changed the base branch from main to 0-8-0-staging November 7, 2025 16:57
- Replace `ProofKey` with `UniverseProofKey` for clearer cache key
  distinction, enabling removal of individual proofs by leaf key
  instead of all proofs for a universe ID.
- Refactor proof cache methods: add `RemoveLeafKeyProofs` and enhance
  `RemoveUniverseProofs` for more granular control.
- Implement deep size estimation in the `Size` method to improve
  memory management accuracy.
- Change cache size limit from number of proofs to total memory size
  of all proofs, as proof sizes can vary due to inclusion of parent
  proof files.
- Replace `proofs-per-universe` configuration with
  `max-proof-cache-size`, allowing proof cache limits to be defined by
  total memory size instead of proof count.
- Add logic to parse and handle human-readable size values
  (e.g., "32MB").
- Update default configuration and related comments.
- Adjust `NewMultiverseStore` to use parsed cache size values and
  improve error handling.
@ffranr ffranr force-pushed the wip/limit-proof-cache-size branch from e2c0aad to 7e26748 Compare November 7, 2025 16:58
@coveralls
Copy link

coveralls commented Nov 7, 2025

Pull Request Test Coverage Report for Build 19179362473

Details

  • 178 of 199 (89.45%) changed or added relevant lines in 6 files are covered.
  • 95 unchanged lines in 25 files lost coverage.
  • Overall coverage increased (+0.03%) to 56.558%

Changes Missing Coverage Covered Lines Changed/Added Lines %
tapcfg/server.go 2 4 50.0%
tapdb/multiverse.go 12 14 85.71%
fn/memory.go 89 93 95.7%
tapdb/multiverse_cache.go 66 79 83.54%
Files with Coverage Reduction New Missed Lines %
commitment/tap.go 1 85.65%
fn/context_guard.go 1 91.94%
proof/courier.go 1 78.88%
tapdb/interfaces.go 1 86.67%
tapdb/multiverse_cache.go 1 91.37%
asset/group_key.go 2 72.15%
mssmt/compacted_tree.go 2 78.57%
tapdb/mssmt.go 2 90.45%
tapdb/multiverse.go 2 80.69%
tapdb/sqlc/transfers.sql.go 2 83.33%
Totals Coverage Status
Change from base Build 18942913401: 0.03%
Covered Lines: 64443
Relevant Lines: 113942

💛 - Coveralls

Also, enhance the release notes template with a section to aid in
reporting breaking config changes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants