feat(format): support recording list of index files and sizes#5545

Open
wjones127 wants to merge 3 commits into lance-format:main from wjones127:feat-format-index-file-sizes
@wjones127
Contributor

Add file sizes to IndexMetadata so we can optimize cold reads of indexes (avoid HEAD calls) and report index on-disk size without extra IO.

For discussion, see: #5456
For implementation, see: #5497

This will require a vote before it can be merged.
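
To make the two goals in the description concrete, here is a minimal Python sketch. It is not the Lance implementation. `IndexFile` mirrors the `path` and `size_bytes` fields of the message proposed in this PR; `total_index_size`, `file_size`, and the `head` callback are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class IndexFile:
    """Illustrative stand-in for the proposed protobuf message."""
    path: str        # path relative to the index directory
    size_bytes: int  # size of the file in bytes


def total_index_size(files: Sequence[IndexFile]) -> int:
    """Report on-disk index size from metadata alone, with no object-store IO."""
    return sum(f.size_bytes for f in files)


def file_size(
    files: Sequence[IndexFile],
    path: str,
    head: Callable[[str], int],
) -> int:
    """Return the recorded size if present; otherwise fall back to a HEAD call.

    `head` is a hypothetical callback that asks the object store for the size.
    The fallback covers indexes written before sizes were recorded.
    """
    for f in files:
        if f.path == path:
            return f.size_bytes
    return head(path)
```

With sizes recorded, a cold read can open each index file without first issuing a HEAD request; the fallback path is only taken when the metadata has no entry for the file.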

@github-actions github-actions bot added the enhancement (New feature or request) label Dec 18, 2025
@codecov
codecov bot commented Dec 18, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.

Comment on lines +292 to +296
message IndexFile {
  // Path relative to the index directory (e.g., "index.idx", "auxiliary.idx")
  string path = 1;
  // Size of the file in bytes
  uint64 size_bytes = 2;
Member

Should this be relative to the dataset URI and include a base_id? Otherwise how will this work if there are multiple bases?

Member

Oh wait, I guess we are just saying that "a single index cannot be broken across multiple bases" which seems like a reasonable constraint?
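
The constraint discussed in this thread can be sketched as follows: if a single index always lives under one base, each file only needs a path relative to the index directory, and the base is chosen once per index rather than once per file. The function name `resolve_index_files`, the example URI, and the `_indices/abc` directory below are assumptions for illustration, not taken from the PR.

```python
import posixpath
from typing import List, Sequence


def resolve_index_files(
    base_uri: str,
    index_dir: str,
    relative_paths: Sequence[str],
) -> List[str]:
    """Resolve index-relative paths against one base shared by the whole index.

    Because the index is assumed never to span multiple bases, `base_uri`
    is passed once; no per-file base identifier is needed.
    """
    root = posixpath.join(base_uri.rstrip("/"), index_dir)
    return [posixpath.join(root, p) for p in relative_paths]


# Hypothetical dataset location and index directory.
uris = resolve_index_files(
    "s3://bucket/dataset",
    "_indices/abc",
    ["index.idx", "auxiliary.idx"],
)
```

If an index could be split across bases, each entry would instead need its own base identifier, which is the alternative raised in the first comment.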

@wjones127 wjones127 marked this pull request as ready for review January 30, 2026 19:14
@wjones127 wjones127 requested a review from westonpace January 30, 2026 19:58
@wjones127
Contributor Author

Vote passed in #5757.

2 participants