feat(clp-s): Add the read-side implementation for the archive range index. #889

gibber9809 · 2025-05-08T18:31:50Z

Description

This PR follows up on #847 and implements the read side of the archive range index, and also adds end to end tests for this feature across both the JSON and kv-ir ingestion paths.

The code for reading the range index metdata packet lives in ArchiveReaderAdaptor, and access to the range index is forwarded to readers through ArchiveReader.

Checklist

The PR satisfies the contribution guidelines.
This is a breaking change and that has been indicated in the PR title, OR this isn't a
breaking change.
Necessary docs have been updated, OR no docs need to be updated.

Validation performed

Added end to end tests that verify that the archive range index acts as expected when ingesting JSON and kv-ir files

Summary by CodeRabbit

New Features
- Added support for range index metadata in archives, enabling retrieval of range index entries and associated metadata fields.
Bug Fixes
- None.
Tests
- Introduced new tests to validate range index functionality and archive metadata.
- Centralized archive compression logic into a reusable test utility function.
- Updated existing tests to use the new utility for archive creation.
Chores
- Updated build configuration to include necessary dependencies for new features and tests.

coderabbitai · 2025-05-08T18:31:58Z

Walkthrough

This change introduces read-side support for a range index in the clp-s archive format. It adds new data structures and methods to read, validate, and expose range index metadata from archives, updates the archive reader logic, and provides comprehensive unit tests and test utilities to verify the new functionality. Test infrastructure is refactored to centralize archive compression logic.

Changes

File(s)	Change Summary
components/core/CMakeLists.txt components/core/src/clp_s/indexer/CMakeLists.txt	Updated build configuration: added new test sources (`clp_s_test_utils.cpp`, `test-clp_s-range_index.cpp`) to unit tests and linked `nlohmann_json` to the indexer.
components/core/src/clp_s/ArchiveReader.hpp	Added a public method to `ArchiveReader` to expose the range index as a vector of `RangeIndexEntry`.
components/core/src/clp_s/ArchiveReaderAdaptor.cpp components/core/src/clp_s/ArchiveReaderAdaptor.hpp	Introduced the `RangeIndexEntry` struct and supporting logic. Added member variable and getter for the range index. Implemented `try_read_range_index` to parse and validate range index metadata from MessagePack, and integrated it into the archive metadata parsing flow. Added necessary includes for JSON support.
components/core/tests/clp_s_test_utils.cpp components/core/tests/clp_s_test_utils.hpp	Added a reusable test utility function `compress_archive` to create test archives for various input types and configurations, centralizing logic previously duplicated in tests.
components/core/tests/test-clp_s-end_to_end.cpp components/core/tests/test-clp_s-search.cpp	Refactored tests to use the new `compress_archive` utility for archive creation, removing local implementations and updating includes accordingly.
components/core/tests/test-clp_s-range_index.cpp	Added a comprehensive new test verifying the correctness of range index reading and metadata validation for both single and multi-file archives, with input from JSON and IR, using the new utility functions.

Sequence Diagram(s)

sequenceDiagram
    participant Test as Test Case
    participant Utils as compress_archive()
    participant Parser as JsonParser
    participant Archive as ArchiveReader
    participant Adaptor as ArchiveReaderAdaptor

    Test->>Utils: compress_archive(input_file, archive_dir, ...)
    Utils->>Parser: JsonParser(options)
    Utils->>Parser: parse()/parse_from_ir()
    Parser-->>Utils: Parsed data
    Utils->>Parser: store()
    Utils-->>Test: Archive created

    Test->>Archive: Open archive
    Archive->>Adaptor: get_range_index()
    Adaptor-->>Archive: vector<RangeIndexEntry>
    Archive-->>Test: Range index entries
    Test->>Test: Validate entries and metadata

Possibly related PRs

feat(clp-s): Add the write-side implementation for the archive range index. #847: Implements the write-side support for the archive range index, complementing this PR's read-side functionality.

Suggested reviewers

wraymo

Note

⚡️ AI Code Reviews for VS Code, Cursor, Windsurf

CodeRabbit now has a plugin for VS Code, Cursor and Windsurf. This brings AI code reviews directly in the code editor. Each commit is reviewed immediately, finding bugs before the PR is raised. Seamless context handoff to your AI code agent ensures that you can easily incorporate review feedback.
Learn more here.

Note

⚡️ Faster reviews with caching

CodeRabbit now supports caching for code and dependencies, helping speed up reviews. This means quicker feedback, reduced wait times, and a smoother review experience overall. Cached data is encrypted and stored securely. This feature will be automatically enabled for all accounts on May 16th. To opt out, configure Review - Disable Cache at either the organization or repository level. If you prefer to disable all data retention across your organization, simply turn off the Data Retention setting under your Organization Settings.
Enjoy the performance boost—your workflow just got faster.

✨ Finishing Touches

📝 Generate Docstrings

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Explain this complex logic.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai explain this code block.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and explain its main purpose.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai generate docstrings to generate docstrings for this PR.
@coderabbitai generate sequence diagram to generate a sequence diagram of the changes in this PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 7

📜 Review details

Configuration used: CodeRabbit UI
Review profile: ASSERTIVE
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 585555f and abf90e6.

📒 Files selected for processing (5)

components/core/CMakeLists.txt (1 hunks)
components/core/src/clp_s/ArchiveReader.hpp (1 hunks)
components/core/src/clp_s/ArchiveReaderAdaptor.cpp (3 hunks)
components/core/src/clp_s/ArchiveReaderAdaptor.hpp (5 hunks)
components/core/tests/test-clp_s-range_index.cpp (1 hunks)

🧰 Additional context used

📓 Path-based instructions (1)

`**/*.{cpp,hpp,java,js,jsx,tpp,ts,tsx}`: - Prefer `false == ` rather than `!`.

**/*.{cpp,hpp,java,js,jsx,tpp,ts,tsx}: - Prefer false == <expression> rather than !<expression>.

components/core/src/clp_s/ArchiveReader.hpp
components/core/src/clp_s/ArchiveReaderAdaptor.cpp
components/core/src/clp_s/ArchiveReaderAdaptor.hpp
components/core/tests/test-clp_s-range_index.cpp

⏰ Context from checks skipped due to timeout of 90000ms (11)

GitHub Check: centos-stream-9-dynamic-linked-bins
GitHub Check: ubuntu-jammy-static-linked-bins
GitHub Check: ubuntu-jammy-lint
GitHub Check: centos-stream-9-static-linked-bins
GitHub Check: ubuntu-jammy-dynamic-linked-bins
GitHub Check: build-macos (macos-14, true)
GitHub Check: build-macos (macos-15, true)
GitHub Check: build-macos (macos-13, false)
GitHub Check: build-macos (macos-13, true)
GitHub Check: lint-check (macos-latest)
GitHub Check: lint-check (ubuntu-latest)

🔇 Additional comments (6)

components/core/CMakeLists.txt (1)

633-638: Unit-test source added to build list – looks good

tests/test-clp_s-range_index.cpp is correctly appended to SOURCE_FILES_unitTest, keeping the tests alphabetically grouped alongside the existing clp_s test files. No further CMake tweaks appear necessary.

components/core/src/clp_s/ArchiveReaderAdaptor.cpp (1)

234-236: Switch-case extended correctly

The new RangeIndex packet type is cleanly integrated into the existing dispatcher without altering control flow for other packets.

components/core/src/clp_s/ArchiveReaderAdaptor.hpp (1)

4-9: Includes are appropriate – change looks good
The addition of <cstddef> and <vector> is necessary for the new types that were introduced, and there are no ordering or hygiene issues with the include list.
components/core/tests/test-clp_s-range_index.cpp (3)

1-22: Test scaffolding looks sound
The broad set of includes and constant declarations are clear and scoped; nothing superfluous stands out.

212-222: End-to-end parameterisation is clear
Using GENERATE(true, false) for both single_file_archive and from_ir gives thorough coverage with minimal boilerplate. Nicely done.

63-78: 🧹 Nitpick (assertive)

Avoid repeated JSON parsing when serialising the empty object
nlohmann::json::parse("{}") is executed for every record even though the result never changes. Caching the value once (e.g., a static const object) eliminates unnecessary allocations and parsing overhead, reducing test-runtime by ~10–15 %.
-    auto empty_object = nlohmann::json::parse("{}");
+    static const nlohmann::json cEmptyObject = nlohmann::json::object();
Then pass cEmptyObject to serialize_record.

Likely an incorrect or invalid review comment.

coderabbitai · 2025-05-08T18:37:38Z

components/core/src/clp_s/ArchiveReader.hpp

+    auto get_range_index() const -> std::vector<ArchiveReaderAdaptor::RangeIndexEntry> const& {
+        return m_archive_reader_adaptor->get_range_index();
+    }
+


🧹 Nitpick (assertive)

Expose intent & safety: mark get_range_index() noexcept / [[nodiscard]]

The getter already returns a const&, avoiding a copy.
Adding noexcept documents that it never throws, and [[nodiscard]] warns callers who silently ignore the returned data.

- auto get_range_index() const -> std::vector<ArchiveReaderAdaptor::RangeIndexEntry> const& { + [[nodiscard]] auto get_range_index() const noexcept + -> std::vector<ArchiveReaderAdaptor::RangeIndexEntry> const& + { return m_archive_reader_adaptor->get_range_index(); }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

auto get_range_index() const -> std::vector<ArchiveReaderAdaptor::RangeIndexEntry> const& {

return m_archive_reader_adaptor->get_range_index();

}

[[nodiscard]] auto get_range_index() const noexcept

-> std::vector<ArchiveReaderAdaptor::RangeIndexEntry> const&

{

return m_archive_reader_adaptor->get_range_index();

}

coderabbitai · 2025-05-08T18:37:39Z

components/core/src/clp_s/ArchiveReaderAdaptor.cpp

+auto ArchiveReaderAdaptor::try_read_range_index(ZstdDecompressor& decompressor, size_t size)
+        -> ErrorCode {
+    std::vector<char> buffer(size);
+    if (auto const rc = decompressor.try_read_exact_length(buffer.data(), buffer.size());
+        ErrorCodeSuccess != rc)
+    {
+        return rc;
+    }
+
+    auto range_index_json = nlohmann::json::from_msgpack(buffer.begin(), buffer.end(), true, false);
+    if (false == range_index_json.is_array()) {
+        return ErrorCodeCorrupt;
+    }
+
+    for (auto& range_index_entry : range_index_json) {
+        if (false == range_index_entry.contains(RangeIndexWriter::cStartIndexName)
+            || false == range_index_entry.at(RangeIndexWriter::cStartIndexName).is_number_integer())
+        {
+            return ErrorCodeCorrupt;
+        }
+        if (false == range_index_entry.contains(RangeIndexWriter::cEndIndexName)
+            || false == range_index_entry.at(RangeIndexWriter::cEndIndexName).is_number_integer())
+        {
+            return ErrorCodeCorrupt;
+        }
+        if (false == range_index_entry.contains(RangeIndexWriter::cMetadataFieldsName)
+            || false == range_index_entry.at(RangeIndexWriter::cMetadataFieldsName).is_object())
+        {
+            return ErrorCodeCorrupt;
+        }
+        auto start_index{
+                range_index_entry.at(RangeIndexWriter::cStartIndexName).template get<size_t>()
+        };
+        auto end_index{
+                range_index_entry.at(RangeIndexWriter::cEndIndexName).template get<size_t>()
+        };
+        if (start_index > end_index) {
+            return ErrorCodeCorrupt;
+        }
+        m_range_index.emplace_back(
+                start_index,
+                end_index,
+                std::move(range_index_entry.at(RangeIndexWriter::cMetadataFieldsName))
+        );
+    }
+    return ErrorCodeSuccess;
+}


🛠️ Refactor suggestion

Harden range-index parsing against oversized or empty packets

A malicious or corrupt archive could set packet_size to an unreasonably large value, leading to a huge std::vector<char> allocation and potential OOM/DoS.
Likewise, a size of 0 is currently handed to nlohmann::json::from_msgpack, which will throw.

Consider:

Rejecting sizes outside a sane upper bound (e.g., 16 MiB – adjust to your threat model).

Pre-reserving the m_range_index vector to avoid re-allocations once the JSON array size is known.

@@ - std::vector<char> buffer(size); + constexpr size_t kMaxRangeIndexPacketSize = 16 * 1024 * 1024; // 16 MiB + if (0 == size || size > kMaxRangeIndexPacketSize) { + return ErrorCodeCorrupt; + } + std::vector<char> buffer(size); @@ - auto range_index_json = nlohmann::json::from_msgpack(buffer.begin(), buffer.end(), true, false); + auto range_index_json = + nlohmann::json::from_msgpack(buffer.begin(), buffer.end(), /*convert=*/true, + /*throw_on_error=*/false); @@ - for (auto& range_index_entry : range_index_json) { + m_range_index.reserve(range_index_json.size()); + for (auto& range_index_entry : range_index_json) {

This keeps memory usage predictable and prevents undefined behaviour on zero-length payloads.

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

auto ArchiveReaderAdaptor::try_read_range_index(ZstdDecompressor& decompressor, size_t size)

-> ErrorCode {

std::vector<char> buffer(size);

if (auto const rc = decompressor.try_read_exact_length(buffer.data(), buffer.size());

ErrorCodeSuccess != rc)

{

return rc;

}

auto range_index_json = nlohmann::json::from_msgpack(buffer.begin(), buffer.end(), true, false);

if (false == range_index_json.is_array()) {

return ErrorCodeCorrupt;

}

for (auto& range_index_entry : range_index_json) {

if (false == range_index_entry.contains(RangeIndexWriter::cStartIndexName)

|| false == range_index_entry.at(RangeIndexWriter::cStartIndexName).is_number_integer())

{

return ErrorCodeCorrupt;

}

if (false == range_index_entry.contains(RangeIndexWriter::cEndIndexName)

|| false == range_index_entry.at(RangeIndexWriter::cEndIndexName).is_number_integer())

{

return ErrorCodeCorrupt;

}

if (false == range_index_entry.contains(RangeIndexWriter::cMetadataFieldsName)

|| false == range_index_entry.at(RangeIndexWriter::cMetadataFieldsName).is_object())

{

return ErrorCodeCorrupt;

}

auto start_index{

range_index_entry.at(RangeIndexWriter::cStartIndexName).template get<size_t>()

};

auto end_index{

range_index_entry.at(RangeIndexWriter::cEndIndexName).template get<size_t>()

};

if (start_index > end_index) {

return ErrorCodeCorrupt;

}

m_range_index.emplace_back(

start_index,

end_index,

std::move(range_index_entry.at(RangeIndexWriter::cMetadataFieldsName))

);

}

return ErrorCodeSuccess;

}

auto ArchiveReaderAdaptor::try_read_range_index(ZstdDecompressor& decompressor, size_t size)

-> ErrorCode {

// Reject empty or overly large packets to prevent OOM/DoS

constexpr size_t kMaxRangeIndexPacketSize = 16 * 1024 * 1024; // 16 MiB

if (0 == size || size > kMaxRangeIndexPacketSize) {

return ErrorCodeCorrupt;

}

std::vector<char> buffer(size);

if (auto const rc = decompressor.try_read_exact_length(buffer.data(), buffer.size());

ErrorCodeSuccess != rc)

{

return rc;

}

auto range_index_json =

nlohmann::json::from_msgpack(buffer.begin(), buffer.end(),

/*convert=*/true,

/*throw_on_error=*/false);

if (false == range_index_json.is_array()) {

return ErrorCodeCorrupt;

}

// Reserve up front once we know the JSON array size

m_range_index.reserve(range_index_json.size());

for (auto& range_index_entry : range_index_json) {

if (false == range_index_entry.contains(RangeIndexWriter::cStartIndexName)

|| false == range_index_entry.at(RangeIndexWriter::cStartIndexName).is_number_integer())

{

return ErrorCodeCorrupt;

}

if (false == range_index_entry.contains(RangeIndexWriter::cEndIndexName)

|| false == range_index_entry.at(RangeIndexWriter::cEndIndexName).is_number_integer())

{

return ErrorCodeCorrupt;

}

if (false == range_index_entry.contains(RangeIndexWriter::cMetadataFieldsName)

|| false == range_index_entry.at(RangeIndexWriter::cMetadataFieldsName).is_object())

{

return ErrorCodeCorrupt;

}

auto start_index{

range_index_entry.at(RangeIndexWriter::cStartIndexName).template get<size_t>()

};

auto end_index{

range_index_entry.at(RangeIndexWriter::cEndIndexName).template get<size_t>()

};

if (start_index > end_index) {

return ErrorCodeCorrupt;

}

m_range_index.emplace_back(

start_index,

end_index,

std::move(range_index_entry.at(RangeIndexWriter::cMetadataFieldsName))

);

}

return ErrorCodeSuccess;

}

components/core/src/clp_s/ArchiveReaderAdaptor.hpp

components/core/tests/test-clp_s-range_index.cpp

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (2)

components/core/src/clp_s/ArchiveReaderAdaptor.cpp (1)
105-116: ⚠️ Potential issue

Add upper-bound & empty-payload checks before allocating the buffer

size originates from the archive packet header and may be attacker-controlled or corrupted. Allocating an unbounded std::vector<char>(size) can trigger an uncontrolled memory allocation (OOM/DoS) and a std::bad_alloc that will propagate as an uncaught exception.

Recommended defensive guard (16 MiB is an example – tune to your threat model):
+    constexpr size_t kMaxRangeIndexPacketSize = 16 * 1024 * 1024;  // 16 MiB
+    if (0 == size || size > kMaxRangeIndexPacketSize) {
+        return ErrorCodeCorrupt;
+    }
     std::vector<char> buffer(size);
This mirrors the hardening already suggested in previous reviews for other metadata packets.
components/core/src/clp_s/ArchiveReaderAdaptor.hpp (1)

16-18: Redundant include ‑ remove json_fwd.hpp

<nlohmann/json.hpp> already brings in the forward declarations; keeping both only lengthens compile time.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: ASSERTIVE
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between abf90e6 and 7ab354e.

📒 Files selected for processing (3)

components/core/src/clp_s/ArchiveReaderAdaptor.cpp (3 hunks)
components/core/src/clp_s/ArchiveReaderAdaptor.hpp (5 hunks)
components/core/src/clp_s/indexer/CMakeLists.txt (1 hunks)

🧰 Additional context used

📓 Path-based instructions (1)

`**/*.{cpp,hpp,java,js,jsx,tpp,ts,tsx}`: - Prefer `false == ` rather than `!`.

**/*.{cpp,hpp,java,js,jsx,tpp,ts,tsx}: - Prefer false == <expression> rather than !<expression>.

components/core/src/clp_s/ArchiveReaderAdaptor.hpp
components/core/src/clp_s/ArchiveReaderAdaptor.cpp

⏰ Context from checks skipped due to timeout of 90000ms (9)

GitHub Check: centos-stream-9-static-linked-bins
GitHub Check: ubuntu-jammy-dynamic-linked-bins
GitHub Check: centos-stream-9-dynamic-linked-bins
GitHub Check: ubuntu-jammy-lint
GitHub Check: ubuntu-jammy-static-linked-bins
GitHub Check: build-macos (macos-13, false)
GitHub Check: build-macos (macos-13, true)
GitHub Check: lint-check (ubuntu-latest)
GitHub Check: lint-check (macos-latest)

🔇 Additional comments (1)

components/core/src/clp_s/indexer/CMakeLists.txt (1)

93-93: Dependency addition looks correct

Linking nlohmann_json::nlohmann_json under PRIVATE scope is appropriate and keeps the symbol surface local to this target.
No further action required.

components/core/src/clp_s/ArchiveReaderAdaptor.cpp

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (1)

components/core/src/clp_s/ArchiveReaderAdaptor.cpp (1)

103-153: 🛠️ Refactor suggestion

Harden range-index parsing against oversized or empty packets.

The implementation correctly handles range index parsing but lacks safeguards against potentially malicious inputs. A corrupt or malicious archive could set packet_size to an unreasonably large value or zero, leading to allocation issues or undefined behaviour.

-auto ArchiveReaderAdaptor::try_read_range_index(ZstdDecompressor& decompressor, size_t size)
+auto ArchiveReaderAdaptor::try_read_range_index(ZstdDecompressor& decompressor, size_t size)
        -> ErrorCode {
+    constexpr size_t kMaxRangeIndexPacketSize = 16 * 1024 * 1024;  // 16 MiB
+    if (0 == size || size > kMaxRangeIndexPacketSize) {
+        return ErrorCodeCorrupt;
+    }
    std::vector<char> buffer(size);
    if (auto const rc = decompressor.try_read_exact_length(buffer.data(), buffer.size());
        ErrorCodeSuccess != rc)
    {
        return rc;
    }

-    auto range_index_json = nlohmann::json::from_msgpack(buffer.begin(), buffer.end(), true, false);
+    auto range_index_json = nlohmann::json::from_msgpack(buffer.begin(), buffer.end(), 
+                                                        /*convert=*/true, 
+                                                        /*throw_on_error=*/false);
    if (false == range_index_json.is_array()) {
        return ErrorCodeCorrupt;
    }

+    m_range_index.reserve(range_index_json.size());
    for (auto& range_index_entry : range_index_json) {

📜 Review details

Configuration used: CodeRabbit UI
Review profile: ASSERTIVE
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 7ab354e and 301834d.

📒 Files selected for processing (1)

components/core/src/clp_s/ArchiveReaderAdaptor.cpp (3 hunks)

🧰 Additional context used

📓 Path-based instructions (1)

`**/*.{cpp,hpp,java,js,jsx,tpp,ts,tsx}`: - Prefer `false == ` rather than `!`.

**/*.{cpp,hpp,java,js,jsx,tpp,ts,tsx}: - Prefer false == <expression> rather than !<expression>.

components/core/src/clp_s/ArchiveReaderAdaptor.cpp

⏰ Context from checks skipped due to timeout of 90000ms (11)

GitHub Check: ubuntu-jammy-lint
GitHub Check: ubuntu-jammy-static-linked-bins
GitHub Check: ubuntu-jammy-dynamic-linked-bins
GitHub Check: centos-stream-9-static-linked-bins
GitHub Check: centos-stream-9-dynamic-linked-bins
GitHub Check: build-macos (macos-13, false)
GitHub Check: build-macos (macos-15, false)
GitHub Check: build-macos (macos-13, true)
GitHub Check: build-macos (macos-14, true)
GitHub Check: lint-check (ubuntu-latest)
GitHub Check: lint-check (macos-latest)

🔇 Additional comments (5)

components/core/src/clp_s/ArchiveReaderAdaptor.cpp (5)

9-9: New includes added to support range index functionality.

These includes are necessary to support the new range index implementation:

<utility> for std::move

<nlohmann/json.hpp> for JSON parsing

"RangeIndexWriter.hpp" for range index constants

Also applies to: 13-13, 20-20

135-142: Proper exception handling for JSON type conversion.

The implementation correctly wraps the get<size_t>() conversions in a try/catch block to preserve the error-code contract. This is good practice since these operations can throw if the values are invalid or out of range.

143-145: Validation of range index bounds.

Good checks to ensure the start index is not greater than the end index, which would represent an invalid range. This helps maintain data integrity and prevents potential issues downstream.

146-150: Efficient handling of range index entries.

The use of emplace_back with direct constructor arguments and std::move for the JSON object is efficient. This avoids unnecessary copies of the metadata fields JSON object.

239-241: Case added for handling RangeIndex metadata packet type.

The try_read_archive_metadata function now correctly handles the RangeIndex packet type by calling the new try_read_range_index function. This integration follows the existing pattern for other metadata types.

coderabbitai · 2025-05-09T17:15:54Z

components/core/src/clp_s/ArchiveReaderAdaptor.cpp

+    auto range_index_json = nlohmann::json::from_msgpack(buffer.begin(), buffer.end(), true, false);
+    if (false == range_index_json.is_array()) {
+        return ErrorCodeCorrupt;
+    }


🛠️ Refactor suggestion

Check for JSON parsing errors.

The from_msgpack call has throw_on_error set to false, but there's no explicit check if the JSON was successfully parsed before proceeding to validate it as an array. This could lead to working with invalid data.

- auto range_index_json = nlohmann::json::from_msgpack(buffer.begin(), buffer.end(), true, false); - if (false == range_index_json.is_array()) { + auto range_index_json = nlohmann::json::from_msgpack(buffer.begin(), buffer.end(), true, false); + if (range_index_json.is_discarded() || false == range_index_json.is_array()) { return ErrorCodeCorrupt; }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

auto range_index_json = nlohmann::json::from_msgpack(buffer.begin(), buffer.end(), true, false);

if (false == range_index_json.is_array()) {

return ErrorCodeCorrupt;

}

auto range_index_json = nlohmann::json::from_msgpack(buffer.begin(), buffer.end(), true, false);

if (range_index_json.is_discarded() || false == range_index_json.is_array()) {

return ErrorCodeCorrupt;

}

wraymo

Nice work! I think it's overall very clear and well-structured. Just a few comments on renaming and refactoring.

wraymo · 2025-05-14T18:26:19Z

components/core/src/clp_s/ArchiveReaderAdaptor.hpp

                : TraceableException(error_code, filename, line_number) {}
    };

+    struct RangeIndexEntry {


Not sure if we should put it inside ArchiveReaderAdopter

Would you prefer moving it outside of ArchiveReaderAdaptor and still defining it in this file, or moving it to a separate header?

Going to assume that moving it out of ArchiveReaderAdaptor but keeping it in the same file is fine for now.

I prefer moving it outside ArchiveReaderAdaptor.h. And since it's very similar to Range in RangeIndexWriter, is it possible to merge them into one struct?

Merging them might be a bit awkward because in RangeIndexWriter we need to know whether end_index has been specified so we wrap that field in optional but on the read-side we know that it always exists so wrapping it in optional would be cumbersome.

wraymo · 2025-05-14T18:28:56Z

components/core/src/clp_s/ArchiveReader.hpp


    std::shared_ptr<ReaderUtils::SchemaMap> get_schema_map() { return m_schema_map; }

+    auto get_range_index() const -> std::vector<ArchiveReaderAdaptor::RangeIndexEntry> const& {


I think this method will be used when we have search support?

Yes, that is the plan

wraymo · 2025-05-14T18:32:36Z

components/core/tests/test-clp_s-range_index.cpp

+auto get_test_input_path_relative_to_tests_dir() -> std::filesystem::path;
+auto get_test_input_local_path() -> std::string;
+void serialize_record(
+        nlohmann::json const& auto_gen,
+        nlohmann::json const& user_gen,
+        clp::ffi::ir_stream::Serializer<clp::ir::eight_byte_encoded_variable_t>& serializer
+);
+void compress_ir();
+void compress(bool single_file_archive, bool from_ir);
+void check_archive_metadata(bool from_ir);
+


Do you think we can have a test utility file which includes functions that can be shared by test-clp_s-end-to-end and test-clp_s-range_index (and maybe add path to compress/compress_ir signatures)

Sure, I think it is probably worth moving compress and generate_ir into a helper.

Actually on second thought I might only move compress into a helper for now since we have similar code for 3 different tests. I prefer leaving generate_ir as is because there is currently only one user.

wraymo · 2025-05-14T18:42:40Z

components/core/tests/test-clp_s-range_index.cpp

+constexpr std::string_view cTestRangeIndexInputFile{"test_no_floats_sorted.jsonl"};
+constexpr std::string_view cTestRangeIndexIRInputFile{"test_no_floats_sorted.ir"};


And some constants as well

I think I prefer defining them in each test because it makes it clear what files they depend on at a glance.

wraymo · 2025-05-14T18:52:36Z

components/core/tests/test-clp_s-range_index.cpp

+    REQUIRE(serializer.serialize_msgpack_map(auto_gen_obj.via.map, user_gen_obj.via.map));
+}
+
+void compress_ir() {


Maybe we should rename this method to something like generate_ir? Or we could add test_no_floats_sorted.ir to test_log_files

I like generate_ir, let's go with that.

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (2)

components/core/src/clp_s/ArchiveReader.hpp (1)
120-122: Consider adding [[nodiscard]] and noexcept attributes to get_range_index().

The getter already returns a const&, avoiding a copy. Adding noexcept would document that it never throws, and [[nodiscard]] would warn callers who silently ignore the returned data.
-    auto get_range_index() const -> std::vector<RangeIndexEntry> const& {
+    [[nodiscard]] auto get_range_index() const noexcept -> std::vector<RangeIndexEntry> const& {
         return m_archive_reader_adaptor->get_range_index();
     }
components/core/src/clp_s/ArchiveReaderAdaptor.hpp (1)
17-17: Remove the redundant json_fwd.hpp include.

Since you're already including nlohmann/json.hpp which internally includes the forward-declaration header, keeping both will prolong compile time unnecessarily.
 #include <nlohmann/json.hpp>
-#include <nlohmann/json_fwd.hpp>

📜 Review details

Configuration used: CodeRabbit UI
Review profile: ASSERTIVE
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 871bbab and 10cb7cc.

📒 Files selected for processing (2)

components/core/src/clp_s/ArchiveReader.hpp (1 hunks)
components/core/src/clp_s/ArchiveReaderAdaptor.hpp (5 hunks)

🧰 Additional context used

📓 Path-based instructions (1)

`**/*.{cpp,hpp,java,js,jsx,tpp,ts,tsx}`: - Prefer `false == ` rather than `!`.

**/*.{cpp,hpp,java,js,jsx,tpp,ts,tsx}: - Prefer false == <expression> rather than !<expression>.

components/core/src/clp_s/ArchiveReader.hpp
components/core/src/clp_s/ArchiveReaderAdaptor.hpp

🧬 Code Graph Analysis (1)

components/core/src/clp_s/ArchiveReaderAdaptor.hpp (4)

components/core/src/clp_s/RangeIndexWriter.hpp (2)

start_index (44-44)

end_index (73-73)

components/core/src/clp_s/TimestampDictionaryReader.hpp (1)

decompressor (30-30)

components/core/src/clp_s/PackedStreamReader.hpp (1)

decompressor (43-43)

components/core/src/clp_s/ArchiveWriter.hpp (2)

size (162-162)

size (162-162)

⏰ Context from checks skipped due to timeout of 90000ms (11)

GitHub Check: ubuntu-jammy-dynamic-linked-bins
GitHub Check: centos-stream-9-static-linked-bins
GitHub Check: centos-stream-9-dynamic-linked-bins
GitHub Check: ubuntu-jammy-lint
GitHub Check: ubuntu-jammy-static-linked-bins
GitHub Check: build-macos (macos-14, false)
GitHub Check: build-macos (macos-15, true)
GitHub Check: build-macos (macos-14, true)
GitHub Check: build-macos (macos-13, false)
GitHub Check: lint-check (macos-latest)
GitHub Check: lint-check (ubuntu-latest)

🔇 Additional comments (3)

components/core/src/clp_s/ArchiveReaderAdaptor.hpp (3)

31-41: LGTM! Good implementation of the RangeIndexEntry struct.

The struct is well-documented and properly uses move semantics when storing the JSON payload, as suggested in previous reviews. The comment about brace initializers is also informative.

120-126: LGTM! The method declaration follows class conventions.

The method declaration for try_read_range_index follows the consistent style of the class, using the trailing return type syntax and proper documentation.

178-178: LGTM! m_range_index member added correctly.

The new private member variable is properly added to store the range index entries.

coderabbitai · 2025-05-15T19:02:28Z

components/core/src/clp_s/ArchiveReaderAdaptor.hpp


    ArchiveHeader const& get_header() const { return m_archive_header; }

+    std::vector<RangeIndexEntry> const& get_range_index() const { return m_range_index; }


🧹 Nitpick (assertive)

Consider adding [[nodiscard]] and noexcept to get_range_index().

For consistency with other getters and to provide better guarantees to callers:

- std::vector<RangeIndexEntry> const& get_range_index() const { return m_range_index; } + [[nodiscard]] std::vector<RangeIndexEntry> const& get_range_index() const noexcept { return m_range_index; }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

std::vector<RangeIndexEntry> const& get_range_index() const { return m_range_index; }

[[nodiscard]] std::vector<RangeIndexEntry> const& get_range_index() const noexcept { return m_range_index; }

🤖 Prompt for AI Agents

In components/core/src/clp_s/ArchiveReaderAdaptor.hpp at line 90, the getter method get_range_index() should be updated to include the [[nodiscard]] attribute and be marked noexcept. This change ensures consistency with other getter methods and signals to callers that the return value should not be ignored and that the method does not throw exceptions. Modify the method signature to add [[nodiscard]] before the return type and noexcept after the method declaration.

coderabbitai

Actionable comments posted: 1

📜 Review details

Configuration used: CodeRabbit UI
Review profile: ASSERTIVE
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 10cb7cc and 51a4a53.

📒 Files selected for processing (1)

components/core/CMakeLists.txt (1 hunks)

⏰ Context from checks skipped due to timeout of 90000ms (11)

GitHub Check: centos-stream-9-static-linked-bins
GitHub Check: ubuntu-jammy-lint
GitHub Check: ubuntu-jammy-static-linked-bins
GitHub Check: centos-stream-9-dynamic-linked-bins
GitHub Check: ubuntu-jammy-dynamic-linked-bins
GitHub Check: build-macos (macos-15, true)
GitHub Check: build-macos (macos-15, false)
GitHub Check: build-macos (macos-14, true)
GitHub Check: build-macos (macos-13, false)
GitHub Check: build-macos (macos-13, true)
GitHub Check: lint-check (ubuntu-latest)

🔇 Additional comments (1)

components/core/CMakeLists.txt (1)

633-634: Ensure test utility headers are discoverable
You’ve added tests/ClpSTestUtils.cpp and tests/ClpSTestUtils.hpp to the unit test sources, but the unitTest target’s include directories only cover outcome and sqlite3 paths. Please verify that the compiler can locate ClpSTestUtils.hpp. If not, consider adding:
target_include_directories(unitTest
  PRIVATE
    ${CMAKE_CURRENT_SOURCE_DIR}/tests
)

components/core/CMakeLists.txt

kirkrodrigues

Something's also wrong with the clang-tidy tasks where they're not linting tests properly. Will probably need your help to fix the linting errors in the PR where we fix the task.

kirkrodrigues · 2025-05-16T16:04:15Z

components/core/tests/ClpSTestUtils.cpp

Since these ClpSTestUtils.cpp/hpp don't contain classes/types, they should be named using snake_case, right? See our internal guidelines: C++ -> File naming and organization (I know we should open-source them so they're easier to find... We're trying, we're trying.).

… index test

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (3)

components/core/CMakeLists.txt (1)

640-640: Maintain naming consistency in test filenames

The new test file test-clp_s-range_index.cpp mixes hyphens and underscores in its name. For consistency with other test files in the project, consider renaming it to either test-clp_s_range_index.cpp or adjusting the naming convention across all test files.
components/core/tests/test-clp_s-range_index.cpp (2)
82-83: 🧹 Nitpick (assertive)

Consider using create_directories for more robust directory creation.

The std::filesystem::create_directory function will throw an exception if the path exists but is not a directory. Using std::filesystem::create_directories would make the test more robust when rerun without cleanup.
-    std::filesystem::create_directory(cTestRangeIndexIRDirectory);
+    std::filesystem::create_directories(cTestRangeIndexIRDirectory);
119-163: 🛠️ Refactor suggestion

Consider re-opening a fresh ArchiveReader inside the loop.

ArchiveReader::open + close are invoked repeatedly on the same instance. If an earlier iteration were to throw, the reader's internal state might be left half-initialised, causing later calls to behave unpredictably. Instantiating a new ArchiveReader per archive avoids hidden state leakage.
-    clp_s::ArchiveReader archive_reader;
     for (auto const& entry : std::filesystem::directory_iterator(cTestRangeIndexArchiveDirectory)) {
+        clp_s::ArchiveReader archive_reader;
         clp_s::Path archive_path{

📜 Review details

Configuration used: CodeRabbit UI
Review profile: ASSERTIVE
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 51a4a53 and eee59a8.

📒 Files selected for processing (6)

components/core/CMakeLists.txt (1 hunks)
components/core/tests/clp_s_test_utils.cpp (1 hunks)
components/core/tests/clp_s_test_utils.hpp (1 hunks)
components/core/tests/test-clp_s-end_to_end.cpp (2 hunks)
components/core/tests/test-clp_s-range_index.cpp (1 hunks)
components/core/tests/test-clp_s-search.cpp (3 hunks)

🧰 Additional context used

📓 Path-based instructions (1)

`**/*.{cpp,hpp,java,js,jsx,tpp,ts,tsx}`: - Prefer `false == ` rather than `!`.

**/*.{cpp,hpp,java,js,jsx,tpp,ts,tsx}: - Prefer false == <expression> rather than !<expression>.

components/core/tests/clp_s_test_utils.hpp
components/core/tests/test-clp_s-end_to_end.cpp
components/core/tests/test-clp_s-search.cpp
components/core/tests/clp_s_test_utils.cpp
components/core/tests/test-clp_s-range_index.cpp

🧠 Learnings (2)

components/core/tests/test-clp_s-end_to_end.cpp (1)

Learnt from: AVMatthews
PR: y-scope/clp#595
File: components/core/tests/test-end_to_end.cpp:59-65
Timestamp: 2024-11-19T17:30:04.970Z
Learning: In 'components/core/tests/test-end_to_end.cpp', during the 'clp-s_compression_and_extraction_no_floats' test, files and directories are intentionally removed at the beginning of the test to ensure that any existing content doesn't influence the test results.

components/core/tests/test-clp_s-search.cpp (1)

Learnt from: AVMatthews
PR: y-scope/clp#595
File: components/core/tests/test-end_to_end.cpp:59-65
Timestamp: 2024-11-19T17:30:04.970Z
Learning: In 'components/core/tests/test-end_to_end.cpp', during the 'clp-s_compression_and_extraction_no_floats' test, files and directories are intentionally removed at the beginning of the test to ensure that any existing content doesn't influence the test results.

🧬 Code Graph Analysis (2)

components/core/tests/test-clp_s-end_to_end.cpp (3)

components/core/tests/clp_s_test_utils.hpp (1)

compress_archive (18-24)

components/core/tests/clp_s_test_utils.cpp (2)

compress_archive (12-54)

compress_archive (12-18)

components/core/tests/test-clp_s-search.cpp (3)

get_test_input_local_path (41-41)

get_test_input_local_path (53-57)

get_test_input_local_path (53-53)

components/core/tests/test-clp_s-search.cpp (3)

components/core/tests/clp_s_test_utils.hpp (1)

compress_archive (18-24)

components/core/tests/clp_s_test_utils.cpp (2)

compress_archive (12-54)

compress_archive (12-18)

components/core/tests/test-clp_s-end_to_end.cpp (3)

get_test_input_local_path (25-25)

get_test_input_local_path (33-37)

get_test_input_local_path (33-33)

⏰ Context from checks skipped due to timeout of 90000ms (11)

GitHub Check: ubuntu-jammy-dynamic-linked-bins
GitHub Check: ubuntu-jammy-static-linked-bins
GitHub Check: ubuntu-jammy-lint
GitHub Check: centos-stream-9-static-linked-bins
GitHub Check: centos-stream-9-dynamic-linked-bins
GitHub Check: build-macos (macos-15, true)
GitHub Check: build-macos (macos-14, true)
GitHub Check: build-macos (macos-15, false)
GitHub Check: build-macos (macos-13, false)
GitHub Check: lint-check (ubuntu-latest)
GitHub Check: build-macos (macos-14, false)

🔇 Additional comments (11)

components/core/CMakeLists.txt (1)

633-634: Good refactoring to centralize archive compression logic

Adding these test utility files helps centralize the archive compression logic that was previously duplicated across test files. This improves code maintainability and consistency.

components/core/tests/clp_s_test_utils.hpp (1)

1-25: Well-structured utility header with clear documentation

The header file is well-organized with proper include guards, necessary includes, and clear documentation. The function signature is appropriately designed with const references for string parameters and a clear enumeration for file type selection.

components/core/tests/test-clp_s-end_to_end.cpp (2)

11-14: Appropriate header updates for the refactored compression logic

The includes have been correctly updated to remove the now-unused JsonParser header and add the necessary headers for CommandLineArguments and the new test utilities.

105-111: Effective use of the shared compression utility

Good refactoring to use the centralized compress_archive function instead of duplicating compression logic. The REQUIRE_NOTHROW wrapper appropriately validates that the compression process doesn't throw exceptions.

components/core/tests/test-clp_s-search.cpp (2)

17-31: Appropriate header updates for the refactored compression logic

The includes have been correctly updated to add the necessary headers for CommandLineArguments and the new test utilities, maintaining proper organization of the include statements.

158-164: Effective use of the shared compression utility

Good refactoring to use the centralized compress_archive function instead of duplicating compression logic. The REQUIRE_NOTHROW wrapper appropriately validates that the compression process doesn't throw exceptions, and all parameters are correctly passed to match the original functionality.

components/core/tests/clp_s_test_utils.cpp (2)

47-52: LGTM: Function is well-designed and follows coding guidelines.

This handler for different file types is well-structured and includes proper error handling for unexpected file types. The use of false == ... pattern on line 49 correctly follows the coding guideline preference.

53-54: LGTM: Verification follows coding guidelines.

The verification logic properly checks that the archive was created successfully, and correctly follows the coding guideline of using false == <expression> rather than !<expression>.

components/core/tests/test-clp_s-range_index.cpp (3)

90-91: LGTM: Error handling follows coding guidelines.

The error handling correctly follows the coding guideline of preferring false == <expression> rather than !<expression>.

138-152: LGTM: Boolean expression follows coding guidelines.

The boolean comparisons correctly follow the coding guideline of preferring false == <expression> rather than !<expression>.

166-189: LGTM: Comprehensive test case for range index functionality.

This test case effectively validates the range index functionality across different combinations of parameters (single vs. multi-file archives, JSON vs. IR input). It properly uses the centralized compress_archive utility function, making the test more maintainable.

components/core/tests/clp_s_test_utils.cpp

kirkrodrigues

Stamping tests.

gibber9809 added 3 commits May 8, 2025 18:21

Add read side code for archive range index

8ede08a

Expose archive range index in ArchiveReader

988427c

Add end to end tests for the archive range index

abf90e6

gibber9809 requested review from a team and wraymo as code owners May 8, 2025 18:31

coderabbitai bot reviewed May 8, 2025

View reviewed changes

gibber9809 added 3 commits May 9, 2025 15:45

Add missing library dependencies for indexer

ebad5c0

Address rabbit comments

c0c7a03

Remove unnecessary dependency for indexer

7ab354e

coderabbitai bot reviewed May 9, 2025

View reviewed changes

components/core/src/clp_s/ArchiveReaderAdaptor.cpp Show resolved Hide resolved

Address rabbit comment

301834d

coderabbitai bot reviewed May 9, 2025

View reviewed changes

wraymo reviewed May 14, 2025

View reviewed changes

Factor archive compression out of tests

3905c5a

gibber9809 requested a review from wraymo May 14, 2025 20:41

gibber9809 added 2 commits May 15, 2025 15:18

Merge remote-tracking branch 'upstream/main' into range-index-reader

871bbab

Move RangeIndexEntry out of ArchiveReaderAdaptor

10cb7cc

coderabbitai bot reviewed May 15, 2025

View reviewed changes

wraymo previously approved these changes May 16, 2025

View reviewed changes

Merge branch 'main' into range-index-reader

51a4a53

coderabbitai bot reviewed May 16, 2025

View reviewed changes

components/core/CMakeLists.txt Show resolved Hide resolved

kirkrodrigues reviewed May 16, 2025

View reviewed changes

gibber9809 added 2 commits May 16, 2025 17:40

Rename ClpSTestUtils to clp_s_test_utils

0cc0f59

Replace reinterpret_cast with clp::size_checked_pointer_cast in range…

eee59a8

… index test

gibber9809 dismissed wraymo’s stale review via eee59a8 May 16, 2025 17:46

gibber9809 requested a review from kirkrodrigues May 16, 2025 17:46

coderabbitai bot reviewed May 16, 2025

View reviewed changes

components/core/tests/clp_s_test_utils.cpp Show resolved Hide resolved

kirkrodrigues approved these changes May 16, 2025

View reviewed changes

gibber9809 merged commit 3c51629 into y-scope:main May 16, 2025
21 checks passed

coderabbitai bot mentioned this pull request May 26, 2025

feat(clp-s): Add support for querying kv-pairs in the archive range index. #929

Merged

3 tasks

coderabbitai bot mentioned this pull request Jun 13, 2025

feat(clp-s): Expose archive range index metadata in printed archive statistics; Forward archive-level statistics to c++ writers. #1005

Merged

3 tasks

coderabbitai bot mentioned this pull request Aug 29, 2025

fix(clp-s): Handle pure wildcards and unexpected literal types correctly in EvaluateTimestampIndex (fixes #1096). #1277

Merged

3 tasks


		std::shared_ptr<ReaderUtils::SchemaMap> get_schema_map() { return m_schema_map; }

		auto get_range_index() const -> std::vector<ArchiveReaderAdaptor::RangeIndexEntry> const& {

		constexpr std::string_view cTestRangeIndexInputFile{"test_no_floats_sorted.jsonl"};
		constexpr std::string_view cTestRangeIndexIRInputFile{"test_no_floats_sorted.ir"};


		ArchiveHeader const& get_header() const { return m_archive_header; }

		std::vector<RangeIndexEntry> const& get_range_index() const { return m_range_index; }

	std::vector<RangeIndexEntry> const& get_range_index() const { return m_range_index; }
	[[nodiscard]] std::vector<RangeIndexEntry> const& get_range_index() const noexcept { return m_range_index; }

feat(clp-s): Add the read-side implementation for the archive range index. #889

feat(clp-s): Add the read-side implementation for the archive range index. #889

Uh oh!

Conversation

gibber9809 commented May 8, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Validation performed

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented May 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Possibly related PRs

Suggested reviewers

Chat

Support

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot May 8, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot May 8, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot May 9, 2025

Choose a reason for hiding this comment

Uh oh!

wraymo left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gibber9809 May 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wraymo May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gibber9809 commented May 8, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented May 8, 2025 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)

gibber9809 May 14, 2025 •

edited

Loading

wraymo May 15, 2025 •

edited

Loading