Skip to content

Optimize object store accesses for the CSV scanner #21419

@ariel-miculas

Description

@ariel-miculas

Is your feature request related to a problem or challenge?

calculate_range issues two get_opts requests to find the beginning and the end newlines for a given file range, and once these newlines are found, there's another get_opts request for the specific range

Describe the solution you'd like

Basically reuse the AlignedBoundaryStream (#20823 ) implementation for CSV.

Describe alternatives you've considered

No response

Additional context

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request
    No fields configured for Feature.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions