Skip to content

i32 limit in JSON stats #2646

Closed
Closed
@alfredolainez

Description

@alfredolainez

Environment

Delta-rs version: 0.18.2

Binding: Python

Environment:

  • Cloud provider: AWS
  • OS: Amazon Linux 2

Bug

What happened:

When reading a DeltaLake table from Polars using pl.read_delta, I get the following error:

DeltaProtocolError: Invalid JSON in file stats: invalid value: integer 4051124561, expected i32 at line 1 column 70

which ultimately comes from deltalake:

image

What you expected to happen: My code in Python can successfully read other tables, it is just this particular table that throws this problem. The particular table where this happens is frequently accessed and I can read it successfully through Spark, so I was expecting to read it as well from deltalake. Not sure if the int32 limitation is part of the protocol or the library should allow for bigger int types.

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions