Polars version checks
I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of Polars.
Issue description
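When reading a Parquet file with `pl.read_parquet`, the `rechunk` parameter is not respected: with `rechunk=True` the resulting DataFrame still has one chunk per row group instead of a single chunk.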
Reproducible example
import numpy as np
import polars as pl
from pyarrow.parquet import read_metadata

a = np.random.random((200, 5))

# We expect there to be fifty chunks
pl.DataFrame(a).write_parquet('tmp.pq', row_group_size=4)

# Should output `num_row_groups: 50`
print(read_metadata('tmp.pq'))

df = pl.read_parquet('tmp.pq', rechunk=True)

# We expect 1 but we get 50
print(df.n_chunks())

# We expect 1 and get 1
print(df.rechunk().n_chunks())
Expected behavior
With `rechunk=True`, `df.n_chunks()` should return 1 directly after `pl.read_parquet('tmp.pq', rechunk=True)`, without a separate call to `df.rechunk()`.
Installed versions