Spark aggreation by partition could use metadata files

Hello everybody,
I have a apache iceberg table in aws glue, this table is partitioned by string year-month.
When I do a spark.sql("select count(1),partition_field from table group by partition_field"). Spark goes through every file and perform the count. Cant spark engine use just the metadata files as each underlying data file just contains data from one partition.
(I dont have any delete file)

Thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spark aggreation by partition could use metadata files #11394

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Spark aggreation by partition could use metadata files #11394

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions