Skip to content

Data Deletion Issue with MERGE INTO ... WHEN MATCHED THEN DELETE statement in Iceberg 1.3 with Spark 3.4.1 #8126

Open
@alesk

Description

@alesk

Apache Iceberg version

1.3.0 (latest release)

Query engine

Spark

Environment:

Iceberg version: 1.3
Spark version: 3.4.1

Description:
When using the MERGE INTO ... WHEN MATCHED THEN DELETE statement to delete rows from an Iceberg table, an issue occurs if one of the remaining rows has a null value in any column. This issue does not happen when using Iceberg 1.3 with Spark 3.3.2. The table must be sorted to trigger the bug.

Steps to Reproduce:

Steps to reproduce with code and example are described in https://github.com/alesk/spark-bug-iceberg-delete

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions