
[Bug] When Spark writes data to the Paimon table, data is lost due to some task retries #4831

Closed
@xyk0930

Description

Search before asking

  • I searched in the issues and found nothing similar.

Paimon version

0.9

Compute Engine

Spark 3.5.1

Minimal reproduce step

  1. Save data to the Paimon table (a minimal standalone sketch follows this list):
    dataset.write().mode(mode).format("paimon").save(path);
  2. When the job reaches the stage that runs collect at PaimonSparkWriter.scala:195, some executor nodes are lost and the stage's tasks are retried (see attached screenshots).
  3. The total number of rows written across the two attempts does not match the count returned by the final query (see attached screenshots):
    9314203 + 6211188 = 15525391 rows were written in total,
    but querying the Paimon table returns only 15476552 rows.
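
For reference, a minimal self-contained sketch of the reproduction. The warehouse location, table path, and synthetic dataset below are hypothetical stand-ins, and the catalog settings follow the Paimon Spark documentation; adjust them for your environment:

    import org.apache.spark.sql.Dataset;
    import org.apache.spark.sql.Row;
    import org.apache.spark.sql.SaveMode;
    import org.apache.spark.sql.SparkSession;

    public class PaimonWriteRepro {
        public static void main(String[] args) {
            // Hypothetical locations; adjust to your environment.
            String warehouse = "hdfs:///tmp/paimon-warehouse";
            String path = warehouse + "/default.db/repro_table";

            SparkSession spark = SparkSession.builder()
                    .appName("paimon-write-repro")
                    // Paimon Spark catalog, as documented for Paimon 0.9.
                    .config("spark.sql.catalog.paimon", "org.apache.paimon.spark.SparkCatalog")
                    .config("spark.sql.catalog.paimon.warehouse", warehouse)
                    .getOrCreate();

            // Any large enough dataset will do; a synthetic range is used here for illustration.
            Dataset<Row> dataset = spark.range(15_000_000L).toDF("id");

            // Step 1 from the report: write to the Paimon table by path.
            dataset.write().mode(SaveMode.Overwrite).format("paimon").save(path);

            // Read the table back and compare the row count with what was written.
            long written = dataset.count();
            long queried = spark.read().format("paimon").load(path).count();
            System.out.println("written=" + written + ", queried=" + queried);

            spark.stop();
        }
    }

If the two counts printed at the end differ, the write lost data, which is what step 3 shows.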

What doesn't meet your expectations?

When I increased the executor memory, no task was retried and 15,525,244 rows ended up being written. My guess is that a retried task overwrites the files written by the first attempt, though there may be another cause.
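
One way to check that guess, sketched below under the assumption of the default layout for an unpartitioned table (data files under a bucket-0 directory at the hypothetical path used earlier), is to list the data files and their sizes after the job finishes; if a retried task reuses a file name from the first attempt, the earlier output would be silently replaced rather than added to:

    import java.util.Arrays;

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class ListPaimonDataFiles {
        public static void main(String[] args) throws Exception {
            // Hypothetical table location; layout assumes an unpartitioned table with a single bucket.
            Path bucketDir = new Path("hdfs:///tmp/paimon-warehouse/default.db/repro_table/bucket-0");

            FileSystem fs = bucketDir.getFileSystem(new Configuration());

            // Print each data file with its size so two runs (with and without retries) can be compared.
            FileStatus[] files = fs.listStatus(bucketDir);
            Arrays.stream(files).forEach(f ->
                    System.out.println(f.getPath().getName() + "\t" + f.getLen()));
        }
    }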

Anything else?

No response

Are you willing to submit a PR?

  • I'm willing to submit a PR!
