Closed
Description
Feature request
Which Delta project/connector is this regarding?
- Spark
- Standalone
- Flink
- Kernel
- Other (fill in here)
Overview
Identity Column (writer version 6) as defined by https://github.com/delta-io/delta/blob/master/PROTOCOL.md#identity-columns.
Design doc: https://docs.google.com/document/d/1G8Vj6wOxswMx1JklllLoSn-obEpJ-iE_Lhpbd-RfIr4/edit?usp=sharing
PR:
- [Spark] Identity Columns APIs in DeltaColumnBuilder #2857
- [Spark] Add IdentityColumnsTableFeature #2859
- [Spark] Add GenerateIdentityValues UDF #2915
- [Spark] Add IdentityColumn.scala #2916
- [Spark] ALTER TABLE ALTER COLUMN SYNC IDENTITY SQL support #3005
- [Spark] Identity Columns Value Generation (without MERGE support) #3023
- [Spark] Python DeltaTableBuilder API for Identity Columns #3404
- [Spark] Block unsupported operations on identity columns #3457
- [Spark] Support clone and restore for Identity Columns #3459
- [SPARK] Relax metadata conflict for identity column #3525
- [SPARK][TEST-ONLY] Add more tests for Identity Column #3526
- [SPARK] Allow non-deterministic expressions in actions of merge #3558
- [SPARK] Refactor IdentityColumnTestUtils #3568
- [SPARK] Add MERGE support for tables with Identity Columns #3566
- [SPARK][TEST-ONLY] Use unique table name for identity column tests #3594
- [SPARK] Move identity column feature out of dev mode #3598
Motivation
This is probably the biggest missing part in Open Source Spark Delta.
Further details
Willingness to contribute
@c27kwan volunteered to work on this feature and posted a design doc here.
Metadata
Assignees
Labels
Type
Projects
Status
Done