Closed
Labels
help wanted
Description
Work committed to
These are projects current contributors are working on.
- (P0) Data Acceptance Tests running in CI (@wjones127)
- (P0) Fully protocol-compliant optimistic commit protocol, including conflict resolution (feat: optimistic transaction protocol #632) (@roeap)
- (P0) ADBC driver: create / read / append / overwrite (@wjones127)
  - Lay foundation for DuckDB plugin, more language bindings (R), and cross-language Polars support (R and JavaScript, in addition to Python)
- (P1) Python bindings integrated with ADBC driver (@wjones127)
  - ADBC to supersede PyArrow-based reader / writer.
- (P0) Remove experimental marker from Python writer (@wjones127)
- (P0) Writer version 2 support in operation module (@wjones127)
- (TBD) Provide async features in the Python binding (@fvaleye)
- (TBD) Airbyte <> Delta Lake integration (@fvaleye)
- More Rust documentation
  - Figure out where to host
  - Figure out SEO
  - Probably migrate off of github.io
- Blog posts (@MrPowers)
  - PyO3 blog post good for Rust audience
  - Content for Azure. Developer advocacy arm of Azure is very impressive. They spread this message.
  - Usage of the Python module is more compelling
  - Kafka-delta-ingest reduced writer cost 25 times. Christian & Tyler co-authors.
- Purge Ruby bindings. They’re not usable.
Projects seeking contributors
In addition to smaller issues labelled good-first-issue, these are some larger projects that we could use some help on. Most of them will be implemented as part of the operations module in the Rust source and can later be exposed to Python and other bindings.
- DELETE operation (Implement simple delete case #832)
- UPDATE operation (Support Update operation #1126)
- MERGE operation (Implement merge command #850)
- OPTIMIZE operation, which currently only works on append-only tables (Support Optimize on non-append-only tables #1125)
  - Z-order implementation (Implement Z-order sorting option in Optimize operation #1127)
  - Optimized Parquet compaction (Concatenate parquet files without deserializing? apache/arrow-rs#1711)
- Optimize VACUUM with bulk requests (Update pyproject definition in pyproject.toml #405, Bulk delete support for object-store apache/arrow-rs#2615)
- Support column mapping (Support column mapping #930)
- Support deletion vector (Deletion vectors #1094)
- Create a file caching layer (Feature request : add delta cache #769)
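
For anyone considering the Z-order item above, here is a minimal Python sketch of the general idea behind Z-order (Morton) sorting: interleave the bits of two column values into a single key, then sort by that key so rows that are close in both columns tend to land near each other. This is only an illustration of the technique, not the design proposed in #1127. The function names `interleave_bits` and `z_order_sort` are hypothetical, and it assumes two non-negative integer columns that fit in 16 bits.

```python
def interleave_bits(x: int, y: int, bits: int = 16) -> int:
    """Interleave the low `bits` bits of x and y into one Morton (Z-order) key.

    Hypothetical helper for illustration; assumes non-negative integers.
    """
    key = 0
    for i in range(bits):
        key |= ((x >> i) & 1) << (2 * i)      # bits of x go to even positions
        key |= ((y >> i) & 1) << (2 * i + 1)  # bits of y go to odd positions
    return key


def z_order_sort(rows, bits: int = 16):
    """Return (x, y) rows sorted by their Morton key."""
    return sorted(rows, key=lambda r: interleave_bits(r[0], r[1], bits))


if __name__ == "__main__":
    rows = [(1, 1), (0, 0), (1, 0), (0, 1)]
    print(z_order_sort(rows))
```

A real implementation in the operations module would also need to handle more than two columns, non-integer types, and nulls, none of which this sketch attempts.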