Reorganize table providers by table format

**Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
Currently the `TableProvider` implementations are split by file format (Parquet, CSV...). One other solution to organize `TableProvider`s would be by table format (file system listing, Iceberg, [Delta](https://github.com/delta-io/delta-rs/blob/main/rust/src/delta_datafusion.rs)). 

**Describe the solution you'd like**
- `ExecutionPlan` implementations would remain organized by file format. A `TableProvider` could create different types of execution plan according to its configuration or auto-discovering the data file format from the information stored in the table format
- the current implementations for Parquet, CSV, JSON and Avro would go into a `ListingTable` provider. Implicitly the table format implemented currently:
  - is given a directory as input
  - discovers the files using the file system "listing" operation
- Schema inference, when required, would be resolved outside the `TableProvider` and and would be exposed as a service by ballista

**Describe alternatives you've considered**
An alternative is to leave the table providers organized as is and try to solve the table formats at a different moment of the planning. **This is discussed in this [design document](https://docs.google.com/document/d/1Bd4-PLLH-pHj0BquMDsJ6cVr_awnxTuvwNJuWsTHxAQ/edit?usp=sharing).**

This `ListingTable` provider could also be added into an external crate. But in that case it would be a partial fork of DataFusion that would require to be maintained separately.

**Additional context**
- This will help solving apache/arrow-datafusion#133
- It helps solving Ballista issues
  - apache/arrow-ballista#22
  - apache/arrow-ballista#14
  - apache/arrow-datafusion#871
- This is related and and complementary to apache/arrow-datafusion#944
- This replaces the `TableDescriptor` abstraction added in apache/arrow-datafusion#932


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Reorganize table providers by table format #1009

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Reorganize table providers by table format #1009

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions