You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
Dask-SQL currently supports our own custom set of Rust PyO3 bindings for Apache Arrow DataFusion. Since we started that effort interest has grown in that community around offering their own set of Python bindings for Arrow DataFusion. It seems sensible to me to contribute the bindings that we have and gain the development support from that community and alleviate our developer time for features and enhancements.
This EPIC is setup to track the effort of moving code to Arrow DataFusion Python and then refactoring our codebase to subsequently use it.
While the PRs will mostly be simple in nature there is likely to be several. The choice was made to do several PRs in favor of a single large PR so reviewing would be more quick and easy and to help identify any possible regressions that might present themselves in a more cornered manner.
I will attempt to keep this list up to date with PRs relevant to this effort and their status
Improve build command so that python bindings can be built "out of band", meaning projects like Dask-SQL can build the python bindings and link to their via their own Cargo build process
jdye64
changed the title
EPIC: Refactor Dask-SQL codebase to use Apache Arrow DataFusion Python
EPIC: Contribute Dask-SQL codebase to Apache Arrow DataFusion Python
Mar 13, 2023
Is your feature request related to a problem? Please describe.
Dask-SQL currently supports our own custom set of Rust PyO3 bindings for Apache Arrow DataFusion. Since we started that effort interest has grown in that community around offering their own set of Python bindings for Arrow DataFusion. It seems sensible to me to contribute the bindings that we have and gain the development support from that community and alleviate our developer time for features and enhancements.
This EPIC is setup to track the effort of moving code to Arrow DataFusion Python and then refactoring our codebase to subsequently use it.
While the PRs will mostly be simple in nature there is likely to be several. The choice was made to do several PRs in favor of a single large PR so reviewing would be more quick and easy and to help identify any possible regressions that might present themselves in a more cornered manner.
I will attempt to keep this list up to date with PRs relevant to this effort and their status
Arrow DataFusion Python - Worklog
Dask-SQL - Worklog
The text was updated successfully, but these errors were encountered: