Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add arrow datafusion python dependency #1085

Merged
merged 60 commits into from
May 4, 2023

Conversation

jdye64
Copy link
Collaborator

@jdye64 jdye64 commented Mar 14, 2023

Closed with #1084

…hon bindings for DataFusion and bare minimum refactoring we need in our repo to accomodate them
@jdye64
Copy link
Collaborator Author

jdye64 commented Mar 15, 2023

Waiting on oxidecomputer/typify#221 to merge so that conda builds can work with new setup.

@jdye64 jdye64 removed the request for review from andygrove March 15, 2023 13:49
@jdye64 jdye64 changed the base branch from main to arrow_datafusion_python_refactor March 16, 2023 20:45
dask_sql/mappings.py Outdated Show resolved Hide resolved
@jdye64 jdye64 marked this pull request as ready for review April 26, 2023 23:12
Copy link
Collaborator

@charlesbluca charlesbluca left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @jdye64 🎉 a few comments around how we include protoc in CI/packaging and some follow up around the timedelta changes

.github/workflows/conda.yml Outdated Show resolved Hide resolved
.github/workflows/rust.yml Show resolved Hide resolved
.github/workflows/test.yml Outdated Show resolved Hide resolved
.github/workflows/test.yml Outdated Show resolved Hide resolved
dask_sql/mappings.py Show resolved Hide resolved
dask_sql/physical/rex/core/literal.py Show resolved Hide resolved
tests/integration/test_groupby.py Outdated Show resolved Hide resolved
tests/integration/test_select.py Outdated Show resolved Hide resolved
dask_planner/.cargo/config.toml Show resolved Hide resolved
dask_planner/Cargo.toml Show resolved Hide resolved
dask_planner/src/sql/logical.rs Outdated Show resolved Hide resolved
dask_planner/src/sql/optimizer.rs Outdated Show resolved Hide resolved
@@ -926,8 +926,8 @@ def test_timestampdiff(c):
"hours": [46, 6648, 591744],
"days": [1, 277, 24656],
"weeks": [0, 39, 3522],
"months": [0, 9, 810],
"quarters": [0, 3, 270],
"months": [0, 9, 821],
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Since this behavior is different from the results we get from mysql and a few implementations that support timestampdiff, I'm not sure whether we want to keep the existing logic or modify it. I didn't find an easy way to get a timestampdiff with datafusion/postgresql

Copy link
Collaborator

@ayushdg ayushdg left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot for these changes @jdye64!

Comment on lines +1 to +13
[build-system]
requires = ["setuptools", "wheel", "setuptools-rust"]

[tool.isort]
profile = "black"

[tool.maturin]
include = [
{ path = "Cargo.lock", format = "sdist" }
]
exclude = [".github/**", "ci/**", ".asf.yaml"]
# Require Cargo.lock is up to date
locked = true
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Noting that I've been thinking about consolidating as much of our project metadata into pyproject.toml as possible (similar to other RAPIDS/Dask projects) which would make this file less stubby

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1 to this. I've also been thinking of simplifying the setup.py scripts and moving as much as possible into the pyproject.toml file

@ayushdg ayushdg merged commit 108e024 into dask-contrib:main May 4, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants