Add experimental filesystem="arrow" support in dask_cudf.read_parquet #16684

Merged 49 commits on Sep 25, 2024
Commits
469bc5e
allow pyarrow-based read with cudf backend
rjzamora Aug 27, 2024
f20cc25
re-org
rjzamora Aug 27, 2024
8f0f598
temporary change for debugging
rjzamora Aug 28, 2024
64fd701
adjust for upstream bug
rjzamora Aug 28, 2024
8e0c902
remove stale comment
rjzamora Aug 28, 2024
18e1c08
add file aggregation
rjzamora Aug 28, 2024
5215a05
Merge branch 'branch-24.10' into dask-cudf-arrow-filesystem
rjzamora Aug 29, 2024
c51a7bb
test coverage
rjzamora Aug 29, 2024
b7a90c1
Merge branch 'branch-24.10' into dask-cudf-arrow-filesystem
rjzamora Aug 29, 2024
43274e2
allow aggregate_files=True
rjzamora Aug 30, 2024
63c3f04
Merge remote-tracking branch 'upstream/branch-24.10' into dask-cudf-a…
rjzamora Aug 30, 2024
a1bd43c
Merge branch 'branch-24.10' into dask-cudf-arrow-filesystem
rjzamora Aug 30, 2024
e3ca47f
Merge remote-tracking branch 'upstream/branch-24.10' into dask-cudf-a…
rjzamora Sep 3, 2024
12c09a5
fix test
rjzamora Sep 3, 2024
daee7ec
Merge branch 'branch-24.10' into dask-cudf-arrow-filesystem
rjzamora Sep 4, 2024
d068103
Merge remote-tracking branch 'upstream/branch-24.10' into dask-cudf-a…
rjzamora Sep 4, 2024
257eb26
Merge branch 'branch-24.10' into dask-cudf-arrow-filesystem
rjzamora Sep 5, 2024
ec38b1e
Make isinstance check pass for proxy ndarrays (#16601)
Matt711 Sep 5, 2024
853c76b
Performance improvement for strings::slice for wide strings (#16574)
davidwendt Sep 5, 2024
bdd2bab
skip for pyarrow<15
rjzamora Sep 6, 2024
d943d8d
Merge remote-tracking branch 'upstream/branch-24.10' into dask-cudf-a…
rjzamora Sep 6, 2024
eb9eee0
Merge branch 'branch-24.10' into dask-cudf-arrow-filesystem
rjzamora Sep 10, 2024
b9c5147
Merge branch 'branch-24.10' into dask-cudf-arrow-filesystem
rjzamora Sep 10, 2024
ec04e78
Merge branch 'branch-24.10' into dask-cudf-arrow-filesystem
rjzamora Sep 18, 2024
e391789
Merge branch 'branch-24.10' into dask-cudf-arrow-filesystem
rjzamora Sep 19, 2024
e154d01
Merge remote-tracking branch 'upstream/branch-24.10' into dask-cudf-a…
rjzamora Sep 19, 2024
3246d67
Intentionally leak thread_local CUDA resources to avoid crash (part 1…
kingcrimsontianyu Sep 19, 2024
2f424f2
Access Frame attributes instead of ColumnAccessor attributes when ava…
mroeschke Sep 19, 2024
362195d
Merge remote-tracking branch 'upstream/branch-24.10' into dask-cudf-a…
rjzamora Sep 24, 2024
4ce83d4
isolate expermental code path
rjzamora Sep 24, 2024
4d87013
remove unncessary logic
rjzamora Sep 24, 2024
e5b272a
remove unncessary logic - forgot to save
rjzamora Sep 24, 2024
8d87c54
add warning
rjzamora Sep 24, 2024
8cfe71e
remove blocksize and aggregate_files handling
rjzamora Sep 24, 2024
badf359
Merge remote-tracking branch 'upstream/branch-24.10' into dask-cudf-a…
rjzamora Sep 24, 2024
4c1c5ae
warn rather than raise for blocksize
rjzamora Sep 24, 2024
3f1d925
Merge branch 'branch-24.10' into dask-cudf-arrow-filesystem
galipremsagar Sep 24, 2024
8c267c7
Merge remote-tracking branch 'upstream/branch-24.10' into dask-cudf-a…
rjzamora Sep 25, 2024
91d2d77
address code review from mads
rjzamora Sep 25, 2024
239639f
Merge remote-tracking branch 'upstream/branch-24.10' into dask-cudf-a…
rjzamora Sep 25, 2024
c944a52
Merge branch 'branch-24.10' into dask-cudf-arrow-filesystem
galipremsagar Sep 25, 2024
791a4fd
more cleanup
rjzamora Sep 25, 2024
4c5ee6d
remove warning and add not to best-practices
rjzamora Sep 25, 2024
4d28db7
Build `cudf-polars` with `build.sh` (#16898)
brandon-b-miller Sep 25, 2024
9aa5aca
Fix DataFrame.drop(columns=cudf.Series/Index, axis=1) (#16712)
mroeschke Sep 25, 2024
42a15ee
[DOC] Update Pylibcudf doc strings (#16810)
Matt711 Sep 25, 2024
2c5bb57
Optimization of tdigest merge aggregation. (#16780)
nvdbaranec Sep 25, 2024
ed19b2e
Display deltas for `cudf.pandas` test summary (#16864)
galipremsagar Sep 25, 2024
aa492f5
Merge remote-tracking branch 'upstream/branch-24.10' into dask-cudf-a…
rjzamora Sep 25, 2024

Merge remote-tracking branch 'upstream/branch-24.10' into dask-cudf-arrow-filesystem
rjzamora committed Sep 4, 2024
commit d0681035294f6ce2271553f9e3f2042f0c0f96d8

This merge commit was added into this branch cleanly.

There are no new changes to show, but you can still view the diff.