Commit 9726699
Upgrade to DataFusion 44 (#368)
* Get working build
* Add pool_size method to MemoryPool (#218) (#230)
* Add pool_size method to MemoryPool
* Fix
* Fmt
Co-authored-by: Daniël Heres <danielheres@gmail.com>
* Respect `IGNORE NULLS` flag in `ARRAY_AGG` (#260) (apache#15544) v48
* Hook for doing distributed `CollectLeft` joins (#269)
* Ignore writer shutdown error (#271)
* ignore writer shutdown error
* cargo check
* Fix bug in `swap_hash_join` (#278)
* Try and fix swap_hash_join
* Only swap projections when join does not have projections
* just backport upstream fix
* remove println
* Support Duration in min/max agg functions (#283) (apache#15310) v47
* Support Duration in min/max agg functions
* Attempt to fix build
* Attempt to fix build - Fix chrono version
* Revert "Attempt to fix build - Fix chrono version"
This reverts commit fd76fe6.
* Revert "Attempt to fix build"
This reverts commit 9114b86.
---------
Co-authored-by: svranesevic <svranesevic@users.noreply.github.com>
* Fix panics in array_union (#287) (apache#15149) v48
* Drop rust-toolchain
* Fix panics in array_union
* Fix the chrono
* Backport `GroupsAccumulator` for Duration min/max agg (#288) (apache#15322) v47
* Fix array_sort for empty record batch (#290) (apache#15149) v48
* fix: rewrite fetch, skip of the Limit node in correct order (apache#14496) v46
* fix: rewrite fetch, skip of the Limit node in correct order
* style: fix clippy
* Support aliases in ConstEvaluator (apache#14734) (#281) v46
* Support aliases in ConstEvaluator (apache#14734)
Not sure why they are not supported. It seems that if we're not careful,
some transformations can introduce aliases nested inside other expressions.
* Format Cargo.toml
* Preserve the name of grouping sets in SimplifyExpressions (#282) (apache#14888) v46
Whenever we use `recompute_schema` or `with_exprs_and_inputs`,
this ensures that we obtain the same schema.
* Support Duration in min/max agg functions (#284) (apache#15310) v47
Co-authored-by: svranesevic <svranesevic@users.noreply.github.com>
* fix case_column_or_null with nullable when conditions (apache#13886) v45
* fix case_column_or_null with nullable when conditions
* improve sqllogictests for case_column_or_null
---------
Co-authored-by: zhangli20 <zhangli20@kuaishou.com>
* Fix casewhen (apache#14156) v45
* Cherry-pick topk limit pushdown fix (apache#14192) v45
* fix: FULL OUTER JOIN and LIMIT produces wrong results (apache#14338) v45
* fix: FULL OUTER JOIN and LIMIT produces wrong results
* Fix minor slt testing
* fix test
(cherry picked from commit ecc5694)
* Cherry-pick global limit fix (apache#14245) v45
* fix: Limits are not applied correctly (apache#14418) v46
* fix: Limits are not applied correctly
* Add easy fix
* Add fix
* Add slt testing
* Address comments
* Disable grouping set in CSE
* Fix spm + limit (apache#14569) v46
* prost 0.13 / fix parquet dep
* Delete unreliable checks
* Segfault in ByteGroupValueBuilder (#294) (apache#15968) v50
* test to demonstrate segfault in ByteGroupValueBuilder
* check for offset overflow
* clippy
(cherry picked from commit 5bdaeaf)
* Update arrow dependency to include rowid (#295)
* Update arrow version
* Feat: Add fetch to CoalescePartitionsExec (apache#14499) (#298) v46
* add fetch info to CoalescePartitionsExec
* use Statistics with_fetch API on CoalescePartitionsExec
* check limit_reached only if fetch is assigned
Co-authored-by: mertak-synnada <mertak67+synaada@gmail.com>
* Fix `CoalescePartitionsExec` proto serialization (apache#15824) (#299) v48
* add fetch to CoalescePartitionsExecNode
* gen proto code
* Add test
* fix
* fix build
* Fix test build
* remove comments
Co-authored-by: 张林伟 <lewiszlw520@gmail.com>
* Add JoinContext with JoinLeftData to TaskContext in HashJoinExec (#300)
* Add JoinContext with JoinLeftData to TaskContext in HashJoinExec
* Expose random state as const
* re-export ahash::RandomState
* JoinContext default impl
* Add debug log when setting join left data
* Update arrow version for not preserving dict_id (#303)
* Use partial aggregation schema for spilling to avoid column mismatch in GroupedHashAggregateStream (apache#13995) (#302) v45
* Refactor spill handling in GroupedHashAggregateStream to use partial aggregate schema
* Implement aggregate functions with spill handling in tests
* Add tests for aggregate functions with and without spill handling
* Move test related imports into mod test
* Rename spill pool test functions for clarity and consistency
* Refactor aggregate function imports to use fully qualified paths
* Remove outdated comments regarding input batch schema for spilling in GroupedHashAggregateStream
* Update aggregate test to use AVG instead of MAX
* assert spill count
* Refactor partial aggregate schema creation to use create_schema function
* Refactor partial aggregation schema creation and remove redundant function
* Remove unused import of Schema from arrow::datatypes in row_hash.rs
* move spill pool testing for aggregate functions to physical-plan/src/aggregates
* Use Arc::clone for schema references in aggregate functions
(cherry picked from commit 81b50c4)
Co-authored-by: kosiew <kosiew@gmail.com>
* Update tag
* Push limits past windows (#337) (apache#17347) v50
* Restore old method for DQE
* feat(optimizer): Enable filter pushdown on window functions (apache#14026) v45
* Avoid Aliased Window Expr Enter Unreachable Code (apache#14109) v45
(cherry picked from commit fda500a)
* Use `Expr::qualified_name()` and `Column::new()` to extract partition keys from window and aggregate operators (#355) (apache#17757) v51
* Update PR template to be relevant to our fork
* Make limit pushdown work for SortPreservingMergeExec (apache#17893) (#361)
* re-publicise functions DQE relies on
* Handle columns in with_new_exprs with a Join (apache#15055) (#384)
apache#15055
* handle columns in with_new_exprs with Join
* test doesn't return result
* take join from result
* clippy
* make test fallible
* accept any pair of expression for new_on in with_new_exprs for Join
* use with_capacity
Co-authored-by: delamarch3 <68732277+delamarch3@users.noreply.github.com>
---------
Co-authored-by: Georgi Krastev <georgi.krastev@coralogix.com>
Co-authored-by: Daniël Heres <danielheres@gmail.com>
Co-authored-by: Dan Harris <1327726+thinkharderdev@users.noreply.github.com>
Co-authored-by: Faiaz Sanaulla <105630300+fsdvh@users.noreply.github.com>
Co-authored-by: Sava Vranešević <20240220+svranesevic@users.noreply.github.com>
Co-authored-by: svranesevic <svranesevic@users.noreply.github.com>
Co-authored-by: Yingwen <realevenyag@gmail.com>
Co-authored-by: Zhang Li <richselian@gmail.com>
Co-authored-by: zhangli20 <zhangli20@kuaishou.com>
Co-authored-by: Aleksey Kirilishin <54231417+avkirilishin@users.noreply.github.com>
Co-authored-by: xudong.w <wxd963996380@gmail.com>
Co-authored-by: Qi Zhu <821684824@qq.com>
Co-authored-by: Martins Purins <martins.purins@coralogix.com>
Co-authored-by: mertak-synnada <mertak67+synaada@gmail.com>
Co-authored-by: 张林伟 <lewiszlw520@gmail.com>
Co-authored-by: kosiew <kosiew@gmail.com>
Co-authored-by: nuno-faria <nunofpfaria@gmail.com>
Co-authored-by: Berkay Şahin <124376117+berkaysynnada@users.noreply.github.com>
Co-authored-by: Mason Hall <mason.hall@coralogix.com>
Co-authored-by: delamarch3 <68732277+delamarch3@users.noreply.github.com>1 parent 3cc3fca commit 9726699
File tree
89 files changed
+4204
-1808
lines changed- .github
- actions
- setup-builder
- setup-macos-aarch64-builder
- setup-macos-builder
- setup-rust-runtime
- workflows
- benchmarks
- queries/clickbench
- ci/scripts
- datafusion-cli
- datafusion-examples
- datafusion
- common
- src
- core
- src
- catalog_common
- dataframe
- datasource/file_format/write
- physical_optimizer
- tests
- expr_api
- physical_optimizer
- sql
- execution/src/memory_pool
- expr/src/logical_plan
- functions-aggregate/src
- functions-nested/src
- optimizer
- src
- simplify_expressions
- physical-expr-common/src
- physical-expr/src/expressions
- physical-optimizer
- src
- physical-plan/src
- aggregates
- group_values
- multi_group_by
- joins
- proto-common/gen
- proto
- gen
- proto
- src
- generated
- physical_plan
- tests/cases
- sqllogictest/test_files
- substrait
- docs/source/user-guide
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
89 files changed
+4204
-1808
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
37 | 37 | | |
38 | 38 | | |
39 | 39 | | |
40 | | - | |
| 40 | + | |
41 | 41 | | |
42 | 42 | | |
43 | 43 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
42 | | - | |
43 | | - | |
| 42 | + | |
44 | 43 | | |
45 | 44 | | |
46 | 45 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
42 | | - | |
43 | | - | |
| 42 | + | |
44 | 43 | | |
45 | 44 | | |
46 | 45 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
20 | 20 | | |
21 | 21 | | |
22 | 22 | | |
23 | | - | |
24 | | - | |
25 | 23 | | |
26 | 24 | | |
27 | 25 | | |
| |||
31 | 29 | | |
32 | 30 | | |
33 | 31 | | |
34 | | - | |
35 | | - | |
36 | 32 | | |
37 | 33 | | |
38 | 34 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
| 1 | + | |
| 2 | + | |
2 | 3 | | |
3 | | - | |
4 | | - | |
5 | | - | |
| 4 | + | |
6 | 5 | | |
7 | | - | |
| 6 | + | |
8 | 7 | | |
9 | | - | |
| 8 | + | |
10 | 9 | | |
11 | | - | |
12 | | - | |
13 | | - | |
14 | | - | |
15 | | - | |
16 | | - | |
17 | | - | |
18 | | - | |
19 | | - | |
20 | | - | |
21 | | - | |
22 | | - | |
23 | | - | |
24 | | - | |
25 | | - | |
26 | | - | |
27 | | - | |
28 | | - | |
29 | | - | |
30 | | - | |
31 | | - | |
32 | | - | |
33 | | - | |
34 | | - | |
35 | | - | |
36 | | - | |
37 | | - | |
38 | | - | |
39 | | - | |
40 | | - | |
This file was deleted.
This file was deleted.
This file was deleted.
This file was deleted.
0 commit comments