
Fusing partial aggregation with repartition #12596

Open
Rachelint opened this issue Sep 23, 2024 · 7 comments

Comments

@Rachelint
Contributor

Is your feature request related to a problem or challenge?

I implemented a proof of concept in #12526 and found that this idea can actually improve performance.

But for the reasons stated in #11680 (comment),

I think this improvement is not well suited to be pushed forward at the moment.

I am just filing an issue to track it.

Describe the solution you'd like

  • Introduce a partitioned hash table in the partial aggregation, and partition the data by hash before inserting it into the hash table.
  • Then push the groups to the final aggregation partition by partition, rather than splitting them again in repartition and merging them again in coalesce. (A minimal sketch follows this list.)
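A minimal sketch of the idea (the `PartitionedGroupValues` type and its fields are hypothetical, not DataFusion's actual API):

```rust
// A minimal sketch, not DataFusion's actual API: the partial aggregation
// keeps one sub-table per output partition and routes each group to its
// partition by the group key hash, so its output is already partitioned.
use std::collections::HashMap;

struct PartitionedGroupValues {
    num_partitions: usize,
    /// One map per output partition: group key hash -> group index.
    /// (Real code would also compare the actual group keys on hash collisions.)
    partitions: Vec<HashMap<u64, usize>>,
}

impl PartitionedGroupValues {
    fn new(num_partitions: usize) -> Self {
        Self {
            num_partitions,
            partitions: vec![HashMap::new(); num_partitions],
        }
    }

    /// Route a group to its output partition using the hash the aggregation
    /// already computed, then intern it there. Returns (partition, group index).
    fn intern(&mut self, group_hash: u64) -> (usize, usize) {
        let partition = (group_hash as usize) % self.num_partitions;
        let map = &mut self.partitions[partition];
        let next_idx = map.len();
        let group_idx = *map.entry(group_hash).or_insert(next_idx);
        (partition, group_idx)
    }
}
```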

Describe alternatives you've considered

No response

Additional context

No response

@Rachelint added the enhancement (New feature or request) label on Sep 23, 2024
@Rachelint
Contributor Author

take

@Rachelint
Contributor Author

I will push this forward after the tasks listed in #11680 (comment) are finished.

@Rachelint
Contributor Author

@waynexia may also be interested in this?

@yjshen
Member

yjshen commented Sep 25, 2024

Introduce a partitioned hash table in the partial aggregation, and partition the data by hash before inserting it into the hash table.
Then push the groups to the final aggregation partition by partition, rather than splitting them again in repartition and merging them again in coalesce.

I'm not clear on how this proposal works. Could you please explain why it provides performance benefits compared to partial aggregation, exchange, and final aggregation? Is the proposal aimed explicitly at accelerating high cardinality aggregation, or is it intended to enhance aggregation performance?

@Rachelint
Contributor Author

Rachelint commented Sep 25, 2024

Introduce a partitioned hash table in the partial aggregation, and partition the data by hash before inserting it into the hash table.
Then push the groups to the final aggregation partition by partition, rather than splitting them again in repartition and merging them again in coalesce.

I'm not clear on how this proposal works. Could you please explain why it provides performance benefits compared to partial aggregation, exchange, and final aggregation? Is the proposal aimed explicitly at accelerating high cardinality aggregation, or is it intended to enhance aggregation performance?

I think it enhances aggregation performance generally?

  • Currently we can think of GroupValues and GroupAccumulator as using a single Vec to manage the intermediate states in the partial aggregation.

  • After finishing work in the partial aggregation, we pass the batch to the exchange, where we recompute the hashes of the batch. The hashes have actually already been computed in GroupValues, so this recomputation is the first avoidable CPU cost.

  • Then we split the batch into multiple batches, according to the partition numbers computed from the hashes. The splitting needs to create multiple new batches to hold the values from the source batch and copy data into them, and that is the second avoidable CPU cost.

  • Finally, before passing data to the final aggregation of a partition, we first need to copy that partition's split small batches into a buffer in coalesce until the buffer is large enough (usually the default batch size, 8192 rows), and that is the third avoidable CPU cost. (A simplified sketch of this path follows.)
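A simplified model of that path, assuming plain Vecs in place of Arrow RecordBatches and a hypothetical `repartition_and_coalesce` helper (in the real plan the split in RepartitionExec and the buffering in CoalesceBatchesExec are two separate copies; the sketch collapses them into one):

```rust
// A simplified model of the current path between partial and final
// aggregation; each numbered comment matches one avoidable cost above.
fn repartition_and_coalesce(
    batch: &[u64],                // stand-in for a RecordBatch of rows
    hash: impl Fn(&u64) -> u64,   // stand-in for hashing the group keys
    num_partitions: usize,
    buffers: &mut Vec<Vec<u64>>,  // one coalesce buffer per partition
    target_batch_size: usize,     // usually the default batch size, 8192
) -> Vec<Vec<u64>> {
    let mut ready = Vec::new();
    for row in batch {
        // (1) recompute the hash, even though GroupValues already computed it
        let h = hash(row);
        let partition = (h as usize) % num_partitions;
        // (2) copy the row into a new per-partition batch
        buffers[partition].push(*row);
        // (3) keep buffering small batches until they reach the target size
        if buffers[partition].len() >= target_batch_size {
            ready.push(std::mem::take(&mut buffers[partition]));
        }
    }
    ready
}
```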

After using a partitioned approach in GroupValues and GroupAccumulator:

  • We can naturally reuse the hashes computed in GroupValues when calculating the partition numbers of the batches.

  • We store the intermediate states in the partial aggregation partition by partition. When we submit them to the final aggregation, we just submit them partition by partition, rather than splitting first and merging afterwards. (See the sketch below.)
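A minimal sketch of the emit side under this approach (the `PartitionedStates` type is hypothetical, not DataFusion's API): each partition's states already live together, so each one can be handed to the matching final-aggregation partition as a whole.

```rust
// A minimal sketch, with hypothetical types: the partial aggregation keeps
// its intermediate states grouped by output partition, so emitting is just
// handing each partition's states to the matching final-aggregation input.
struct PartitionedStates {
    /// Intermediate aggregate states (e.g. partial sums), one Vec per partition.
    per_partition: Vec<Vec<i64>>,
}

impl PartitionedStates {
    /// Emit one batch per partition: no repartition split, no coalesce copy.
    fn emit(&mut self) -> impl Iterator<Item = (usize, Vec<i64>)> + '_ {
        self.per_partition
            .iter_mut()
            .enumerate()
            .map(|(partition, states)| (partition, std::mem::take(states)))
    }
}
```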

@jayzhan211
Contributor

I think our goal is to combine partial + repartition + final into a single operator, and fusing partial + repartition is the first step of this. After that we could try doing the final aggregation step as well.

@Rachelint
Contributor Author

Rachelint commented Sep 25, 2024

I think our goal is to combine partial + repartition + final into a single operator, and fusing partial + repartition is the first step of this. After that we could try doing the final aggregation step as well.

Yes, it may be attractive if we combine them in some way; we seem to have a chance to do more optimizations.
🤔

  • Now we build record batches from the partial aggregation's internal states.
  • Then we pass them to the final aggregation.
  • And then we need to copy the needed data into the final aggregation's internal states.

It seems to be expensive for bytes and strings? Maybe we can pass the internal states directly to the final aggregation and avoid some copies?
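A hypothetical sketch of the difference for string/bytes group values, assuming made-up `PartialStringGroups` / `FinalStringGroups` types rather than any actual DataFusion interface: copying through a RecordBatch amounts to cloning the byte buffers, while handing over the partial stage's buffers avoids the copy.

```rust
// A hypothetical sketch (made-up types, not a DataFusion interface) of the
// copy we might avoid for string/bytes group values.
struct PartialStringGroups {
    /// Concatenated group-key bytes plus offsets, as built by the partial stage.
    data: Vec<u8>,
    offsets: Vec<usize>,
}

struct FinalStringGroups {
    data: Vec<u8>,
    offsets: Vec<usize>,
}

impl FinalStringGroups {
    /// Copying path: duplicate every byte of the group keys (roughly what
    /// building and then re-reading a RecordBatch costs).
    fn from_copy(partial: &PartialStringGroups) -> Self {
        Self {
            data: partial.data.clone(),
            offsets: partial.offsets.clone(),
        }
    }

    /// Move path: take ownership of the partial stage's buffers, no byte copy.
    fn from_move(partial: PartialStringGroups) -> Self {
        Self {
            data: partial.data,
            offsets: partial.offsets,
        }
    }
}
```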
