Distinct  aggregates return incorrect results

### Describe the bug

When translating aggregate expressions to DataFusion we ignore whether the aggregate is distinct or not, resulting in incorrect behavior.

The existing tests seem to pass because the input data does not contain duplicates.

### Steps to reproduce

Modify test `"single group-by column + aggregate column, multiple batches, no null"` to use `COUNT(DISTINCT _2)` instead of `COUNT(DISTINCT _1)` and the test fails because the results do not match Spark.

### Expected behavior

_No response_

### Additional context

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Distinct aggregates return incorrect results #1260

Describe the bug

Steps to reproduce

Expected behavior

Additional context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Distinct aggregates return incorrect results #1260

Description

Describe the bug

Steps to reproduce

Expected behavior

Additional context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions