Skip to content

Error occurs when only using partition columns in query #1999

@pjmore

Description

@pjmore

Describe the bug
When using a partitioned datasource selecting only the partitioned columns causes errors.

To Reproduce
Steps to reproduce the behavior:
Create sample parquet/csv file content doesn't matter. Copy file to two separate paths using path partitioning scheme. E.g.
"year=2021/month=09/day=09/file.parquet",
"year=2021/month=10/day=09/file.parquet",

Register listing table and execute query
select distinct year,month, day from t

Expected behavior
Should return record batch with:

+------+-------+-----+
| year | month | day||
+------+-------+-----+
| 2021 | 09    | 09  |
| 2021 | 10    | 09  |
| 2021 | 10    | 28  |
+------+-------+-----+ 

Additional context
This was verified to be a problem with parquet and csv formats. Unsure about avro or json.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions