Any way to support dictionary encoded columns with ARROW_CHUNK_ROWS? #37

@nugend

At the moment, passing in the dictionary and indices for a column represented that way throws an unequal length arrays error.

The functionality is definitely supported by the ArrowStream format. The issue seems to be that the MakeDictionary function and the MakeChunkedArray function don’t play nicely together. I’m not sure what the preferred solution is. I’m happy to prepare the value array manually and pass the indices in with an explicit reference if that’s what’s needed.

If you want to handle it in the library, my guess is you could handle the values and indices in separate passes?
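
To illustrate what I mean by separate passes, here is a rough sketch in plain Python. It is not the library's code: `chunk_dictionary_column` and `decode` are hypothetical names, and I'm assuming the chunk size applies to rows (the indices) only, with the dictionary left whole and shared by every chunk. That would avoid comparing the dictionary's length against the indices' length.

```python
from typing import List, Sequence, Tuple


def chunk_dictionary_column(
    dictionary: Sequence[str],
    indices: Sequence[int],
    chunk_rows: int,
) -> List[Tuple[Sequence[str], List[int]]]:
    """Hypothetical helper: chunk the indices only, sharing one dictionary."""
    if chunk_rows <= 0:
        raise ValueError("chunk_rows must be positive")

    # Pass 1: handle the values once. The dictionary is not chunked;
    # we only check that every index points at a valid entry.
    for idx in indices:
        if not 0 <= idx < len(dictionary):
            raise IndexError(f"index {idx} is outside the dictionary")

    # Pass 2: slice the indices into row chunks of at most chunk_rows.
    return [
        (dictionary, list(indices[start:start + chunk_rows]))
        for start in range(0, len(indices), chunk_rows)
    ]


def decode(chunks: List[Tuple[Sequence[str], List[int]]]) -> List[str]:
    """Expand the chunks back into plain values, to check nothing was lost."""
    return [dictionary[idx] for dictionary, idxs in chunks for idx in idxs]
```

With a three-entry dictionary, five rows, and a chunk size of 2, this gives three index chunks of lengths 2, 2 and 1, each paired with the same full dictionary.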
