Skip to content

Support for ORC in GCSToBigQueryOperator #48624

@will-lauer

Description

@will-lauer

Description

GCSToBigQueryOperator needs to support ORC files in addition to all the other formats that it supports. Both #12556 and #12557 have requested this in the past, and both have ended up getting closed due to implementation issues (deprecated functions in the implementation), but that doesn't remove the need for this feature. Either GCSToBigQueryOperator should be extended to support ALL the file types that the BQ supports, or the operator as a whole should likely be deprecated.

Use case/motivation

As a pipeline developer, I sometimes need to generate ORC files for other tools. I should not be prevents from loading these files into BQ using the GCSToBQ transport operator, but rather be able to load into BQ regardless of file type.

Related issues

#12556 and #12557 have both been closed due to technical issues with the implementation, not due to the feature not being needed.

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Code of Conduct

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions