[Enhancement] Support streaming files from a remote storage Blob, S3, etc #100

Closed
@stuarthrichardson

Description

Provide the ability to stream files from remote storage (Azure Blob Storage, AWS S3, etc.). After using the file-pulse connector, I now have requirements to ingest data from files on remote storage, as the focus is moving to cloud storage. More systems are dumping files into Azure Blob Storage and AWS S3 buckets, and this trend will only grow.

To support the different flavours of remote storage, I think each supported backend would need its own version of the FSDirectoryWalker and FileInputReader, as the configuration requirements would be different for each. I also think some of the internal interfaces would need to change.
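
To make the idea concrete, here is a rough sketch of how listing and reading could be split per backend. It is not the connector's actual API: `StorageWalker`, `StorageReader`, `InMemoryStorage`, and the `mem://` URIs are all hypothetical names, with an in-memory map standing in for a real S3 or Blob client.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.net.URI;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class RemoteStorageSketch {

    /** Hypothetical counterpart of a directory walker: lists objects under a prefix. */
    public interface StorageWalker {
        List<URI> listFiles(String prefix);
    }

    /** Hypothetical counterpart of an input reader: opens one object as a stream. */
    public interface StorageReader {
        InputStream open(URI file);
    }

    /** In-memory stand-in for a remote backend; a real one would wrap an SDK client. */
    public static final class InMemoryStorage implements StorageWalker, StorageReader {
        private final Map<URI, byte[]> objects = new TreeMap<>();

        public void put(String uri, String content) {
            objects.put(URI.create(uri), content.getBytes(StandardCharsets.UTF_8));
        }

        @Override
        public List<URI> listFiles(String prefix) {
            // Keep only the objects whose URI starts with the requested prefix.
            List<URI> matches = new ArrayList<>();
            for (URI uri : objects.keySet()) {
                if (uri.toString().startsWith(prefix)) {
                    matches.add(uri);
                }
            }
            return matches;
        }

        @Override
        public InputStream open(URI file) {
            byte[] data = objects.get(file);
            if (data == null) {
                throw new IllegalArgumentException("No such object: " + file);
            }
            return new ByteArrayInputStream(data);
        }
    }

    /** Reads a whole object as UTF-8 text through any StorageReader. */
    public static String readText(StorageReader reader, URI file) {
        try (InputStream in = reader.open(file)) {
            return new String(in.readAllBytes(), StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        InMemoryStorage storage = new InMemoryStorage();
        storage.put("mem://bucket/in/a.csv", "id,name");
        for (URI file : storage.listFiles("mem://bucket/in/")) {
            System.out.println(file + " -> " + readText(storage, file));
        }
    }
}
```

The point of the split is that the rest of the connector would only depend on the two interfaces, so each storage flavour can bring its own client and configuration.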

I really like how you have implemented the "filter chain". It might be possible to implement something along the same lines, with an "fs" configuration for the different storage flavours.
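
As a rough illustration of what I mean by an "fs" configuration, the sketch below picks a storage backend from the connector properties. The property names (`fs.type`, `fs.<type>.*`) and the `FsConfigSketch` and `Storage` types are made up for this example and are not existing connector settings.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Function;

class FsConfigSketch {

    /** Hypothetical handle for a storage backend built from connector properties. */
    public interface Storage {
        String describe();
    }

    // One factory per storage flavour, keyed by the value of the assumed "fs.type" property.
    private final Map<String, Function<Map<String, String>, Storage>> factories = new HashMap<>();

    public void register(String type, Function<Map<String, String>, Storage> factory) {
        factories.put(type, factory);
    }

    /** Builds the backend named by "fs.type", passing it only its own "fs.<type>." properties. */
    public Storage create(Map<String, String> props) {
        String type = props.get("fs.type");
        Function<Map<String, String>, Storage> factory = factories.get(type);
        if (factory == null) {
            throw new IllegalArgumentException("Unsupported fs.type: " + type);
        }
        String prefix = "fs." + type + ".";
        Map<String, String> scoped = new HashMap<>();
        for (Map.Entry<String, String> entry : props.entrySet()) {
            if (entry.getKey().startsWith(prefix)) {
                // Strip the prefix so the backend sees plain keys such as "bucket".
                scoped.put(entry.getKey().substring(prefix.length()), entry.getValue());
            }
        }
        return factory.apply(scoped);
    }
}
```

With something like this, a connector config could set `fs.type=s3` and `fs.s3.bucket=incoming`, and settings for other flavours would simply be ignored by the S3 backend.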

I would be happy to contribute.

What do you think?
