Description
Provide the ability to be able to support streaming files from a remote storage AWS, S3, etc. After using the file-pulse connector, I am now having requirements to be able ingest data from files on remote storage as focus is moving to cloud storage. More systems are dumping files into Azure blob storage and AWS S3 buckets, and this trend will only grow.
To support the different favours of remote storage, each supported remote storage I think would need a version of the FSDirectoryWalker and FileInputReader, as the configuration requirements would be different for each. Also I think some of the internal interfaces would need to change too.
I really like how to have implemented the "filter chain", if could be possible to implement something along the lines where there is "fs" configuration for the different storage favours.
I would be happy to contribute.
What do you think?
Activity