Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request: Add google cloud storage source #7501

Open
brianpham opened this issue May 18, 2021 · 8 comments
Open

Request: Add google cloud storage source #7501

brianpham opened this issue May 18, 2021 · 8 comments
Labels
provider: gcp Anything `gcp` service provider related source: new A request for a new source type: feature A value-adding code addition that introduce new functionality.

Comments

@brianpham
Copy link

brianpham commented May 18, 2021

As a replacement for logstash, we will need vector to support google cloud storage as a source similar to AWS S3 (https://vector.dev/docs/reference/configuration/sources/aws_s3/)

We do something similar in logstash

input { 
  google_cloud_storage {    
    bucket_id => "my_log_bucket"    
    file_matches =>  ".*\.log"    
    tags => ["server"]    
    codec => "json"  
  }
}
@brianpham brianpham added the type: feature A value-adding code addition that introduce new functionality. label May 18, 2021
@brianpham brianpham changed the title Add support for google cloud storage as a source Request: Add google cloud storage source May 18, 2021
@jszwedko jszwedko added the provider: gcp Anything `gcp` service provider related label May 19, 2021
@jszwedko
Copy link
Member

@jszwedko jszwedko added the source: new A request for a new source label Dec 28, 2022
@maxdialpad
Copy link

Is there anything on the roadmap for this source?

@jszwedko
Copy link
Member

jszwedko commented Mar 17, 2023

Not yet, but we have been experimenting with OpenDAL, which was recently used to add a WebHDFS sink, and does have support for GCS. It could be an avenue to experiment with if anyone wants to take a shot at this.

@swgillespie
Copy link

@jszwedko Would you accept a PR that implements this in roughly the same way that the aws_s3 source is implemented, i.e. via event notifications in a PubSub topic?

@jszwedko
Copy link
Member

@jszwedko Would you accept a PR that implements this in roughly the same way that the aws_s3 source is implemented, i.e. via event notifications in a PubSub topic?

Hey! Yes, I think that would make sense as the initial implementation to match the behavior of the aws_s3 source.

@Xuanwo
Copy link
Contributor

Xuanwo commented Oct 6, 2024

Not yet, but we have been experimenting with OpenDAL, which was recently used to add a WebHDFS sink, and does have support for GCS. It could be an avenue to experiment with if anyone wants to take a shot at this.

Hi, @jszwedko. I'm willing to help implement the GCS source, but I might not have time to complete the full documentation. Do you think it's a good idea to start the implementation first? For example, all content under src/sinks/webhdfs but not website/**/webhdfs.

@jszwedko
Copy link
Member

jszwedko commented Oct 7, 2024

Not yet, but we have been experimenting with OpenDAL, which was recently used to add a WebHDFS sink, and does have support for GCS. It could be an avenue to experiment with if anyone wants to take a shot at this.

Hi, @jszwedko. I'm willing to help implement the GCS source, but I might not have time to complete the full documentation. Do you think it's a good idea to start the implementation first? For example, all content under src/sinks/webhdfs but not website/**/webhdfs.

Hey! That'd be great! I think starting with the implementation makes sense. We can help with the docs if you get stuck.

@Xuanwo
Copy link
Contributor

Xuanwo commented Oct 7, 2024

Hey! That'd be great! I think starting with the implementation makes sense. We can help with the docs if you get stuck.

That's really appreciated. I will find some time next week to get started.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
provider: gcp Anything `gcp` service provider related source: new A request for a new source type: feature A value-adding code addition that introduce new functionality.
Projects
None yet
Development

No branches or pull requests

5 participants