-
Notifications
You must be signed in to change notification settings - Fork 2.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Hudi DeltaStreamer
compatible feature
#8724
Comments
KafkaConnect is cool! Deltastreamer also supports the distributed-file-system ingestion. Using this, e.g. we can ingest the raw AVRO/CSV/JSON data in S3 to Hudi. |
We use Flink for the ingestion. |
Yes, Flink is great but still we need to write some code for ingestion, right..? Hudi Deltastreamer is a kind of NoCode solution and I think it will make it easier to ingest data. |
Yes, you need to write Flink the job's code. If your goal is just a simple dump, then it could be an overkill, but if you need to do any transformation, you can do it in your code easily |
This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible. |
This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' |
Feature Request / Improvement
Hi! Currently, we are evaluating Iceberg & Hudi and both tools are great and provide similar features.
One thing we noticed that Hudi Deltastreamer makes it easy to ingest data and it would be great if Iceberg support similar feature.
Thank you!
Query engine
None
The text was updated successfully, but these errors were encountered: