-
Notifications
You must be signed in to change notification settings - Fork 9.1k
HADOOP-13126 Add BrotliCodec based on Brotli4j library #2723
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: trunk
Are you sure you want to change the base?
HADOOP-13126 Add BrotliCodec based on Brotli4j library #2723
Conversation
💔 -1 overall
This message was automatically generated. |
93bcb41
to
47f0593
Compare
💔 -1 overall
This message was automatically generated. |
Signed-off-by: Martin Tzvetanov Grigorov <mgrigorov@apache.org>
💔 -1 overall
This message was automatically generated. |
What is the policy about using
|
Colleagues, I've taken the source code from this commit 47f0593 made a jar of it, plugged it into my Spark cluster, launched a huge job with many transformations and actions, and found that there is a serious memory leak: executors consume RAM more and more (no matter that there is a limitation of 20GB, they consumed 40GB). I've made my own version of Brotli codec (also based on brotli4j) by looking at how Snappy and others are made, and it works with no memory leaks. Soon I'll post my PR. |
@ibobak If the change is small you can also tell me what to change and I can update this PR. |
Update is big. I am now testing my version of the codec in my organzation, until I am sure that it works fine and without memory leaks, I won't post a PR. I need a little bit more time. |
What is pending more in this PR? I can try to help and get this merged. Brotli4j maintainer here ^_^ |
@hyperxpro feel free to edit this PR or open a new one based on it. |
Adds BrotliCodec - a compression codec based on Google Brotli
This PR is a continuation on the work done by @rdblue at https://issues.apache.org/jira/browse/HADOOP-13126
In his patches it was based on jbrotli library but this library is not maintained since few years. My PR uses Brotli4j