-
Notifications
You must be signed in to change notification settings - Fork 4.8k
HIVE-26954: Upgrade avro to 1.11.1 #4012
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
00d3314
to
b264473
Compare
@akshat0395, one question: Shouldn't we need to be in-sync with Hadoop (3.3.1 used in hive) on avro version even though we are already ahead. Because difference in file format version might cause problem. Because I think the hadoop jars will be loaded first in classpath which will already contain hadoop avro version (1.7.7)? There is some discussion on same for this on PR 3878 (HIVE-26829). |
|
Kudos, SonarCloud Quality Gate passed! |
++ @ayushtkn |
I am not sure what blocks us from moving forward, we have already diverged from hadoop, Hadoop is at 1.7.7 And we are at 1.8.2 already. So, if anything has to break due to hadoop it should have been broken by now, but we haven't got hold of anything till now, the upgrade was done on branch-3 and branch-2 as well(HIVE-19662), so divergence b/w hadoop and hive avro version is already there. HIVE-24324 was done to pave way for future upgrades. The tests are green. @akshat0395 can you deploy and try some tests on an actual cluster, if things work. If we don't find any conclusive evidence like. "This breaks due to Avro upgrade", I think we can move forward. @zabetak / @abstractdog any thoughts around this? |
@ayushtkn I am not aware of the history but I agree with what you are saying. If there are no proofs that it breaks something then I am fine proceeding with the upgrade. |
Thanks @ayushtkn, @zabetak for the input. I've done some further testing and analysis for the same.
I've performed some basic Insert, Modify, and aggregation query on the current version, post that I've performed the same set of queries with the upgraded version and both seems to work without any issues. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
@akshat0395 Are there (enough) tests already in Hive reading and writing from Avro files/tables? |
@akshat0395 Do we have (should we add) in the repo tests that are reading from Avro files written with old writers? |
There are notable number of q test present in Hive specific to avro that covers various read/write scenarios. Following is the list of q test:
I think this cover alot of ground, Please let me know your thoughts in case we need to add more specific cases @zabetak |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Will merge soon.
…axena, Stamatis Zampetakis, Chris Nauroth) Co-authored-by: Raghav Aggarwal <raghavaggarwal03.ra@gmail.com> Closes apache#3878 Closes apache#4012
…axena, Stamatis Zampetakis, Chris Nauroth) Co-authored-by: Raghav Aggarwal <raghavaggarwal03.ra@gmail.com> Closes apache#3878 Closes apache#4012
…axena, Stamatis Zampetakis, Chris Nauroth) Co-authored-by: Raghav Aggarwal <raghavaggarwal03.ra@gmail.com> Closes apache#3878 Closes apache#4012
…axena, Stamatis Zampetakis, Chris Nauroth) Co-authored-by: Raghav Aggarwal <raghavaggarwal03.ra@gmail.com> Closes apache#3878 Closes apache#4012
What changes were proposed in this pull request?
Upgrade Avro to 1.11.1
HIVE-26954
Why are the changes needed?
Keeping file formats to latest version
Does this PR introduce any user-facing change?
No
How was this patch tested?
Build + Unit tests