Skip to content

Conversation

@Aggarwal-Raghav
Copy link
Contributor

@Aggarwal-Raghav Aggarwal-Raghav commented Nov 13, 2025

What changes were proposed in this pull request?

TEZ-4661 Include commons-collections3.x in hive-exec jar

Why are the changes needed?

In Tez-1.0.0-SNAPSHOT, hadoop has been upgraded to 3.4.2 and hadoop dependencies in tez project has stopped shipping commons-collections-3.x. But hive still depends on commons-collections3.x directly as well as thorugh third-party dependency like opencsv, commons-beanutils etc. It's better to shade until hive moves completely to commons-collection-4.x

Does this PR introduce any user-facing change?

No

How was this patch tested?

On local setup

@Aggarwal-Raghav
Copy link
Contributor Author

Aggarwal-Raghav commented Nov 13, 2025

Error Stacktrace:
Screenshot 2025-11-13 at 9 40 36 PM

@Aggarwal-Raghav
Copy link
Contributor Author

Aggarwal-Raghav commented Nov 13, 2025

In order to migrate from commons-collections3.x to 4.x in hive, few prerequisites are there:

  1. Migrate hadoop to 3.4.2
  2. Migrate opencsv to 5.12.0
  3. commons-beanutils 1.x to 2.x migration. NOTE: 2.x is not offically released, only milestone version is present https://issues.apache.org/jira/browse/BEANUTILS-532?focusedCommentId=17908246&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-17908246
  4. Need to figure out what to do about atlas-intg dependecy as it also brings commons-collections-3.x
  5. Please check HIVE-28486: Upgrade commons-collections to commons-collections4 due t… #5588 review comments for more details

@sonarqubecloud
Copy link

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants