Skip to content

Conversation

@bobbai00
Copy link
Contributor

@bobbai00 bobbai00 commented Sep 23, 2025

Summary

This PR pins the R arrow package to version 14.0.2.1 in the computing-unit-master Dockerfile to ensure compatibility with R operators.

Problem

The R arrow package was not pinned to a specific version, causing the latest version (21.0.0) to be installed. This version is incompatible with Texera's R operators, causing them to fail at runtime.

Solution

  • In the dockerfile, Pin arrow package to version 14.0.2.1 using remotes::install_version()
  • Add upgrade='never' parameter to prevent automatic upgrades

Fixes #3760

- Split R package installation into two steps
- Install coro and dplyr packages first
- Pin arrow package to version 14.0.2.1 using remotes::install_version
- Add upgrade='never' to prevent automatic upgrades
- Include verification of installed arrow version

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
@chenlica
Copy link
Contributor

@aglinxinyuan @mengw15 Please review this PR.

@chenlica
Copy link
Contributor

I added @aglinxinyuan to the reviewer list as you are familiar with the overall structure of pinning libraries.

Copy link
Contributor

@aglinxinyuan aglinxinyuan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@aglinxinyuan aglinxinyuan added the dependencies Pull requests that update a dependency file label Sep 23, 2025
@bobbai00
Copy link
Contributor Author

@chenlica @aglinxinyuan Arrow 14.0.2.1 make the R operators work. However, 14.0.2.1 seems too old compared to arrow's latest version (21.0.0). Can I let @mengw15 to investigate why 21.0.0 doesn't work for texera's R operators, and find a more recent version of R arrow that supports R UDF operators?

Copy link
Contributor

@mengw15 mengw15 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@chenlica
Copy link
Contributor

@chenlica @aglinxinyuan Arrow 14.0.2.1 make the R operators work. However, 14.0.2.1 seems too old compared to arrow's latest version (21.0.0). Can I let @mengw15 to investigate why 21.0.0 doesn't work for texera's R operators, and find a more recent version of R arrow that supports R UDF operators?

Sure. Is this investigation blocking?

@bobbai00
Copy link
Contributor Author

@chenlica @aglinxinyuan Arrow 14.0.2.1 make the R operators work. However, 14.0.2.1 seems too old compared to arrow's latest version (21.0.0). Can I let @mengw15 to investigate why 21.0.0 doesn't work for texera's R operators, and find a more recent version of R arrow that supports R UDF operators?

Sure. Is this investigation blocking?

It is not blocking the release.

@aglinxinyuan aglinxinyuan merged commit 75c69ed into apache:main Sep 30, 2025
10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dependencies Pull requests that update a dependency file

Projects

None yet

Development

Successfully merging this pull request may close these issues.

R operators fail with arrow package version 21.0.0

4 participants