You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[SPARK-48710][PYTHON][3.5] Limit NumPy version to supported range (>=1.15,<2)
### What changes were proposed in this pull request?
* Add a constraint for `numpy<2` to the PySpark package
### Why are the changes needed?
PySpark references some code which was removed with NumPy 2.0. Thus, if `numpy>=2` is installed, executing PySpark may fail.
#47083 updates the `master` branch to be compatible with NumPy 2. This PR adds a version bound for older releases, where it won't be applied.
### Does this PR introduce _any_ user-facing change?
NumPy will be limited to `numpy<2` when installing `pypspark` with extras `ml`, `mllib`, `sql`, `pandas_on_spark` or `connect`.
### How was this patch tested?
Via existing CI jobs.
### Was this patch authored or co-authored using generative AI tooling?
No.
Closes#47175 from codesorcery/SPARK-48710-numpy-upper-bound.
Authored-by: Patrick Marx <6949483+codesorcery@users.noreply.github.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
0 commit comments