Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-32017][PYTHON][FOLLOW-UP] Rename HADOOP_VERSION to PYSPARK_HADOOP_VERSION in pip installation option #31028

Closed
wants to merge 1 commit into from

Conversation

HyukjinKwon
Copy link
Member

What changes were proposed in this pull request?

This PR is a followup of #29703.
It renames HADOOP_VERSION environment variable to PYSPARK_HADOOP_VERSION in case HADOOP_VERSION is already being used somewhere. Arguably HADOOP_VERSION is a pretty common name. I see here and there:

Why are the changes needed?

To avoid the environment variables is unexpectedly conflicted.

Does this PR introduce any user-facing change?

It renames the environment variable but it's not released yet.

How was this patch tested?

Existing unittests will test.

@HyukjinKwon
Copy link
Member Author

cc @ueshin, @dongjoon-hyun, @mengxr, @viirya FYI. Thanks for pointing this out @mengxr.

python/setup.py Show resolved Hide resolved
Copy link
Member

@viirya viirya left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yea, I think PYSPARK_HADOOP_VERSION is a better name.

@HyukjinKwon
Copy link
Member Author

Thanks @cloud-fan and @viirya !

@HyukjinKwon
Copy link
Member Author

Allow me to rush and merge. I am going to cut the RC now. I will monitor the builds and make sure nothing is broken.

@HyukjinKwon
Copy link
Member Author

Merged to master and branch-3.1.

HyukjinKwon added a commit that referenced this pull request Jan 5, 2021
…OOP_VERSION in pip installation option

### What changes were proposed in this pull request?

This PR is a followup of #29703.
It renames `HADOOP_VERSION` environment variable to `PYSPARK_HADOOP_VERSION` in case `HADOOP_VERSION` is already being used somewhere. Arguably `HADOOP_VERSION` is a pretty common name. I see here and there:
- https://www.ibm.com/support/knowledgecenter/SSZUMP_7.2.1/install_grid_sym/understanding_advanced_edition.html
- https://cwiki.apache.org/confluence/display/ARROW/HDFS+Filesystem+Support
- http://crs4.github.io/pydoop/_pydoop1/installation.html

### Why are the changes needed?

To avoid the environment variables is unexpectedly conflicted.

### Does this PR introduce _any_ user-facing change?

It renames the environment variable but it's not released yet.

### How was this patch tested?

Existing unittests will test.

Closes #31028 from HyukjinKwon/SPARK-32017-followup.

Authored-by: HyukjinKwon <gurwls223@apache.org>
Signed-off-by: HyukjinKwon <gurwls223@apache.org>
(cherry picked from commit 329850c)
Signed-off-by: HyukjinKwon <gurwls223@apache.org>
@SparkQA
Copy link

SparkQA commented Jan 5, 2021

Test build #133663 has finished for PR 31028 at commit e764a7a.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dongjoon-hyun
Copy link
Member

+1, late LGTM.

@HyukjinKwon HyukjinKwon deleted the SPARK-32017-followup branch January 4, 2022 00:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants