Skip to content

[SPARK-13478][YARN] Use real user when fetching delegation tokens. #16665

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 1 commit into from

Conversation

vanzin
Copy link
Contributor

@vanzin vanzin commented Jan 20, 2017

The Hive client library is not smart enough to notice that the current
user is a proxy user; so when using a proxy user, it fails to fetch
delegation tokens from the metastore because of a missing kerberos
TGT for the current user.

To fix it, just run the code that fetches the delegation token as the
real logged in user.

Tested on a kerberos cluster both submitting normally and with a proxy
user; Hive and HBase tokens are retrieved correctly in both cases.

Author: Marcelo Vanzin vanzin@cloudera.com

Closes #11358 from vanzin/SPARK-13478.

(cherry picked from commit c7fccb5)

The Hive client library is not smart enough to notice that the current
user is a proxy user; so when using a proxy user, it fails to fetch
delegation tokens from the metastore because of a missing kerberos
TGT for the current user.

To fix it, just run the code that fetches the delegation token as the
real logged in user.

Tested on a kerberos cluster both submitting normally and with a proxy
user; Hive and HBase tokens are retrieved correctly in both cases.

Author: Marcelo Vanzin <vanzin@cloudera.com>

Closes apache#11358 from vanzin/SPARK-13478.

(cherry picked from commit c7fccb5)
@vanzin
Copy link
Contributor Author

vanzin commented Jan 20, 2017

There was a minor conflict but otherwise a trivial backport. Just running tests to make sure.

@SparkQA
Copy link

SparkQA commented Jan 21, 2017

Test build #71747 has finished for PR 16665 at commit e847ab0.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@vanzin
Copy link
Contributor Author

vanzin commented Jan 21, 2017

seems unrelated but... retest this please

@SparkQA
Copy link

SparkQA commented Jan 21, 2017

Test build #71757 has finished for PR 16665 at commit e847ab0.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@vanzin
Copy link
Contributor Author

vanzin commented Jan 21, 2017

retest this please

@SparkQA
Copy link

SparkQA commented Jan 21, 2017

Test build #71762 has finished for PR 16665 at commit e847ab0.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@vanzin
Copy link
Contributor Author

vanzin commented Jan 21, 2017

retest this please

@SparkQA
Copy link

SparkQA commented Jan 22, 2017

Test build #71775 has finished for PR 16665 at commit e847ab0.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@vanzin
Copy link
Contributor Author

vanzin commented Jan 22, 2017

Yay. Merging to 1.6.

@vanzin vanzin closed this Jan 22, 2017
asfgit pushed a commit that referenced this pull request Jan 22, 2017
The Hive client library is not smart enough to notice that the current
user is a proxy user; so when using a proxy user, it fails to fetch
delegation tokens from the metastore because of a missing kerberos
TGT for the current user.

To fix it, just run the code that fetches the delegation token as the
real logged in user.

Tested on a kerberos cluster both submitting normally and with a proxy
user; Hive and HBase tokens are retrieved correctly in both cases.

Author: Marcelo Vanzin <vanzincloudera.com>

Closes #11358 from vanzin/SPARK-13478.

(cherry picked from commit c7fccb5)

Author: Marcelo Vanzin <vanzin@cloudera.com>

Closes #16665 from vanzin/SPARK-13478_1.6.
zzcclp pushed a commit to zzcclp/spark that referenced this pull request Jan 22, 2017
The Hive client library is not smart enough to notice that the current
user is a proxy user; so when using a proxy user, it fails to fetch
delegation tokens from the metastore because of a missing kerberos
TGT for the current user.

To fix it, just run the code that fetches the delegation token as the
real logged in user.

Tested on a kerberos cluster both submitting normally and with a proxy
user; Hive and HBase tokens are retrieved correctly in both cases.

Author: Marcelo Vanzin <vanzincloudera.com>

Closes apache#11358 from vanzin/SPARK-13478.

(cherry picked from commit c7fccb5)

Author: Marcelo Vanzin <vanzin@cloudera.com>

Closes apache#16665 from vanzin/SPARK-13478_1.6.

(cherry picked from commit e78138a)
@vanzin vanzin deleted the SPARK-13478_1.6 branch January 23, 2017 18:44
mgummelt pushed a commit to d2iq-archive/spark that referenced this pull request Mar 7, 2017
The Hive client library is not smart enough to notice that the current
user is a proxy user; so when using a proxy user, it fails to fetch
delegation tokens from the metastore because of a missing kerberos
TGT for the current user.

To fix it, just run the code that fetches the delegation token as the
real logged in user.

Tested on a kerberos cluster both submitting normally and with a proxy
user; Hive and HBase tokens are retrieved correctly in both cases.

Author: Marcelo Vanzin <vanzincloudera.com>

Closes apache#11358 from vanzin/SPARK-13478.

(cherry picked from commit c7fccb5)

Author: Marcelo Vanzin <vanzin@cloudera.com>

Closes apache#16665 from vanzin/SPARK-13478_1.6.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants