
HADOOP-18402. S3A committer NPE in spark job abort #4735

Merged

Conversation

steveloughran (Contributor) commented Aug 11, 2022

jobId.toString() is now only called when the ID isn't null.

This doesn't surface in MR, but Spark seems to manage it.
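A minimal sketch of the guard pattern (the class and method names here are placeholders, not the actual hadoop-aws classes touched by this patch):

```java
// Illustrative sketch only: guard the toString() calls so a null ID from an
// incomplete Spark job/task context can't trigger an NPE.
import org.apache.hadoop.mapreduce.JobID;
import org.apache.hadoop.mapreduce.TaskAttemptID;

public final class AuditIdSketch {

  private AuditIdSketch() {
  }

  /** Job ID as text, or a fallback when the ID is missing. */
  static String jobIdText(JobID jobId) {
    return jobId != null ? jobId.toString() : "unknown";
  }

  /** Task attempt ID as text, with the same null guard. */
  static String taskIdText(TaskAttemptID taskAttemptId) {
    return taskAttemptId != null ? taskAttemptId.toString() : "unknown";
  }
}
```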

How was this patch tested?

Through my downstream runs of the Spark integration tests.

For code changes:

  • Does the title of this PR start with the corresponding JIRA issue id (e.g. 'HADOOP-17799. Your PR title ...')?
  • Object storage: have the integration tests been executed and the endpoint declared according to the connector-specific documentation?
  • If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under ASF 2.0?
  • If applicable, have you updated the LICENSE, LICENSE-binary, NOTICE-binary files?

jobId.toString() to only be called when the ID isn't null.

this doesn't surface in MR, but spark seems to manage it

Change-Id: I06692ef30a4af510c660d7222292932a8d4b5147
steveloughran (Contributor, Author)

tests in progress

steveloughran (Contributor, Author)

Tested against S3 London with -Dscale.

steveloughran (Contributor, Author)

This is not anything anyone has shipped yet.

@hadoop-yetus

💔 -1 overall

Vote Subsystem Runtime Logfile Comment
+0 🆗 reexec 0m 42s Docker mode activated.
_ Prechecks _
+1 💚 dupname 0m 0s No case conflicting files found.
+0 🆗 codespell 0m 0s codespell was not available.
+0 🆗 detsecrets 0m 0s detect-secrets was not available.
+1 💚 @author 0m 0s The patch does not contain any @author tags.
-1 ❌ test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
_ trunk Compile Tests _
+1 💚 mvninstall 38m 17s trunk passed
+1 💚 compile 1m 2s trunk passed with JDK Private Build-11.0.15+10-Ubuntu-0ubuntu0.20.04.1
+1 💚 compile 0m 55s trunk passed with JDK Private Build-1.8.0_312-8u312-b07-0ubuntu1~20.04-b07
+1 💚 checkstyle 0m 52s trunk passed
+1 💚 mvnsite 1m 1s trunk passed
+1 💚 javadoc 0m 49s trunk passed with JDK Private Build-11.0.15+10-Ubuntu-0ubuntu0.20.04.1
+1 💚 javadoc 0m 50s trunk passed with JDK Private Build-1.8.0_312-8u312-b07-0ubuntu1~20.04-b07
+1 💚 spotbugs 1m 34s trunk passed
+1 💚 shadedclient 21m 6s branch has no errors when building and testing our client artifacts.
_ Patch Compile Tests _
+1 💚 mvninstall 0m 40s the patch passed
+1 💚 compile 0m 43s the patch passed with JDK Private Build-11.0.15+10-Ubuntu-0ubuntu0.20.04.1
+1 💚 javac 0m 43s the patch passed
+1 💚 compile 0m 37s the patch passed with JDK Private Build-1.8.0_312-8u312-b07-0ubuntu1~20.04-b07
+1 💚 javac 0m 37s the patch passed
+1 💚 blanks 0m 0s The patch has no blanks issues.
+1 💚 checkstyle 0m 28s the patch passed
+1 💚 mvnsite 0m 41s the patch passed
+1 💚 javadoc 0m 24s the patch passed with JDK Private Build-11.0.15+10-Ubuntu-0ubuntu0.20.04.1
+1 💚 javadoc 0m 32s the patch passed with JDK Private Build-1.8.0_312-8u312-b07-0ubuntu1~20.04-b07
+1 💚 spotbugs 1m 15s the patch passed
+1 💚 shadedclient 20m 20s patch has no errors when building and testing our client artifacts.
_ Other Tests _
+1 💚 unit 2m 43s hadoop-aws in the patch passed.
+1 💚 asflicense 0m 52s The patch does not generate ASF License warnings.
97m 47s
Subsystem Report/Notes
Docker ClientAPI=1.41 ServerAPI=1.41 base: https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-4735/1/artifact/out/Dockerfile
GITHUB PR #4735
Optional Tests dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient spotbugs checkstyle codespell detsecrets
uname Linux 49009a046f35 4.15.0-58-generic #64-Ubuntu SMP Tue Aug 6 11:12:41 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool maven
Personality dev-support/bin/hadoop.sh
git revision trunk / b937b0a
Default Java Private Build-1.8.0_312-8u312-b07-0ubuntu1~20.04-b07
Multi-JDK versions /usr/lib/jvm/java-11-openjdk-amd64:Private Build-11.0.15+10-Ubuntu-0ubuntu0.20.04.1 /usr/lib/jvm/java-8-openjdk-amd64:Private Build-1.8.0_312-8u312-b07-0ubuntu1~20.04-b07
Test Results https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-4735/1/testReport/
Max. process+thread count 623 (vs. ulimit of 5500)
modules C: hadoop-tools/hadoop-aws U: hadoop-tools/hadoop-aws
Console output https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-4735/1/console
versions git=2.25.1 maven=3.6.3 spotbugs=4.2.2
Powered by Apache Yetus 0.14.0 https://yetus.apache.org

This message was automatically generated.

dongjoon-hyun (Member)

Thank you for pinging me, @steveloughran.

sunchao (Member) left a comment


LGTM

steveloughran merged commit ad83e95 into apache:trunk Aug 15, 2022
asfgit pushed a commit that referenced this pull request Aug 15, 2022
(managed to commit through the github ui before I'd got the message done)

This reverts commit ad83e95.
asfgit pushed a commit that referenced this pull request Aug 15, 2022
JobID.toString() and TaskID.toString() to only be called
when the IDs are not null.

This doesn't surface in MapReduce, but Spark SQL can trigger
it in job abort, where it may invoke abortJob() with an
incomplete TaskContext.

This patch MUST be applied to branches containing
HADOOP-17833. "Improve Magic Committer Performance."

Contributed by Steve Loughran.
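For context, the failure mode described in the commit message above is roughly the following (a hypothetical illustration of an unguarded call, not the actual committer code path):

```java
// Hypothetical pre-patch shape of the problem: an unguarded toString() on an
// ID obtained from a context that Spark SQL may leave incomplete during abort.
import org.apache.hadoop.mapreduce.JobContext;

final class UnguardedAuditSketch {

  static String jobIdText(JobContext context) {
    // If getJobID() returns null (incomplete context during job abort),
    // this line throws a NullPointerException.
    return context.getJobID().toString();
  }
}
```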
asfgit pushed a commit that referenced this pull request Aug 15, 2022
JobID.toString() and TaskID.toString() to only be called
when the IDs are not null.

This doesn't surface in MapReduce, but Spark SQL can trigger
it in job abort, where it may invoke abortJob() with an
incomplete TaskContext.

This patch MUST be applied to branches containing
HADOOP-17833. "Improve Magic Committer Performance."

Contributed by Steve Loughran.
HarshitGupta11 pushed a commit to HarshitGupta11/hadoop that referenced this pull request Nov 28, 2022
jobId.toString() to only be called when the ID isn't null.

this doesn't surface in MR, but spark seems to manage it

Change-Id: I06692ef30a4af510c660d7222292932a8d4b5147
HarshitGupta11 pushed a commit to HarshitGupta11/hadoop that referenced this pull request Nov 28, 2022
…)"

(managed to commit through the github ui before I'd got the message done)

This reverts commit ad83e95.
HarshitGupta11 pushed a commit to HarshitGupta11/hadoop that referenced this pull request Nov 28, 2022
JobID.toString() and TaskID.toString() to only be called
when the IDs are not null.

This doesn't surface in MapReduce, but Spark SQL can trigger
it in job abort, where it may invoke abortJob() with an
incomplete TaskContext.

This patch MUST be applied to branches containing
HADOOP-17833. "Improve Magic Committer Performance."

Contributed by Steve Loughran.