-
Notifications
You must be signed in to change notification settings - Fork 28.6k
[SPARK-31484][Core][Flollowup] Use taskAttemptId in checkpoint filename #28289
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Test build #121606 has finished for PR 28289 at commit
|
retest this please |
@@ -199,8 +199,7 @@ private[spark] object ReliableCheckpointRDD extends Logging { | |||
|
|||
val finalOutputName = ReliableCheckpointRDD.checkpointFileName(ctx.partitionId()) | |||
val finalOutputPath = new Path(outputDir, finalOutputName) | |||
val tempOutputPath = new Path(outputDir, | |||
s".$finalOutputName-attempt-${ctx.stageAttemptNumber()}-${ctx.attemptNumber()}") | |||
val tempOutputPath = new Path(outputDir, s".$finalOutputName-attempt-${ctx.taskAttemptId()}") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ya. It looks simpler.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
BTW, do we need -attempt-
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
private[spark] class TaskDescription(
val taskId: Long,
val attemptNumber: Int,
...
I believe taskAttempId
is some kind of historic name in TaskContext
.
Test build #121612 has finished for PR 28289 at commit
|
Merged to master. |
Thanks! |
What changes were proposed in this pull request?
As suggested by #28255 (comment), this patch proposes to use taskAttemptId in checkpoint filename, instead of stageAttemptNumber + attemptNumber.
Why are the changes needed?
To simplify checkpoint simplified and unique.
Does this PR introduce any user-facing change?
No
How was this patch tested?
Existing tests.