
Conversation

@SaintBacchus
Contributor

Spark initializes the properties in `CoarseGrainedSchedulerBackend.start`:

```scala
    // TODO (prashant) send conf instead of properties
    driverEndpoint = rpcEnv.setupEndpoint(
      CoarseGrainedSchedulerBackend.ENDPOINT_NAME, new DriverEndpoint(rpcEnv, properties))
```

Then the YARN logic sets some configuration afterwards, but those updates are not reflected in this `properties` snapshot, so the `Executor` never receives them.
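
For context, `start()` takes this snapshot by copying the `spark.*` entries of the `SparkConf` into a local buffer exactly once. The sketch below is a simplified rendering of that surrounding code (not verbatim source), to show why later `conf.set(...)` calls are missed:

```scala
  override def start() {
    // Copy all spark.* properties out of the SparkConf exactly once, at start time.
    val properties = new ArrayBuffer[(String, String)]
    for ((key, value) <- scheduler.sc.conf.getAll) {
      if (key.startsWith("spark.")) {
        properties += ((key, value))
      }
    }
    // TODO (prashant) send conf instead of properties
    driverEndpoint = rpcEnv.setupEndpoint(
      CoarseGrainedSchedulerBackend.ENDPOINT_NAME, new DriverEndpoint(rpcEnv, properties))
    // Any conf.set(...) made after this point (e.g. by the YARN client backend)
    // never reaches the DriverEndpoint, and therefore never reaches the executors.
  }
```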

[Jira](https://issues.apache.org/jira/browse/SPARK-8687)

@AmplabJenkins

Merged build triggered.

@AmplabJenkins

Merged build started.

@SparkQA

SparkQA commented Jun 28, 2015

Test build #35931 has started for PR 7066 at commit e4dd9a8.

@SparkQA

SparkQA commented Jun 28, 2015

Test build #35931 has finished for PR 7066 at commit e4dd9a8.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • class DriverEndpoint(override val rpcEnv: RpcEnv, sparkConf: SparkConf)
    • class Module(object):

@AmplabJenkins

Merged build finished. Test PASSed.

Contributor

this is only used in 1 place. I would just inline it in L176.

Contributor

after you move it, I would add a comment that says:

```scala
// Retrieve fresh properties from SparkConf again in case a new property is set
// This is necessary for passing certain properties to executors in YARN (SPARK-8687)
```
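
For context, a sketch of how the inlined retrieval with this comment might read; the exact expression is an assumption, mirroring the existing `spark.*`-prefix filtering in `CoarseGrainedSchedulerBackend`:

```scala
// Retrieve fresh properties from SparkConf again in case a new property is set
// This is necessary for passing certain properties to executors in YARN (SPARK-8687)
val properties = scheduler.sc.conf.getAll.filter { case (k, _) => k.startsWith("spark.") }
```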

@andrewor14
Contributor

@SaintBacchus I looked at this quickly and I think this approach is reasonable. This seems to only affect YARN client mode, where `YarnClientSchedulerBackend` calls `super.start()` before setting `spark.yarn.credentials.file`.

My only concern with this approach is that different executors may get different configurations. It also potentially introduces an unlikely race condition, where executors get registered before we set the property.
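
For reference, the ordering in `YarnClientSchedulerBackend#start` that causes this looked roughly like the following (a simplified sketch with bodies elided, not verbatim source):

```scala
  override def start() {
    // super.start() snapshots the current spark.* properties for executors here...
    super.start()

    client = new Client(args, conf)
    // ...but spark.yarn.credentials.file is only set during application submission,
    // after the snapshot has already been taken.
    appId = client.submitApplication()

    waitForApplication()
    monitorThread = asyncMonitorApplication()
    monitorThread.start()
  }
```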

@andrewor14
Contributor

Actually, have you considered the following alternative? We modify `YarnClientSchedulerBackend#start` to call `super.start()` after we have submitted the application:

```scala
  override def start() {
    ...
    client = new Client(args, conf)
    appId = client.submitApplication()

    // SPARK-8687: Ensure all necessary properties have already been set before
    // we initialize our driver scheduler backend, which serves these properties
    // to the executors
    super.start()

    waitForApplication()
    monitorThread = asyncMonitorApplication()
    monitorThread.start()
  }
```

This guarantees that by the time we call super.start() we have already set all the necessary properties. There are no potential race conditions here and it is straightforward to reason about.

@AmplabJenkins

Merged build triggered.

@AmplabJenkins

Merged build started.

@SparkQA

SparkQA commented Jun 30, 2015

Test build #36078 has started for PR 7066 at commit 1de4f48.

@SaintBacchus changed the title from "[SPARK-8687][YARN]Fix bug: Executor can't fetch the new set configuration" to "[SPARK-8687][YARN]Fix bug: Executor can't fetch the new set configuration in yarn-client" on Jun 30, 2015
@SaintBacchus
Contributor Author

> We modify `YarnClientSchedulerBackend#start` to call `super.start()` after we have submitted the application

@andrewor14 This change is a much better fit for the problem. But if users set configuration like this in other deploy modes, they still have to be cautious about this issue.

@andrewor14
Contributor

In general I think we discourage developers from setting additional variables in the SparkConf after the SparkContext has started. Unfortunately this still happens sometimes, especially in YARN.

I think the reordering is the simpler approach here; otherwise we'll have to worry about executors potentially getting different properties, which is much more confusing.

@SparkQA

SparkQA commented Jun 30, 2015

Test build #36078 has finished for PR 7066 at commit 1de4f48.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins

Merged build finished. Test PASSed.

@andrewor14
Contributor

Merging into master. Thanks @SaintBacchus!

@asfgit closed this in 1b0c8e6 on Jul 2, 2015
asfgit pushed a commit that referenced this pull request on Jul 2, 2015
[SPARK-8687][YARN]Fix bug: Executor can't fetch the new set configuration in yarn-client

Spark initializes the properties in `CoarseGrainedSchedulerBackend.start`:
```scala
    // TODO (prashant) send conf instead of properties
    driverEndpoint = rpcEnv.setupEndpoint(
      CoarseGrainedSchedulerBackend.ENDPOINT_NAME, new DriverEndpoint(rpcEnv, properties))
```
Then the YARN logic sets some configuration afterwards, but those updates are not reflected in this `properties` snapshot, so the `Executor` never receives them.

[Jira](https://issues.apache.org/jira/browse/SPARK-8687)

Author: huangzhaowei <carlmartinmax@gmail.com>

Closes #7066 from SaintBacchus/SPARK-8687 and squashes the following commits:

1de4f48 [huangzhaowei] Ensure all necessary properties have already been set before starting up ExecutorLauncher

(cherry picked from commit 1b0c8e6)
Signed-off-by: Andrew Or <andrew@databricks.com>