[SPARK-45250][core] Support stage level task resource profile for yarn cluster when dynamic allocation disabled #43030

wbo4958 · 2023-09-21T12:02:01Z

What changes were proposed in this pull request?

This PR is a follow-up of #37268 which supports stage level task resource profile for standalone cluster when dynamic allocation disabled. This PR enables stage-level task resource profile for yarn cluster.

Why are the changes needed?

Users who work on spark ML/DL cases running on Yarn would expect stage-level task resource profile feature.

Does this PR introduce any user-facing change?

No

How was this patch tested?

The current tests of #37268 can also cover this PR since both yarn and standalone cluster share the same TaskSchedulerImpl class which implements this feature. Apart from that, modifying the existing test to cover yarn cluster. Apart from that, I also performed some manual tests which have been updated in the comments.

Was this patch authored or co-authored using generative AI tooling?

No

wbo4958 · 2023-09-21T12:09:49Z

Hi @ivoson, I believe this PR is a follow-up of your previous PR #37268 since both yarn and standalone cluster share the same TaskSchedulerImpl. But I still would like to know what's your concern about this PR. Thx.

tgravescs · 2023-09-26T13:51:23Z

Please update description to say what this PR does. Specifically this does introduce user facing changes because there is now a new feature available to users of yarn.

Was this tested on a real cluster, what all tests were run other then unit tests?

You also need to update the documentation: docs/configuration.md, docs/running-on-yarn.md

core/src/main/scala/org/apache/spark/resource/ResourceProfileManager.scala

core/src/test/scala/org/apache/spark/resource/ResourceProfileManagerSuite.scala

wbo4958 · 2023-09-26T23:39:41Z

Put it in draft until Kubernetes support is available. @tgravescs thx for the review.

ivoson · 2023-09-27T03:17:32Z

Hi @ivoson, I believe this PR is a follow-up of your previous PR #37268 since both yarn and standalone cluster share the same TaskSchedulerImpl. But I still would like to know what's your concern about this PR. Thx.

Thanks @wbo4958 for ping me and work on this. No concerns about adding the support for yarn cluster. Please feel free to go ahead.

wbo4958 · 2023-09-27T07:49:57Z

Manual tests

Due to the challenges of conducting yarn application tests within Spark unit tests, I took the initiative to manually perform several tests on our internal Yarn cluster.

With dynamic allocation disabled.

spark-shell --master yarn --num-executors=1 --executor-cores=4 --conf spark.task.cpus=1 \
   --conf spark.dynamicAllocation.enabled=false

The above command requires 1 executor with 4 CPU cores, and the default task.cpus = 1, so the default tasks parallelism is 4 at a time.

task.cores=1

Test code:

import org.apache.spark.resource.{ResourceProfileBuilder, TaskResourceRequests}

val rdd = sc.range(0, 100, 1, 4)
var rdd1 = rdd.repartition(3)

val treqs = new TaskResourceRequests().cpus(1)
val rp = new ResourceProfileBuilder().require(treqs).build

rdd1 = rdd1.withResources(rp)
rdd1.collect()

When the required task.cpus=1, executor.cores=4 (No executor resource specified, use the default one), there will be 4 tasks running for rp.

The entire Spark application consists of a single Spark job that will be divided into two stages. The first shuffle stage comprises four tasks, all of which will be executed simultaneously.

And the second ResultStage comprises 3 tasks, and all of which will be executed simultaneously since the required task.cpus is 1.

task.cores=2

Test code,

import org.apache.spark.resource.{ResourceProfileBuilder, TaskResourceRequests}

val rdd = sc.range(0, 100, 1, 4)
var rdd1 = rdd.repartition(3)

val treqs = new TaskResourceRequests().cpus(2)
val rp = new ResourceProfileBuilder().require(treqs).build

rdd1 = rdd1.withResources(rp)
rdd1.collect()

When the required task.cpus=2, executor.cores=4 (No executor resource specified, use the default one), there will be 2 tasks running for rp.

The first shuffle stage behaves the same as the first one.

The second ResultStage comprises 3 tasks, so the first 2 tasks will be running at a time, and then execute the last task.

task.cores=3

Test code,

import org.apache.spark.resource.{ResourceProfileBuilder, TaskResourceRequests}

val rdd = sc.range(0, 100, 1, 4)
var rdd1 = rdd.repartition(3)

val treqs = new TaskResourceRequests().cpus(3)
val rp = new ResourceProfileBuilder().require(treqs).build

rdd1 = rdd1.withResources(rp)
rdd1.collect()

When the required task.cpus=3, executor.cores=4 (No executor resource specified, use the default one), there will be 1 task running for rp.

The first shuffle stage behaves the same as the first one.

The second ResultStage comprises 3 tasks, all of which will be running serially.

task.cores=5

import org.apache.spark.resource.{ResourceProfileBuilder, TaskResourceRequests}

val rdd = sc.range(0, 100, 1, 4)
var rdd1 = rdd.repartition(3)
val treqs = new TaskResourceRequests().cpus(5)
val rp = new ResourceProfileBuilder().require(treqs).build

rdd1 = rdd1.withResources(rp)

exception happened.

scala> rdd1 = rdd1.withResources(rp)
org.apache.spark.SparkException: The number of cores per executor (=4) has to be >= the number of cpus per task = 5.
  at org.apache.spark.resource.ResourceUtils$.validateTaskCpusLargeEnough(ResourceUtils.scala:412)
  at org.apache.spark.resource.ResourceProfile.calculateTasksAndLimitingResource(ResourceProfile.scala:182)
  at org.apache.spark.resource.ResourceProfile.$anonfun$limitingResource$1(ResourceProfile.scala:152)
  at scala.Option.getOrElse(Option.scala:189)
  at org.apache.spark.resource.ResourceProfile.limitingResource(ResourceProfile.scala:151)
  at org.apache.spark.resource.ResourceProfileManager.addResourceProfile(ResourceProfileManager.scala:141)
  at org.apache.spark.rdd.RDD.withResources(RDD.scala:1829)
  ... 50 elided

scala>

wbo4958 · 2023-09-27T08:11:01Z

With dynamic allocation enabled.

spark-shell --master yarn --num-executors=1 --executor-cores=4 --conf spark.task.cpus=1 \
  --conf spark.dynamicAllocation.enabled=true --conf spark.dynamicAllocation.maxExecutors=1\

The above command enables the dynamic allocation and the max executors required is set to 1 in order to test.

TaskResourceProfile without any specific executor request information

Test code,

import org.apache.spark.resource.{ResourceProfileBuilder, TaskResourceRequests}

val rdd = sc.range(0, 100, 1, 4)
var rdd1 = rdd.repartition(3)

val treqs = new TaskResourceRequests().cpus(3)
val rp = new ResourceProfileBuilder().require(treqs).build

rdd1 = rdd1.withResources(rp)
rdd1.collect()

The rp refers to the TaskResourceProfile without any specific executor request information, thus the executor information will utilize the default values from Default ResourceProfile (executor.cores=4).

The above code will require an extra executor which will have the same executor.cores/memory as the default ResourceProfile.

Different executor request information

import org.apache.spark.resource.{ExecutorResourceRequests, ResourceProfileBuilder, TaskResourceRequests}

val rdd = sc.range(0, 100, 1, 4)
var rdd1 = rdd.repartition(3)

val ereqs = new ExecutorResourceRequests().cores(6);
val treqs = new TaskResourceRequests().cpus(5)

val rp = new ResourceProfileBuilder().require(ereqs).require(treqs).build

rdd1 = rdd1.withResources(rp)
rdd1.collect()

wbo4958 · 2023-09-27T08:16:40Z

Hi @tgravescs @ivoson, Would you please help to review it again? Since it's hard to do the Yarn end 2 end tests on spark unit tests. I did some manual tests, please see the above comments.

BTW, this PR only supports Yarn since I'm not familiar with k8s for now. I will put up another PR for k8s. Thx for the understanding.

tgravescs · 2023-09-27T19:48:27Z

You still need to update the documentation like I mentioned here: #43030 (comment)

Also need to look at the build failure, doesn't look like this code so maybe something with setup in your repo.

tgravescs · 2023-09-28T14:26:58Z

overall code looks good, we need to figure out why tests aren't running/passing.

wbo4958 · 2023-09-28T14:30:58Z

Thx @tgravescs, I reran the build/tests, but they still kept failing. Let me rerun them again.

core/src/main/scala/org/apache/spark/resource/ResourceProfileManager.scala

tgravescs · 2023-09-29T13:39:36Z

@HyukjinKwon would you have any idea on the build failure? Looks like env is just not setup properly but I've never seen that before.

wbo4958 · 2023-10-02T04:51:43Z

Hi @tgravescs, Finally, I rebased this PR and forced-updated my PR, now the CI got passed. Thx

mridulm · 2023-10-03T04:02:15Z

Merged to master.
Thanks for working on this @wbo4958 !
Thanks for the reviews @tgravescs and @ivoson :-)

tgravescs · 2023-10-04T13:02:27Z

thanks @mridulm any throught/objections to pulling this back into 3.5 line?
Seems fairly low risk improvement

mridulm · 2023-10-04T13:16:54Z

I did not backport it given it was an improvement, but don't have objections as such - as you said, it is low risk

…n cluster when dynamic allocation disabled ### What changes were proposed in this pull request? This PR is a follow-up of #37268 which supports stage level task resource profile for standalone cluster when dynamic allocation disabled. This PR enables stage-level task resource profile for yarn cluster. ### Why are the changes needed? Users who work on spark ML/DL cases running on Yarn would expect stage-level task resource profile feature. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? The current tests of #37268 can also cover this PR since both yarn and standalone cluster share the same TaskSchedulerImpl class which implements this feature. Apart from that, modifying the existing test to cover yarn cluster. Apart from that, I also performed some manual tests which have been updated in the comments. ### Was this patch authored or co-authored using generative AI tooling? No Closes #43030 from wbo4958/yarn-task-resoure-profile. Authored-by: Bobby Wang <wbo4958@gmail.com> Signed-off-by: Mridul Muralidharan <mridul<at>gmail.com> (cherry picked from commit 5b80639) Signed-off-by: Thomas Graves <tgraves@apache.org>

tgravescs · 2023-10-05T14:25:10Z

thanks, I merged back into 3.5 branch (3.5.1) as well.

github-actions bot added the CORE label Sep 21, 2023

tgravescs reviewed Sep 26, 2023

View reviewed changes

wbo4958 marked this pull request as draft September 26, 2023 23:35

wbo4958 marked this pull request as ready for review September 27, 2023 07:50

github-actions bot added the DOCS label Sep 28, 2023

mridulm reviewed Sep 28, 2023

View reviewed changes

core/src/main/scala/org/apache/spark/resource/ResourceProfileManager.scala Outdated Show resolved Hide resolved

mridulm approved these changes Sep 29, 2023

View reviewed changes

wbo4958 force-pushed the yarn-task-resoure-profile branch from 5e01eae to 4f68533 Compare October 1, 2023 00:23

wbo4958 added 4 commits October 2, 2023 06:09

Yarn: supports task resource profile

76642f8

Resolve comments

64e4a4b

update document

155a622

remove variable and inline the check.

f5a105f

wbo4958 force-pushed the yarn-task-resoure-profile branch from 4f68533 to f5a105f Compare October 1, 2023 22:10

tgravescs approved these changes Oct 2, 2023

View reviewed changes

mridulm closed this in 5b80639 Oct 3, 2023

wbo4958 mentioned this pull request Oct 11, 2023

[SPARK-45495][core] Support stage level task resource profile for k8s cluster when dynamic allocation disabled #43323

Closed

wbo4958 deleted the yarn-task-resoure-profile branch January 10, 2024 06:25

[SPARK-45250][core] Support stage level task resource profile for yarn cluster when dynamic allocation disabled #43030

[SPARK-45250][core] Support stage level task resource profile for yarn cluster when dynamic allocation disabled #43030

Uh oh!

Conversation

wbo4958 commented Sep 21, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

wbo4958 commented Sep 21, 2023

Uh oh!

tgravescs commented Sep 26, 2023

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wbo4958 commented Sep 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ivoson commented Sep 27, 2023

Uh oh!

wbo4958 commented Sep 27, 2023

Manual tests

With dynamic allocation disabled.

Uh oh!

wbo4958 commented Sep 27, 2023

With dynamic allocation enabled.

TaskResourceProfile without any specific executor request information

Different executor request information

Uh oh!

wbo4958 commented Sep 27, 2023

Uh oh!

tgravescs commented Sep 27, 2023

Uh oh!

tgravescs commented Sep 28, 2023

Uh oh!

wbo4958 commented Sep 28, 2023

Uh oh!

Uh oh!

tgravescs commented Sep 29, 2023

Uh oh!

wbo4958 commented Oct 2, 2023

Uh oh!

mridulm commented Oct 3, 2023

Uh oh!

tgravescs commented Oct 4, 2023

Uh oh!

mridulm commented Oct 4, 2023

Uh oh!

tgravescs commented Oct 5, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

wbo4958 commented Sep 21, 2023 •

edited

Loading

wbo4958 commented Sep 26, 2023 •

edited

Loading