[SPARK-18750][yarn] Avoid using "mapValues" when allocating containers. #16667

vanzin · 2017-01-21T01:13:25Z

That method is prone to stack overflows when the input map is really
large; instead, use plain "map". Also includes a unit test that was
tested and caused stack overflows without the fix.

That method is prone to stack overflows when the input map is really large; instead, use plain "map". Also includes a unit tests that was tested and caused stack overflows without the fix.

SparkQA · 2017-01-21T01:27:01Z

Test build #71751 has finished for PR 16667 at commit 5d33e3e.

This patch fails to build.
This patch merges cleanly.
This patch adds no public classes.

vanzin · 2017-01-21T03:50:05Z

Argh, api not available in old hadoop... fix coming.

SparkQA · 2017-01-21T04:39:52Z

Test build #71756 has finished for PR 16667 at commit 16a99fc.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

vanzin · 2017-01-22T00:28:05Z

@srowen @tgravescs

srowen

Looks fine. I'm surprised there is a difference but take your word for it if it fixes it. It should be functionally equivalent anyway.

tgravescs

+1. Just a couple small nits.

tgravescs · 2017-01-23T14:33:01Z

...nagers/yarn/src/test/scala/org/apache/spark/deploy/yarn/LocalityPlacementStrategySuite.scala

+import org.apache.hadoop.fs.CommonConfigurationKeysPublic
+import org.apache.hadoop.net.DNSToSwitchMapping
+import org.apache.hadoop.yarn.api.records._
+import org.apache.hadoop.yarn.client.api.AMRMClient.ContainerRequest


not used that I see, remove

tgravescs · 2017-01-23T14:41:08Z

...nagers/yarn/src/test/scala/org/apache/spark/deploy/yarn/LocalityPlacementStrategySuite.scala

+      yarnConf, resource)
+
+    val totalTasks = 32 * 1024
+    val totalContainers = totalTasks / 16


any particular reason for the 16 here? I assume its just random selected that shows the issue but perhaps add in a comment.

tgravescs · 2017-01-23T15:07:45Z

...nagers/yarn/src/test/scala/org/apache/spark/deploy/yarn/LocalityPlacementStrategySuite.scala

+    val count = containers.size / hosts.size / 2
+
+    val hostToContainerMap = new HashMap[String, Set[ContainerId]]()
+    hosts.keys.take(hosts.size / 2).zipWithIndex.foreach { case (host, i) =>


similar wouldn't hurt to have small description here for someone looking at it later

SparkQA · 2017-01-23T18:55:32Z

Test build #71867 has finished for PR 16667 at commit 68c8925.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

tgravescs · 2017-01-25T14:16:26Z

+1

That method is prone to stack overflows when the input map is really large; instead, use plain "map". Also includes a unit test that was tested and caused stack overflows without the fix. Author: Marcelo Vanzin <vanzin@cloudera.com> Closes #16667 from vanzin/SPARK-18750. (cherry picked from commit 76db394) Signed-off-by: Tom Graves <tgraves@yahoo-inc.com>

That method is prone to stack overflows when the input map is really large; instead, use plain "map". Also includes a unit test that was tested and caused stack overflows without the fix. Author: Marcelo Vanzin <vanzin@cloudera.com> Closes #16667 from vanzin/SPARK-18750. (cherry picked from commit 76db394) Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>

That method is prone to stack overflows when the input map is really large; instead, use plain "map". Also includes a unit test that was tested and caused stack overflows without the fix. Author: Marcelo Vanzin <vanzin@cloudera.com> Closes apache#16667 from vanzin/SPARK-18750.

…ainers. apache#16667 without adding LocalityPlacementStrategySuite.scala

That method is prone to stack overflows when the input map is really large; instead, use plain "map". Also includes a unit test that was tested and caused stack overflows without the fix. Author: Marcelo Vanzin <vanzin@cloudera.com> Closes apache#16667 from vanzin/SPARK-18750.

[SPARK-18750][yarn] Avoid using "mapValues" when allocating containers.

5d33e3e

That method is prone to stack overflows when the input map is really large; instead, use plain "map". Also includes a unit tests that was tested and caused stack overflows without the fix.

Mock ContainerId to avoid cross-hadoop-version issues.

16a99fc

srowen approved these changes Jan 22, 2017

View reviewed changes

tgravescs reviewed Jan 23, 2017

View reviewed changes

Feedback.

68c8925

asfgit closed this in 76db394 Jan 25, 2017

vanzin deleted the SPARK-18750 branch January 27, 2017 00:57

zzcclp added a commit to zzcclp/spark that referenced this pull request Feb 7, 2017

[EXT][SPARK-18750][yarn] Avoid using "mapValues" when allocating cont…

3c6ba79

…ainers. apache#16667 without adding LocalityPlacementStrategySuite.scala

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-18750][yarn] Avoid using "mapValues" when allocating containers. #16667

[SPARK-18750][yarn] Avoid using "mapValues" when allocating containers. #16667

Uh oh!

vanzin commented Jan 21, 2017 •

edited

Loading

Uh oh!

SparkQA commented Jan 21, 2017

Uh oh!

vanzin commented Jan 21, 2017

Uh oh!

SparkQA commented Jan 21, 2017

Uh oh!

vanzin commented Jan 22, 2017

Uh oh!

srowen left a comment

Uh oh!

tgravescs left a comment

Uh oh!

tgravescs Jan 23, 2017

Uh oh!

tgravescs Jan 23, 2017

Uh oh!

tgravescs Jan 23, 2017

Uh oh!

SparkQA commented Jan 23, 2017

Uh oh!

tgravescs commented Jan 25, 2017

Uh oh!

Uh oh!

[SPARK-18750][yarn] Avoid using "mapValues" when allocating containers. #16667

[SPARK-18750][yarn] Avoid using "mapValues" when allocating containers. #16667

Uh oh!

Conversation

vanzin commented Jan 21, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SparkQA commented Jan 21, 2017

Uh oh!

vanzin commented Jan 21, 2017

Uh oh!

SparkQA commented Jan 21, 2017

Uh oh!

vanzin commented Jan 22, 2017

Uh oh!

srowen left a comment

Choose a reason for hiding this comment

Uh oh!

tgravescs left a comment

Choose a reason for hiding this comment

Uh oh!

tgravescs Jan 23, 2017

Choose a reason for hiding this comment

Uh oh!

tgravescs Jan 23, 2017

Choose a reason for hiding this comment

Uh oh!

tgravescs Jan 23, 2017

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Jan 23, 2017

Uh oh!

tgravescs commented Jan 25, 2017

Uh oh!

Uh oh!

vanzin commented Jan 21, 2017 •

edited

Loading