
[SPARK-6193] [EC2] Push group filter up to EC2 #4922

Closed
nchammas wants to merge 3 commits into apache:master from nchammas:get-existing-cluster-faster

Conversation

@nchammas (Contributor) commented on Mar 6, 2015

When looking for a cluster, spark-ec2 currently pulls down info for all instances and filters locally. When working on an AWS account with hundreds of active instances, this step alone can take over 10 seconds.

This PR improves how spark-ec2 searches for clusters by pushing the filter up to EC2.

Basically, the problem (and solution) look like this:

>>> timeit.timeit('blah = conn.get_all_reservations()', setup='from __main__ import conn', number=10)
116.96390509605408
>>> timeit.timeit('blah = conn.get_all_reservations(filters={"instance.group-name": ["my-cluster-master"]})', setup='from __main__ import conn', number=10)
4.629754066467285

Translated to a user-visible action, this looks like (against an AWS account with ~200 active instances):

# master
$ python -m timeit -n 3 --setup 'import subprocess' 'subprocess.call("./spark-ec2 get-master my-cluster --region us-west-2", shell=True)'
...
3 loops, best of 3: 9.83 sec per loop

# this PR
$ python -m timeit -n 3 --setup 'import subprocess' 'subprocess.call("./spark-ec2 get-master my-cluster --region us-west-2", shell=True)'
...
3 loops, best of 3: 1.47 sec per loop

This PR also refactors get_existing_cluster() to make it, I hope, simpler.
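
For context, here is a minimal sketch of the kind of lookup helper described above, assuming boto 2 (the EC2 library spark-ec2 uses); the function name get_instances, the connection setup, and the region are illustrative rather than the PR's actual code:

import itertools

import boto.ec2

conn = boto.ec2.connect_to_region("us-west-2")

def get_instances(group_names):
    # Ask EC2 to filter by security group name server-side, rather than
    # pulling down every reservation in the account and filtering locally.
    reservations = conn.get_all_reservations(
        filters={"instance.group-name": group_names})
    instances = itertools.chain.from_iterable(r.instances for r in reservations)
    return [i for i in instances if i.state != "terminated"]

# spark-ec2 names a cluster's security groups "<cluster>-master" and "<cluster>-slaves".
master_instances = get_instances(["my-cluster-master"])
slave_instances = get_instances(["my-cluster-slaves"])

The key point is that the group-name filter is evaluated by EC2 itself, so only the matching reservations are returned over the wire.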

Finally, this PR fixes some minor grammar issues related to printing status to the user. 🎩 👏

@SparkQA commented on Mar 6, 2015

Test build #28321 has started for PR 4922 at commit f2a5b9f.

  • This patch merges cleanly.

@nchammas (Contributor, Author) commented on Mar 6, 2015

cc @shivaram

@SparkQA commented on Mar 6, 2015

Test build #28321 has finished for PR 4922 at commit f2a5b9f.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28321/

reservations = conn.get_all_reservations(
    filters={"instance.group-name": group_names})
instances = itertools.chain.from_iterable(r.instances for r in reservations)
return [i for i in instances if i.state != "terminated"]
shivaram (Contributor) commented:

Minor comment: this seems a little different from the old check we had. We were originally checking if it was one of 'pending', 'running', 'stopping', 'stopped', while right now we check if it's not terminated. There are other states, as shown in http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-instance-lifecycle.html, but I don't think it should matter.

nchammas (Contributor, Author) commented:

The doc referenced in the comment notes these instance states:

0 : pending
16 : running
32 : shutting-down
48 : terminated
64 : stopping
80 : stopped

So it looks like the only state we're missing is shutting-down, which is the intermediate state right before terminated. I can add that in to be consistent with the previous behavior.

Alternatively, we can leave it as-is and re-terminate instances even if they are shutting-down. You know, zombies and stuff. 👹 🔫
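
Concretely, the two checks under discussion look roughly like this (a sketch of the idea, not the exact diff):

# Check in this PR: keep anything that is not fully terminated,
# which also picks up instances that are still shutting-down.
live = [i for i in instances if i.state != "terminated"]

# Previous behavior: an explicit allowlist of states, which excludes
# both shutting-down and terminated.
live = [i for i in instances
        if i.state in ("pending", "running", "stopping", "stopped")]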

shivaram (Contributor) commented:

Yeah, it doesn't look like shutting-down is a major issue, but it might just be safer to keep the existing behavior.

nchammas (Contributor, Author) commented:

OK, fixified.

@shivaram (Contributor) commented on Mar 6, 2015

This is a nice change @nchammas -- Code looks good to me, but I'd just like to try it out once on my machine.

@shivaram (Contributor) commented on Mar 6, 2015

Just tried this out and it works fine. LGTM pending the minor inline comment.

@SparkQA commented on Mar 6, 2015

Test build #28353 has started for PR 4922 at commit 18802f1.

  • This patch merges cleanly.

@SparkQA commented on Mar 6, 2015

Test build #28353 has finished for PR 4922 at commit 18802f1.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28353/

@nchammas (Contributor, Author) commented on Mar 7, 2015

Inline comment addressed.

@srowen (Member) commented on Mar 8, 2015

Sounds like a good improvement, changes look OK to my mildly informed eyes, and you have both reviewed and tested the change. LGTM.

asfgit closed this in 52ed7da on Mar 8, 2015
nchammas deleted the get-existing-cluster-faster branch on March 8, 2015 at 23:30