
Switch EC2 example config to use AWS deep learning AMI + latest Ray wheel #1331

Merged (2 commits) on Dec 17, 2017

Conversation

@ericl (Contributor) commented Dec 16, 2017

What do these changes do?

This is the fix-up PR for #1311

raise Exception(
"No subnets found, try manually creating an instance in "
"your specified region to populate the list of subnets "
"and trying this again.")
@ericl (Contributor, Author):

I'm not sure why the subnets list is empty until you do this; I ran into this problem when trying to use the script in a region I hadn't used before (us-west-2).
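For context, here is a minimal sketch of the guard the snippet above implements, assuming `subnets` is a list of dicts shaped like the output of an EC2 DescribeSubnets call. The function name and the `MapPublicIpOnLaunch` preference are illustrative, not the actual autoscaler code:

```python
def pick_subnet(subnets):
    """Return a usable subnet, or raise if the region has none.

    `subnets` is assumed to be a list of dicts in the shape of EC2
    DescribeSubnets output (illustrative sketch, not the autoscaler code).
    """
    if not subnets:
        # Same error as in the PR: a fresh region may report no subnets
        # until an instance has been launched there once.
        raise Exception(
            "No subnets found, try manually creating an instance in "
            "your specified region to populate the list of subnets "
            "and trying this again.")
    # Prefer subnets that assign public IPs, so SSH works out of the box.
    usable = [s for s in subnets if s.get("MapPublicIpOnLaunch")]
    return (usable or subnets)[0]
```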

@AmplabJenkins

Merged build finished. Test PASSed.

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/2821/

@@ -58,15 +58,13 @@ file_mounts: {

# List of shell commands to run to initialize the head node.
head_init_commands:
- cd ~/ray; git remote add eric https://github.com/ericl/ray.git || true
- cd ~/ray; git fetch eric && git reset --hard e1e97b3
- sudo pip3 install -U https://s3-us-west-2.amazonaws.com/ray-wheels/f5ea44338eca392df3a868035df3901829cc2ca1/ray-0.3.0-cp35-cp35m-manylinux1_x86_64.whl
A Collaborator commented:
Would it work if we get rid of sudo and add in --user?

@robertnishihara (Collaborator):

Thanks @ericl this looks good.

Separately, what is the best way for developers to use this? E.g., should developers make a new AMI with Ray installed via python setup.py develop and then change the config to use that AMI?

Also, for development, would it make sense to pre-populate a workers.txt file on the head node? And the start_worker.sh/stop_worker.sh/upgrade.sh scripts?

@robertnishihara (Collaborator) left a review:

I left one comment. Feel free to address it or merge anyway.

@ericl (Contributor, Author) commented Dec 17, 2017

@robertnishihara made that change; it seems to work.

> Separately, what is the best way for developers to use this? E.g., should developers make a new AMI with Ray installed via python setup.py develop and then change the config to use that AMI?

Yeah, I snapshotted a new AMI that had my repo pre-cloned.

> Also, for development, would it make sense to pre-populate a workers.txt file on the head node? And the start_worker.sh/stop_worker.sh/upgrade.sh scripts?

What I've been doing is adding a git checkout <my_sha> to the init commands and then running ray create_or_update, which updates the nodes. When I need to force-restart the nodes, I add a dummy init command (e.g. "echo foo") that triggers a Ray restart on all nodes.
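That workflow can be sketched as a config fragment in the style of the diff above. The SHA placeholder and the remote name are illustrative, not values from this PR:

```yaml
# Hypothetical development config sketch (SHA and remote are placeholders),
# following the workflow described above.
head_init_commands:
    - cd ~/ray; git fetch origin && git checkout <my_sha>
    # Changing any init command, even a no-op like this one, makes
    # ray create_or_update restart Ray on all nodes.
    - echo foo
```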

@AmplabJenkins

Merged build finished. Test PASSed.

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/2823/

@robertnishihara (Collaborator):

It's got to be unrelated to this PR, but this is the first time I've seen this error:

======================================================================
FAIL: testCleanupOnDriverExitManyRedisShards (__main__.MonitorTest)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "test/monitor_test.py", line 88, in testCleanupOnDriverExitManyRedisShards
    self._testCleanupOnDriverExit(num_redis_shards=5)
  File "test/monitor_test.py", line 79, in _testCleanupOnDriverExit
    self.assertEqual((0, 1), StateSummary()[:2])
AssertionError: Tuples differ: (0, 1) != (4, 3)

First differing element 0:
0
4

- (0, 1)
+ (4, 3)

cc @concretevitamin

@robertnishihara (Collaborator):

Filed an issue at #1332.
