Skip to content

Commit

Permalink
[Serve] Rescale Serve's Long Running Test to Cluster Mode (ray-projec…
Browse files Browse the repository at this point in the history
…t#13247)

Now that `HeadOnly` becomes the new default HTTP location, we can
re-enable the long running tests to use local multi-clusters.
(also fixed the controller's API to match up to date, we should
have caught these, I will open issues for this.)
  • Loading branch information
simon-mo authored Jan 7, 2021
1 parent bcaff63 commit 01dcb99
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 5 deletions.
2 changes: 1 addition & 1 deletion release/long_running_tests/workloads/serve.py
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@
num_redis_shards = 1
redis_max_memory = 10**8
object_store_memory = 10**8
num_nodes = 1
num_nodes = 4
cluster = Cluster()
for i in range(num_nodes):
cluster.add_node(
Expand Down
7 changes: 3 additions & 4 deletions release/long_running_tests/workloads/serve_failure.py
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@
num_redis_shards = 1
redis_max_memory = 10**8
object_store_memory = 10**8
num_nodes = 1
num_nodes = 4
cpus_per_node = 10
cluster = Cluster()
for i in range(num_nodes):
Expand Down Expand Up @@ -39,10 +39,9 @@ def __init__(self, kill_period_s=1):

def _get_all_serve_actors(self):
controller = self.client._controller
routers = list(ray.get(controller.get_routers.remote()).values())
routers = list(ray.get(controller.get_http_proxies.remote()).values())
all_handles = routers + [controller]
worker_handle_dict = ray.get(
controller.get_all_worker_handles.remote())
worker_handle_dict = ray.get(controller._all_replica_handles.remote())
for _, replica_dict in worker_handle_dict.items():
all_handles.extend(list(replica_dict.values()))

Expand Down

0 comments on commit 01dcb99

Please sign in to comment.