Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create RemoteFunction class, remove FunctionProperties, simplify worker Python code. #2052

Merged
merged 10 commits into from
May 14, 2018
Merged

Create RemoteFunction class, remove FunctionProperties, simplify worker Python code. #2052

merged 10 commits into from
May 14, 2018

Conversation

robertnishihara
Copy link
Collaborator

Related to #614 and #2016. Some initial cleanups.

@richardliaw
Copy link
Contributor

richardliaw commented May 13, 2018

Would it make sense for @ray.method to be renamed as @ray.split?
This would resolve this issue for singletons (and clean up sharded parameter server impls): #1665

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/5365/
Test PASSed.

@robertnishihara
Copy link
Collaborator Author

The reasoning for ray.method was that we may want to add additional arguments into the decorator later.

Your suggestion would definitely clean up certain applications, although it might be confusing for @ray.method(num_return_vals=1) to be different from no decorator.

@robertnishihara
Copy link
Collaborator Author

robertnishihara commented May 14, 2018

Though it sounds like the current behavior is also confusing as described in #1665.

If we do that, then we should probably also change the behavior of @ray.remote(num_return_vals=1).

@AmplabJenkins
Copy link

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/5367/
Test FAILed.

@AmplabJenkins
Copy link

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/5368/
Test FAILed.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/5369/
Test PASSed.

@richardliaw
Copy link
Contributor

I think having a list of return values would be better (whether it be num_return_vals or some better naming)

@robertnishihara
Copy link
Collaborator Author

@richardliaw you're talking about the case where num_return_vals is specified, right? But not the default case, right? That might make sense, I'm curious what others think. That said, I'd prefer not to introduce any API changes in this PR.

@robertnishihara
Copy link
Collaborator Author

@suquark, this PR doesn't fix the issues you bring up in #2016, but some of these refactorings are necessary in order to address those issues.

@richardliaw
Copy link
Contributor

yeah, for the specified case. Subsequent PR would be fine if others also approve.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/5374/
Test PASSed.

Copy link
Contributor

@richardliaw richardliaw left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nits

function_id_hash.update(inspect.getsource(function).encode("ascii"))
# Compute the function ID.
function_id = function_id_hash.digest()
assert len(function_id) == 20
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is 20 a ray constant?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, that's the size of our IDs. I started fixing this, by replacing it with a variable ray._ID_SIZE. However, this touches many many files so I think I'll submit a separate PR for that (the 20 isn't introduced here, it is just moved around from somewhere else).

num_cpus=None,
num_gpus=None,
resources=None):
"""An experimental alternate way to submit remote functions."""
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

given that this is marked experimental, is this used?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's used in Pandas on Ray (to change the number of return values), and I think it will be used more in the future (e.g., to specify different resource requirements at runtime).

@@ -952,8 +978,8 @@ def _wait_for_and_process_task(self, task):
) += 1

reached_max_executions = (self.num_task_executions[task.driver_id().id(
)][function_id.id()] == self.function_properties[task.driver_id().id()]
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

task.driver_id(), task.task_id() and function_id.id() seem to be used multiple times - can consider abstracting into variable to decrease verbosity.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed for task.driver_id(). It didn't obviously make sense to me for the others, but I do want to do a subsequent PR cleaning up some of this stuff (but not super sure what the scope of that is yet).



def remote(*args, **kwargs):
"""This decorator is used to define remote functions and to define actors.
worker = ray.worker.global_worker
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

get_global_worker()?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed, thx

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/5380/
Test PASSed.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/5382/
Test PASSed.

@pcmoritz pcmoritz merged commit 8fbb884 into ray-project:master May 14, 2018
@pcmoritz pcmoritz deleted the workercleanups branch May 14, 2018 21:35
alok added a commit to alok/ray that referenced this pull request May 14, 2018
* master:
  Create RemoteFunction class, remove FunctionProperties, simplify worker Python code. (ray-project#2052)
  Don't crash on duplicate actor notifications (ray-project#2043)
  Fixed attribute name in code example (ray-project#2054)
  [xray] Add Travis build for testing xray on Linux. (ray-project#2047)
  Added missing comma to code example (ray-project#2050)
  Use more CPUs for testMultipleWaitsAndGets. (ray-project#2051)
  use jobid_nil (ray-project#2044)
  Fix typo in tune. (ray-project#2046)
  Fix error in api.rst. (ray-project#2048)
  Improve shared_ptr usage (ray-project#2030)
alok added a commit to alok/ray that referenced this pull request May 15, 2018
* master:
  Create RemoteFunction class, remove FunctionProperties, simplify worker Python code. (ray-project#2052)
  Don't crash on duplicate actor notifications (ray-project#2043)
  Fixed attribute name in code example (ray-project#2054)
  [xray] Add Travis build for testing xray on Linux. (ray-project#2047)
  Added missing comma to code example (ray-project#2050)
  Use more CPUs for testMultipleWaitsAndGets. (ray-project#2051)
  use jobid_nil (ray-project#2044)
  Fix typo in tune. (ray-project#2046)
  Fix error in api.rst. (ray-project#2048)
  Improve shared_ptr usage (ray-project#2030)
alok added a commit to alok/ray that referenced this pull request May 15, 2018
* fix-a3c-torch:
  Fix shape error in conv nets
  Ensure that values are flat list
  rm all use of torch Variables
  Create RemoteFunction class, remove FunctionProperties, simplify worker Python code. (ray-project#2052)
  Don't crash on duplicate actor notifications (ray-project#2043)
  rm unnecessary Variable wrapper
  Fixed attribute name in code example (ray-project#2054)
  [xray] Add Travis build for testing xray on Linux. (ray-project#2047)
  Added missing comma to code example (ray-project#2050)
  Use more CPUs for testMultipleWaitsAndGets. (ray-project#2051)
  use jobid_nil (ray-project#2044)
  Fix typo in tune. (ray-project#2046)
  Fix error in api.rst. (ray-project#2048)
  Improve shared_ptr usage (ray-project#2030)
  replace deprecated function
  Fmt
  Fix shapes of tensors
  Rename argument name to out_size
  Use correct pytorch functions
  Use F.softmax instead of a pointless network layer
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants