-
Notifications
You must be signed in to change notification settings - Fork 154
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor functions in join.py
to accept GeoSeries Input
#948
Refactor functions in join.py
to accept GeoSeries Input
#948
Conversation
Co-authored-by: H. Thomson Comer <thomcom@gmail.com>
…into feature/geoseries_interface/join
…ature/geoseries_interface/join
…into feature/geoseries_interface/join
@cuda.jit | ||
def binarize(in_col, out, width): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Whoah. Shoud we move this stuff to libcuspatial?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does this benefit libcuSpatial? I don't see a usage of this other than converting the result of Point in Polygon to dataframe.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Doing it faster benefits cuSpatial users. Generally Numba and Python add a lot of overhead.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can probably punt to a future optimization. Another question is whether we should change the point in polygon interface, or simply leave this as a cuda kernel bind to python.
This might also have bigger question to do with the PiP interface: currently we use bits to encode the output - this is very efficient in memory but eventually we still need to convert them back to int8 arrays as shown here. If no one needs the memory saving benefit, should we keep it? Perhaps we should change the interface to return boolean array directly?
/merge |
Description
This PR refactors all functions in
join.py
to accept GeoSeries as input.closes #940, closes #939, closes #941, closes #942
Checklist