[FEA] Fuse transform and copy_if operations in quadtree_point_in_polygon #559
…/small-quadtree-pip-perf-improvement
I don't know the whole quadtree PIP algorithm yet, but I can see that the improvement here combines the counting iterator and the transform call into a counting transform iterator, which saves a kernel call.
What GPU are the perf numbers from?
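For readers unfamiliar with the pattern described in the comment above, here is a minimal CPU-side sketch in plain C++. It is an analogy under stated assumptions, not the code from this PR: `square`, `is_even`, `unfused`, and `fused` are hypothetical names, and the real change operates on device data with Thrust. In Thrust terms, the fused form roughly corresponds to handing `thrust::copy_if` a `thrust::make_transform_iterator` wrapped around a `thrust::make_counting_iterator`, so the transformed values are computed on the fly instead of being written to a temporary buffer by a separate `thrust::transform` call.

```cpp
#include <algorithm>
#include <iterator>
#include <vector>

// Hypothetical per-index computation (stand-in for the real functor).
inline int square(int i) { return i * i; }

// Hypothetical predicate applied to the transformed value.
inline bool is_even(int v) { return v % 2 == 0; }

// Unfused: one pass materializes every transformed value into a temporary
// buffer, then a second pass filters it. On a GPU this would be two kernels.
std::vector<int> unfused(int n) {
  std::vector<int> tmp(n);
  int i = 0;
  std::generate(tmp.begin(), tmp.end(), [&i] { return square(i++); });

  std::vector<int> out;
  std::copy_if(tmp.begin(), tmp.end(), std::back_inserter(out), is_even);
  return out;
}

// Fused: the transform is evaluated lazily while filtering, so there is a
// single pass and no temporary buffer.
std::vector<int> fused(int n) {
  std::vector<int> out;
  for (int i = 0; i < n; ++i) {
    int const v = square(i);  // computed on the fly, never stored in bulk
    if (is_even(v)) { out.push_back(v); }
  }
  return out;
}
```

Both functions return the same elements in the same order; the fused version simply avoids the intermediate storage and the extra pass, which is where a saved kernel launch would come from on the device.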
Fusion!
rerun tests
@harrism Those numbers are from running the NYC taxi dataset benchmark on my local Quadro 8k.
rerun tests
I still approve. :)
Once you have A100 benchmark comparison, please add here.
@gpucibot merge
Fuse transform and copy_if operations in quadtree_point_in_polygon for a modest performance improvement.
branch-22.08:
This PR: