Remove CUDA whole compilation ODR violations #16603

robertmaynard · 2024-08-19T19:24:00Z

Description

CUDA whole compilation mode requires that all kernels are only launched from TUs that compile them. Previously libcudf would compile a subset of kernels in separate TUs from where they are launched.
To keep compile times ( and library size ) as low as possible I have introduced a single C++ function call between the original call site and the kernel launch. In testing this neglibile differences on compile time and binary size.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

vyasr

IIUC this is basically just adding an extra level of indirection so that the kernels are only compiled once and hidden beneath a single C++ launch function that can be called in different TUs? If so, LGTM.

robertmaynard · 2024-08-22T19:44:02Z

IIUC this is basically just adding an extra level of indirection so that the kernels are only compiled once and hidden beneath a single C++ launch function that can be called in different TUs? If so, LGTM.

That is correct

robertmaynard · 2024-08-26T14:21:44Z

/merge

robertmaynard added 4 commits August 19, 2024 13:28

compute_mixed_join_output_size no longer has ODR violation

e8ff0d8

mixed_join_semi no longer has ODR violation

1e7cc97

mixed_join no longer has ODR violation

4c34e3a

Correct symbol visibility issues

8e5f0f7

robertmaynard added bug Something isn't working non-breaking Non-breaking change labels Aug 19, 2024

github-actions bot added the libcudf Affects libcudf (C++/CUDA) code. label Aug 19, 2024

Merge branch 'branch-24.10' into bug/remove_cuda_odr_violations

a9cb341

robertmaynard marked this pull request as ready for review August 20, 2024 15:17

robertmaynard requested a review from a team as a code owner August 20, 2024 15:17

robertmaynard requested review from harrism and zpuller August 20, 2024 15:17

harrism approved these changes Aug 21, 2024

View reviewed changes

vyasr approved these changes Aug 22, 2024

View reviewed changes

rapids-bot bot merged commit 96f2cc5 into rapidsai:branch-24.10 Aug 26, 2024
98 checks passed

robertmaynard deleted the bug/remove_cuda_odr_violations branch August 26, 2024 14:22

tgujar mentioned this pull request Aug 27, 2024

Occupancy improvement for Hash table build #15700

Open

3 tasks

abellina mentioned this pull request Aug 30, 2024

[BUG] illegal access error in mixed_join after ODR cleanup PR #16706

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove CUDA whole compilation ODR violations #16603

Remove CUDA whole compilation ODR violations #16603

robertmaynard commented Aug 19, 2024

vyasr left a comment

robertmaynard commented Aug 22, 2024

robertmaynard commented Aug 26, 2024

Remove CUDA whole compilation ODR violations #16603

Remove CUDA whole compilation ODR violations #16603

Conversation

robertmaynard commented Aug 19, 2024

Description

Checklist

vyasr left a comment

Choose a reason for hiding this comment

robertmaynard commented Aug 22, 2024

robertmaynard commented Aug 26, 2024