Workaround for a misaligned access in `read_csv` on some CUDA versions #17477

vuule · 2024-12-02T19:43:31Z

Description

Use a global array instead of a shared memory array in the gather_row_offsets_gpu kernel.

The impact on kernel performance is less than 5%, and this kernel accounts for only a small portion of the total read_csv execution time, so the overall performance impact is negligible.

Also modified the functions that take this array to take a device_span instead of a plain pointer.
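To illustrate the pointer-to-span signature change in general terms, here is a minimal host-only C++ sketch. It is not the libcudf code: `simple_span`, `sum_ptr`, and `sum_span` are hypothetical names made up for this example, and `simple_span` only stands in for a device_span-like non-owning view.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a device_span-like type: a non-owning view that
// carries a pointer together with its element count.
template <typename T>
struct simple_span {
  T* ptr;
  std::size_t len;

  T* data() const { return ptr; }
  std::size_t size() const { return len; }
  T& operator[](std::size_t i) const { return ptr[i]; }
};

// Pointer-based signature: the element count has to be passed separately,
// and nothing ties it to the buffer.
std::uint64_t sum_ptr(std::uint64_t const* values, std::size_t count)
{
  std::uint64_t total = 0;
  for (std::size_t i = 0; i < count; ++i) { total += values[i]; }
  return total;
}

// Span-based signature: the view carries its own size, so the callee can
// iterate (or bounds-check) without an extra parameter.
std::uint64_t sum_span(simple_span<std::uint64_t const> values)
{
  std::uint64_t total = 0;
  for (std::size_t i = 0; i < values.size(); ++i) { total += values[i]; }
  return total;
}
```

A caller would build the view once from an existing buffer, for example `simple_span<std::uint64_t const>{vec.data(), vec.size()}`, and pass it along instead of a pointer and a separate count.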

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

copy-pr-bot · 2024-12-02T19:43:34Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

vuule · 2024-12-02T21:26:12Z

/ok to test

vuule · 2024-12-03T00:28:56Z

/ok to test

vuule · 2024-12-03T18:05:19Z

/merge

use global instead of shared mem for ctxtree

3cb53f5

github-actions bot assigned vuule Dec 2, 2024

github-actions bot added the libcudf label ("Affects libcudf (C++/CUDA) code.") Dec 2, 2024

vuule added the bug ("Something isn't working") and non-breaking ("Non-breaking change") labels Dec 2, 2024

vuule changed the title ~~Workaround for a misaligned access issue on some CUDA versions~~ Workaround for a misaligned access in read_csv on some CUDA versions Dec 2, 2024

revert divceil

f1e6ea7

vuule marked this pull request as ready for review December 3, 2024 17:41

vuule requested a review from a team as a code owner December 3, 2024 17:41

vuule requested review from vyasr and davidwendt December 3, 2024 17:41

bdice approved these changes Dec 3, 2024


davidwendt approved these changes Dec 3, 2024


rapids-bot bot merged commit beb4296 into rapidsai:branch-25.02 Dec 3, 2024
110 checks passed
