Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Workaround for a misaligned access in read_csv on some CUDA versions #17477

Merged

Conversation

vuule
Copy link
Contributor

@vuule vuule commented Dec 2, 2024

Description

Use a global array instead of a shared memory array in the gather_row_offsets_gpu kernel.

Impact on the kernel performance is less than 5%, and this kernel takes very little portion of the total read_csv execution time - impact on the performance is negligible.

Also modified functions that take this array to take a device_span instead on a plain pointer.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

Copy link

copy-pr-bot bot commented Dec 2, 2024

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@github-actions github-actions bot added the libcudf Affects libcudf (C++/CUDA) code. label Dec 2, 2024
@vuule vuule added bug Something isn't working non-breaking Non-breaking change labels Dec 2, 2024
@vuule vuule changed the title Workaround for a misaligned access issue on some CUDA versions Workaround for a misaligned access in read_csv on some CUDA versions Dec 2, 2024
@vuule
Copy link
Contributor Author

vuule commented Dec 2, 2024

/ok to test

@vuule
Copy link
Contributor Author

vuule commented Dec 3, 2024

/ok to test

@vuule vuule marked this pull request as ready for review December 3, 2024 17:41
@vuule vuule requested a review from a team as a code owner December 3, 2024 17:41
@vuule vuule requested review from vyasr and davidwendt December 3, 2024 17:41
@vuule
Copy link
Contributor Author

vuule commented Dec 3, 2024

/merge

@rapids-bot rapids-bot bot merged commit beb4296 into rapidsai:branch-25.02 Dec 3, 2024
110 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working libcudf Affects libcudf (C++/CUDA) code. non-breaking Non-breaking change
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants