Allow more tests to run with CUDA #131

Merged: 2 commits, Dec 9, 2024

Conversation

@blochl (Collaborator) commented Dec 9, 2024

Be more fine-grained about which tests can run with CUDA. Thanks to @halaAwisat for suggesting this and for the testing.

Include ib_send_bw in the CUDA tests, as recent versions of perftest
appear to support it with CUDA.

Signed-off-by: Leonid Bloch <lbloch@nvidia.com>

ib_send_lat can actually work with CUDA, but only with message sizes
larger than 32 bytes (otherwise libmlx5 will try to inline the data into
a WQE, which will cause the CPU to try to memcpy the CUDA buffer). So
where previously all the ib_send_lat tests were skipped with CUDA, now
only the ones with small message sizes are.

Signed-off-by: Leonid Bloch <lbloch@nvidia.com>
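
The size rule described in the second commit can be sketched as a small filter. This is only an illustration of the logic, not the code from this PR: the function names and the constant name are hypothetical, and the cutoff of 32 (assumed to be in bytes) is taken from the commit message above.

```python
from typing import Iterable, List, Tuple

# Assumption: cutoff taken from the commit message ("message sizes only
# higher than 32"); the name and the unit (bytes) are illustrative.
INLINE_CUTOFF_BYTES = 32


def ib_send_lat_runs_with_cuda(msg_size: int) -> bool:
    """Return True if an ib_send_lat run of this size can be kept with CUDA.

    Sizes at or below the cutoff are skipped, because the data may be
    inlined into a WQE, which would make the CPU memcpy the CUDA buffer.
    """
    return msg_size > INLINE_CUTOFF_BYTES


def split_ib_send_lat_sizes(sizes: Iterable[int]) -> Tuple[List[int], List[int]]:
    """Split message sizes into (run with CUDA, skip with CUDA)."""
    run: List[int] = []
    skip: List[int] = []
    for size in sizes:
        (run if ib_send_lat_runs_with_cuda(size) else skip).append(size)
    return run, skip
```

With a hypothetical size list such as `[2, 32, 64, 4096]`, only 64 and 4096 would be kept for the CUDA run; 2 and 32 would still be skipped.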
@blochl blochl merged commit 00e74ec into main Dec 9, 2024
@blochl blochl deleted the lb-dev branch December 9, 2024 13:07