Compile CUDA directly to cubin instead of ptx #587

Merged
kris-rowe merged 1 commit into libocca:development from cuda-jit-to-cubin on May 23, 2022

Conversation

kris-rowe
Member

Description

Since OCCA handles jitting and hashes based on the device architecture, we can compile CUDA code directly to cubin instead of PTX.
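
To illustrate the reasoning, here is a minimal sketch, not OCCA's actual implementation. It assumes a kernel cache keyed on the source text and the target architecture, which is what makes an architecture-specific cubin safe to reuse. The helper names `nvcc_command` and `cache_key`, the file names, and the choice of SHA-256 are hypothetical; only the `nvcc` flags `-ptx`, `-cubin`, and `-arch` are real.

```python
import hashlib


def nvcc_command(source, output, arch, to_cubin=True):
    """Build an nvcc command line (illustrative; not OCCA's real invocation)."""
    # -cubin emits machine code for one specific GPU architecture.
    # -ptx emits the virtual ISA, which the driver still has to JIT at load time.
    mode = "-cubin" if to_cubin else "-ptx"
    return ["nvcc", mode, f"-arch={arch}", source, "-o", output]


def cache_key(source_text, arch):
    """Hypothetical cache key: hash of the target architecture and kernel source."""
    h = hashlib.sha256()
    h.update(arch.encode())
    h.update(b"\0")  # separator so arch and source cannot run together
    h.update(source_text.encode())
    return h.hexdigest()[:16]


if __name__ == "__main__":
    src = "__global__ void k() {}"
    for arch in ("sm_70", "sm_80"):
        out = f"{cache_key(src, arch)}.cubin"
        print(" ".join(nvcc_command("kernel.cu", out, arch)))
```

Because the architecture is part of the key in this sketch, a binary built for one device is never loaded on a different one, so the portability that PTX would otherwise provide is not needed.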

@kris-rowe kris-rowe marked this pull request as ready for review May 20, 2022 22:14

codecov bot commented May 20, 2022

Codecov Report

Merging #587 (e9d4f68) into development (70aa7df) will increase coverage by 0.00%.
The diff coverage is n/a.


@@             Coverage Diff              @@
##           development     #587   +/-   ##
============================================
  Coverage        77.31%   77.32%           
============================================
  Files              264      264           
  Lines            19539    19539           
============================================
+ Hits             15107    15108    +1     
+ Misses            4432     4431    -1     
| Impacted Files | Coverage Δ |
|---|---|
| src/occa/internal/lang/specialMacros.cpp | 60.26% <0.00%> (+0.66%) ⬆️ |

@kris-rowe kris-rowe merged commit c8a6b7d into libocca:development May 23, 2022
@kris-rowe kris-rowe deleted the cuda-jit-to-cubin branch July 12, 2022 14:42