Cuda source cleanup, refactor and fixes #1328
Thanks! We were actually just discussing some cleanup here, so this fits nicely timing-wise!
Thanks for the PR, really appreciate the proactivity in helping us clean things up! I just had a chat with Tim; he reviewed the PR and said everything looks good to him. I see you put the PR as draft: are there any more changes coming, or can we wrap things up?
Hi @Titus-von-Koeller, I am planning to wrap this up in a couple of hours from now.
@Titus-von-Koeller this is ready for review now, thanks.
Looks good! Thanks!
Merged 6bef412 into bitsandbytes-foundation:main
Cuda source cleanup, refactor and fixes (bitsandbytes-foundation#1328)

* remove kcompress
* fix initial template call
* fix function name
* remove vector load
* cleanup reduce & rearrange
* format
This is a draft PR to clean up the CUDA kernels, refactor template assignments, and fix some AOT issues.
The CUDA kernels have room for cleanup: they contain redundant code paths and unused methods.
cc @matthewdouglas, @Titus-von-Koeller
@TimDettmers (pinging for awareness).