KI Sub-groups #668

Open
christiangnrd wants to merge 7 commits into JuliaGPU:main from christiangnrd:subgroups

Conversation

christiangnrd (Member) commented Dec 23, 2025

Includes #682

github-actions Bot (Contributor) commented Dec 23, 2025

Your PR no longer requires formatting changes. Thank you for your contribution!

@christiangnrd christiangnrd mentioned this pull request Jan 2, 2026
@christiangnrd christiangnrd marked this pull request as draft January 3, 2026 19:47
Comment thread test/intrinsics.jl
@christiangnrd christiangnrd force-pushed the subgroups branch 7 times, most recently from daea025 to 6343fd2 Compare January 7, 2026 16:55
codecov Bot commented Jan 7, 2026
Codecov Report

❌ Patch coverage is 57.77778% with 19 lines in your changes missing coverage. Please review.
✅ Project coverage is 52.24%. Comparing base (6c027c7) to head (82d32db).

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| src/pocl/backend.jl | 42.30% | 15 Missing ⚠️ |
| src/intrinsics.jl | 0.00% | 3 Missing ⚠️ |
| src/pocl/compiler/compilation.jl | 90.00% | 1 Missing ⚠️ |
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #668      +/-   ##
==========================================
+ Coverage   52.18%   52.24%   +0.06%     
==========================================
  Files          22       22              
  Lines        1694     1736      +42     
==========================================
+ Hits          884      907      +23     
- Misses        810      829      +19     


@christiangnrd christiangnrd force-pushed the subgroups branch 5 times, most recently from f0a545c to 8858825 Compare February 19, 2026 23:54
@christiangnrd christiangnrd marked this pull request as ready for review February 20, 2026 00:13
@christiangnrd christiangnrd force-pushed the subgroups branch 3 times, most recently from 7ddba7e to 8de2c42 Compare March 24, 2026 10:29
@JuliaGPU JuliaGPU deleted a comment from github-actions Bot Mar 24, 2026
@christiangnrd christiangnrd force-pushed the subgroups branch 2 times, most recently from 409a1e7 to 08a8130 Compare May 28, 2026 22:53
@christiangnrd christiangnrd force-pushed the subgroups branch 5 times, most recently from b1d526f to 2f02082 Compare June 4, 2026 18:00
christiangnrd (Member, Author) commented

This is mostly ready, except that I think algorithm correctness requires us to know a device's subgroup size at compile time. I tried to force a known subgroup size for OpenCL/PoCL, but that seems to have broken on-kernel rand, and I'm not sure how best to approach this. To test that the rest works, I currently work around it in OpenCL.jl by forcing a workgroup size in the KI launch interface so that the non-KI rand kernel doesn't hit the issue, but that's a hack.

I think the ideal solution would be to have get_num_sub_groups always return at least 1 so we can force a sub-group size.
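
To illustrate why a compile-time subgroup size matters for correctness, here is a minimal host-side sketch in Python. It is a simulation only, not KernelAbstractions, OpenCL, or PoCL code: `shuffle_down` and `subgroup_sum` are hypothetical helpers, and the out-of-range behaviour (a lane keeps its own value) is an assumption modelled on common shuffle-down semantics. It shows a tree reduction whose loop bound was fixed for one subgroup size giving wrong sums when the device actually runs a different size.

```python
def shuffle_down(values, offset, width):
    """Simulate a sub-group shuffle-down over a flat list of lanes.

    Each lane reads the value `offset` lanes above it, but only if that
    lane is in the same sub-group of `width` lanes. Otherwise the lane
    keeps its own value (assumed semantics, see lead-in).
    """
    out = list(values)
    for lane in range(len(values)):
        base = (lane // width) * width  # first lane of this sub-group
        src = lane + offset
        if src < base + width and src < len(values):
            out[lane] = values[src]
    return out


def subgroup_sum(values, assumed_size, actual_size):
    """Tree reduction compiled for `assumed_size`, run with `actual_size`.

    Returns the value held by the first lane of each actual sub-group,
    which is where a correct reduction leaves the sub-group total.
    """
    vals = list(values)
    offset = assumed_size // 2  # loop bound baked in at "compile time"
    while offset > 0:
        shifted = shuffle_down(vals, offset, actual_size)
        vals = [a + b for a, b in zip(vals, shifted)]
        offset //= 2
    return [vals[i] for i in range(0, len(vals), actual_size)]


ones = [1] * 8
print(subgroup_sum(ones, 4, 4))  # sizes agree: [4, 4]
print(subgroup_sum(ones, 8, 4))  # compiled for 8, runs as 4: [8, 8]
print(subgroup_sum(ones, 4, 8))  # compiled for 4, runs as 8: [4]
```

Only the first call produces the true per-subgroup totals. In the second, the extra reduction step makes lanes add their own value to themselves; in the third, half of the subgroup is never accumulated. Either mismatch is silent, which is why the size has to be known, or forced, when the kernel is compiled.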

2 participants