-
Notifications
You must be signed in to change notification settings - Fork 378
Pull requests: ROCm/aiter
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
remove kv cache assert for old arch for upstream compatibility
#3942
opened Jun 26, 2026 by
ganyi1996ppo
Contributor
Loading…
1 task
[Triton] Add fused_gemm_a16w16_split_cat
#3940
opened Jun 25, 2026 by
rbrugaro-amd
Contributor
Loading…
Map top-left map to bottom-right for self-attn
#3939
opened Jun 25, 2026 by
Micky774
Contributor
Loading…
1 task
gate custom all-reduce on XGMI topology
#3938
opened Jun 25, 2026 by
skysnow2001
Contributor
Loading…
1 task done
Spatial Attention: XCD-aware spatial workgroup mapping for MHA and GQA (SWIZZLE=1)
#3936
opened Jun 25, 2026 by
mc186
Loading…
[test] test_topk_plain: parametrize sweep to fix collection-time OOM
#3934
opened Jun 25, 2026 by
JohnQinAMD
Contributor
Loading…
1 task
edit aiter_opus_plus.h using opus api instead of asm code
#3932
opened Jun 25, 2026 by
junhaha666
Contributor
Loading…
1 task
fix(quick_all_reduce): make flag sync CUDA-graph-safe
#3928
opened Jun 25, 2026 by
Jasen2201
Contributor
Loading…
change default pa reduce kernel from cxx to flydsl
ci:atom
#3923
opened Jun 25, 2026 by
Bernard-Liu
Contributor
Loading…
[OPUS][ATOM] gfx1250 (wave32/WMMA) NoPE-fp8/RoPE-bf16 sparse prefill (separate module)
#3922
opened Jun 25, 2026 by
carlushuang
Collaborator
Loading…
feat: update A8W8 MLA kernels to global-load ckv variant on gfx950
#3921
opened Jun 25, 2026 by
fangche123
Contributor
Loading…
1 task
[Bugfix] base_tuner: gfx-aware tuned CSV handling and column-safe row…
#3920
opened Jun 25, 2026 by
yzhou103
Contributor
Loading…
1 task
feat(gfx1151): allow gfx1151 in cpp_itfs JIT arch validation
#3919
opened Jun 25, 2026 by
carlushuang
Collaborator
Loading…
feat(gfx1151): add INT8 W8A8 GEMM default config
#3917
opened Jun 25, 2026 by
carlushuang
Collaborator
Loading…
CI: run vLLM DSv4 and MiniMax M3 on MI350X
ci:vllm
#3916
opened Jun 25, 2026 by
gyohuangxin
Member
Loading…
perf(unified_attention): use 8 warps for gfx1151 3D decode
#3915
opened Jun 25, 2026 by
carlushuang
Collaborator
Loading…
flydsl: gfx942 FP8 MQA logits indexer kernel (+ Triton FN/FNUZ fix)
#3913
opened Jun 25, 2026 by
akii96
Contributor
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-06-22.