Commit
Fix bug in single source GEMM with residual + streamk (NVIDIA#1249)
Follow-up to NVIDIA#1224. A change to the stream-k threadblock swizzle constructor since 3.3 breaks single-source GEMM with a fused epilogue and stream-k. The multi-source case was already corrected.

Co-authored-by: Ali Hassani <ahassanijr@gmail.com>