Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Fix incorrect bias stride in matmul cutlass offload (apache#212)
This PR makes the cutlass codegen use the correct bias stride when bias has more than 2 dimensions. For example, if the input bias has shape (1, n, 4096), the original code will set `ldc` to 0, which produces incorrect result. cc @vinx13 @masahi
- Loading branch information