fix(permute.h): incorrect comment in Tensor5DPermute20314 (NVIDIA#637)
* fix(permute.h): incorrect comment in `Tensor5DPermute20314`

* typo in usage in example 39
Enter-tainer authored Sep 22, 2022
1 parent 97bff52 commit 7a458f0
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion examples/39_gemm_permute/gemm_permute.cu
@@ -224,7 +224,7 @@ struct Options {
<< " permute([0, 2, 1, 3]) to be in shape of [B/D1, M, D1, N].\n\n"
<< " 2) This example also profiles the performance of a normal GEMM kernel with output as permuted 5D Tensor."
<< " The GEMM matrix output in shape of [M, N] is reshaped as [M/T1, T1, T2, T3, N/T2/T3] and then permuted"
- << " with permute([2, 0, 3, 1, 4]) to be in shape of [T2, M/T1, T3, T1, N//T2/T3].\n\n"
+ << " with permute([2, 0, 3, 1, 4]) to be in shape of [T2, M/T1, T3, T1, N/T2/T3].\n\n"
<< " Note: D1, T1, T2, T3 are compile-time constants defined in gemm_permute.cu\n\n"
<< "Options:\n\n"
<< " --help If specified, displays this usage statement.\n\n"
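
The usage text in this hunk describes reshaping the [M, N] output to [M/T1, T1, T2, T3, N/T2/T3] and then applying permute([2, 0, 3, 1, 4]). The sketch below works through that index arithmetic, assuming row-major ordering for both the reshape and the permuted tensor. The function name `permuted_offset_20314` is hypothetical and is not the CUTLASS implementation; it only illustrates the mapping the text describes.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not a CUTLASS API). Maps element (row, col) of an
// [M, N] matrix to its linear offset in the permuted 5-D tensor of shape
// [T2, M/T1, T3, T1, N/T2/T3], assuming row-major order throughout.
template <int T1, int T2, int T3>
constexpr std::int64_t permuted_offset_20314(std::int64_t M, std::int64_t N,
                                             std::int64_t row, std::int64_t col) {
  const std::int64_t T0 = M / T1;       // extent of dimension 0 before the permute
  const std::int64_t T4 = N / T2 / T3;  // extent of dimension 4 before the permute

  // Reshape [M, N] -> [M/T1, T1, T2, T3, N/T2/T3]: split row and col.
  const std::int64_t i0 = row / T1;
  const std::int64_t i1 = row % T1;
  const std::int64_t i2 = col / (T3 * T4);
  const std::int64_t i3 = (col / T4) % T3;
  const std::int64_t i4 = col % T4;

  // permute([2, 0, 3, 1, 4]): the output index is (i2, i0, i3, i1, i4).
  return (((i2 * T0 + i0) * T3 + i3) * T1 + i1) * T4 + i4;
}
```

For example, with M = 4, N = 12, T1 = 2, T2 = 2, T3 = 3 (illustrative values only), element (1, 0) lands at offset 2 because T1 sits one dimension above the innermost extent of 2.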
2 changes: 1 addition & 1 deletion include/cutlass/layout/permute.h
@@ -254,7 +254,7 @@ class Tensor4DPermuteBMM0213 {
};

/// Permute layout function for 5-D permuted tensors with output matrix (dimension as [M, N]) reshaped
- /// as [M/T1, T1, T2, T3, N/T3]. Then perform permute([2, 0, 3, 1, 4]) on the corresponding output tensor.
+ /// as [M/T1, T1, T2, T3, N/T2/T3]. Then perform permute([2, 0, 3, 1, 4]) on the corresponding output tensor.
template <int T1, int T2, int T3>
class Tensor5DPermute20314 {
public:
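
A quick way to see why the last dimension in the comment has to be N/T2/T3 rather than N/T3 is to count elements: a reshape must preserve the M * N total. The compile-time check below is a minimal sketch of that argument. The helper `elems5d` and the constants are hypothetical and chosen for illustration; they are not taken from CUTLASS or from gemm_permute.cu.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not a CUTLASS API): element count of a 5-D shape.
constexpr std::int64_t elems5d(std::int64_t d0, std::int64_t d1, std::int64_t d2,
                               std::int64_t d3, std::int64_t d4) {
  return d0 * d1 * d2 * d3 * d4;
}

// Illustrative extents and tile sizes only.
constexpr std::int64_t kM = 64, kN = 96;
constexpr std::int64_t kT1 = 4, kT2 = 2, kT3 = 3;

static_assert(kM % kT1 == 0 && kN % (kT2 * kT3) == 0,
              "M must be divisible by T1 and N by T2 * T3");

// Corrected comment: [M/T1, T1, T2, T3, N/T2/T3] holds exactly M * N elements.
static_assert(elems5d(kM / kT1, kT1, kT2, kT3, kN / kT2 / kT3) == kM * kN,
              "corrected shape preserves the element count");

// Old comment: a last dimension of N/T3 overcounts by a factor of T2.
static_assert(elems5d(kM / kT1, kT1, kT2, kT3, kN / kT3) == kM * kN * kT2,
              "old shape is T2 times too large");
```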
