Commit 5b19b93

[ROCm][Kernel] Using the correct warp_size value
1 parent 75404d0 commit 5b19b93

1 file changed: +2 -2 lines changed

csrc/moe/moe_align_sum_kernels.cu

Lines changed: 2 additions & 2 deletions
@@ -207,8 +207,8 @@ __global__ void sgl_moe_align_block_size_kernel(
   __shared__ int32_t shared_counts[32][8];
   __shared__ int32_t local_offsets[256];
 
-  const int warp_id = threadIdx.x / WARP_SIZE;
-  const int lane_id = threadIdx.x % WARP_SIZE;
+  const int warp_id = threadIdx.x / 32;
+  const int lane_id = threadIdx.x % 32;
   const int experts_per_warp = 8;
   const int my_expert_start = warp_id * experts_per_warp;
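
One plausible reading of the change, which the commit itself does not spell out: shared_counts has 32 rows of 8 experts each, so the kernel appears to expect 32 logical groups of threads. If WARP_SIZE expands to 64 on ROCm, dividing threadIdx.x by it would yield fewer distinct warp_id values than the table has rows. The host-only sketch below mimics the index math for both widths. The helper name expert_starts and the 1024-thread block size used in the note after it are assumptions for illustration and do not appear in the diff.

```cpp
#include <cassert>
#include <set>

// Hypothetical helper (not part of the commit): collects the distinct
// my_expert_start values that a block of `block_threads` threads would
// compute when threads are grouped `group_width` at a time.
std::set<int> expert_starts(int block_threads, int group_width) {
  assert(group_width > 0);
  const int experts_per_warp = 8;  // same constant as in the kernel
  std::set<int> starts;
  for (int tid = 0; tid < block_threads; ++tid) {
    const int warp_id = tid / group_width;      // mirrors threadIdx.x / width
    starts.insert(warp_id * experts_per_warp);  // mirrors my_expert_start
  }
  return starts;
}
```

Under the assumed 1024-thread block, a width of 32 produces 32 distinct starts (0, 8, ..., 248), one per row of shared_counts, while a width of 64 produces only 16 (0, 8, ..., 120), which would leave the upper half of the table unassigned.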
