bugfix: fix the misaligned address bug of norm kernels for certain shapes #636

yzh119 · 2024-11-25T00:06:04Z

This PR fixes the issue #634, which is brought by #592 .
If we want to use 16-bytes vectorized read/write, we need to confirm the address is aligned to 16 bytes.
When num_warps is not a multiple of 4 (4*sizeof(float) = 16), the address of smem + num_warps might not align to 16 bytes.

We can fix this by shifting the start offset of vectorized read/write to smem + ceil_div(num_warps, 4) * 4 to force the alignment.

cc @ovowei @Abatom

…646) Fix smem_size in FusedAddRMSNorm which is missed in #636 Fix issue #645

upd

de9a3c5

yzh119 merged commit db9c48d into main Nov 25, 2024

yzh119 mentioned this pull request Nov 25, 2024

[Bug] fused_add_rmsnorm Fails Due to Misaligned Address #634

Closed

Atream mentioned this pull request Dec 4, 2024

PR 636 miss fix for FusedAddRMSNorm function #645

Closed

Atream added a commit to kvcache-ai/custom_flashinfer that referenced this pull request Dec 4, 2024

fix smem_size in FusedAddRMSNorm which is missed in flashinfer-ai#636

278614b

Atream mentioned this pull request Dec 4, 2024

fix smem_size in FusedAddRMSNorm which is missed in PR #636 #646

Merged

yzh119 pushed a commit that referenced this pull request Dec 4, 2024

bugfix: fix smem_size in FusedAddRMSNorm which is missed in PR #636 (#…

b577710

…646) Fix smem_size in FusedAddRMSNorm which is missed in #636 Fix issue #645

zhyncs deleted the bugfix-634 branch December 12, 2024 06:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

bugfix: fix the misaligned address bug of norm kernels for certain shapes #636

bugfix: fix the misaligned address bug of norm kernels for certain shapes #636

Uh oh!

yzh119 commented Nov 25, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bugfix: fix the misaligned address bug of norm kernels for certain shapes #636

bugfix: fix the misaligned address bug of norm kernels for certain shapes #636

Uh oh!

Conversation

yzh119 commented Nov 25, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants