Skip to content

Conversation

@abenmao
Copy link
Contributor

@abenmao abenmao commented Jul 3, 2025

No description provided.

@abenmao abenmao requested a review from a3213105 July 8, 2025 02:03
@abenmao abenmao force-pushed the perf/e4m3/pack branch 2 times, most recently from f576f44 to 05e32c8 Compare July 8, 2025 05:41
@abenmao abenmao force-pushed the perf/e4m3/pack branch 2 times, most recently from a9c7641 to b4a4193 Compare July 21, 2025 05:12
int amx_cols = (int)((N + 63) / 64) * 64;
if (!weight.isShadow()) weight.Resize(amx_rows, amx_cols);
memset(weight.Data(), 0, sizeof(e4m3_t) * amx_rows * amx_cols);
int blockSize = 32;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rename blockSize to packedBlockSize to avoid potential naming conflicts with blocksize in the future "assert(blockSize == 128);"

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done~

@a3213105 a3213105 self-requested a review July 21, 2025 06:36
@abenmao abenmao merged commit 820dab5 into intel:main Jul 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants