Skip to content

finegrained-fp8: fused moe kernels#530

Draft
IlyasMoutawwakil wants to merge 3 commits into
huggingface:mainfrom
IlyasMoutawwakil:fp8-fused-moe
Draft

finegrained-fp8: fused moe kernels#530
IlyasMoutawwakil wants to merge 3 commits into
huggingface:mainfrom
IlyasMoutawwakil:fp8-fused-moe

Conversation

@IlyasMoutawwakil

@IlyasMoutawwakil IlyasMoutawwakil commented Apr 7, 2026

Copy link
Copy Markdown
Member
moe_tflops_grouped moe_tflops_batched faster and more accurate fused fp8 for both long context (grouped) and decode (batched)

@sayakpaul

Copy link
Copy Markdown
Member

@IlyasMoutawwakil anything blocking merge here?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants