
Conversation

@Elubrazione
  • Add a complete `quantized_matmul_impl_typed` template function for CPU, supporting the float16, float32, and bfloat16 data types.
  • Add float32 test cases for `quantized_matmul`.
  • Relax the float32 tolerance in the test utils.
@skyzh (Owner) left a comment:


Thanks!!
