Skip to content

Perfomance rule for kron. #526

@yebai

Description

@yebai

kron is the most underperforming benchmark function and requires a performance rule.

│                   kron_sum │ 377.0 μs │     39.9 │    12.9 │       220.0 │     7.2 │
│              kron_view_sum │ 364.0 μs │     43.5 │    9.97 │       208.0 │    78.8 │

The following Zygote rule can be adapted for Mooncake

https://github.com/FluxML/Zygote.jl/blob/c0bfc26de90db3778e1f3953f04d489e3413127b/src/lib/array.jl#L358-L374

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions