LLVM pointer range loop / autovectorization regression #35662

The benchmarks in the [matrixmultiply](https://github.com/bluss/matrixmultiply) crate, version 0.1.8 (commit https://github.com/bluss/matrixmultiply/commit/3d83647e7a4366e4483d711326f0f5cc07a61090), degrade with MIR enabled.

Tested using `rustc 1.12.0-nightly (1deb02ea6 2016-08-12)`.

Typical output (the default MIR build is roughly 4x slower than the one with `-Z orbit=off`):

```
// -C target-cpu=native
test mat_mul_f32::m127             ... bench:   2,703,773 ns/iter (+/- 636,432)
// -Z orbit=off -C target-cpu=native
test mat_mul_f32::m127             ... bench:     648,817 ns/iter (+/- 22,379)
```

Sure, the matrix multiplication kernel uses some [major muckery](https://github.com/bluss/matrixmultiply/blob/3d83647e7a4366e4483d711326f0f5cc07a61090/src/sgemm_kernel.rs#L65-L92) that it expects the compiler to optimize down and autovectorize, but since it technically is a regression, it gets a report.
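For readers who don't want to click through, here is a rough sketch of the general shape of such a loop. It is not the crate's actual code: the name `kernel_4x4`, the 4x4 tile size, and the packed data layout are assumptions made for illustration. The outer loop walks a raw pointer up to an end pointer, and the small fixed-size inner loops are what the compiler is expected to unroll and autovectorize.

```rust
/// Hypothetical 4x4 micro-kernel (illustration only, not matrixmultiply's code).
/// `a` and `b` are assumed to be packed as `k` consecutive groups of 4 floats.
/// Returns the sum over all groups of the outer product a_group * b_group^T.
pub fn kernel_4x4(k: usize, a: &[f32], b: &[f32]) -> [[f32; 4]; 4] {
    assert!(a.len() >= 4 * k && b.len() >= 4 * k);
    let mut ab = [[0.0f32; 4]; 4];
    let mut pa = a.as_ptr();
    let mut pb = b.as_ptr();
    // Safety: the assert above guarantees that every read below stays within
    // `a` and `b`, and that `end` is at most one past the end of `a`.
    unsafe {
        let end = pa.add(4 * k);
        // Pointer range loop: advance until the end pointer is reached.
        while pa != end {
            for i in 0..4 {
                for j in 0..4 {
                    ab[i][j] += *pa.add(i) * *pb.add(j);
                }
            }
            pa = pa.add(4);
            pb = pb.add(4);
        }
    }
    ab
}
```

Whether a loop like this ends up vectorized depends on the IR that rustc hands to LLVM, which is presumably where the MIR and non-MIR builds diverge.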

