Investigate link time optimizations (-flto) and inlining #229

@valassi

This is a spinoff of the Power9 issue #223.

I realised that adding -flto there (gcc link time optimization) gives a speedup of almost a factor of 4 for scalar C++ code (and more than a factor of 2 for SIMD code). I am using gcc8 there. Compare:

Amongst the things to be understood:

  • what does LTO really involve in our case?
  • does it also provide a benefit on x86 (or only on Power9)?
  • are there disadvantages? Keep in mind that the LHC experiments have their own ways of linking, e.g. -fpic etc.
  • try out a newer gcc compiler (e.g. is the lto-dump tool only available with gcc10?)
  • how to interpret objdump output when -flto is switched on
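
As a starting point for the first question, here is a minimal sketch of the kind of call boundary that -flto can remove. It is not taken from our code: the names `mulAddOpaque`, `mulAddVisible`, `sumOpaque`, `sumVisible` and `selfCheck` are hypothetical, and the `noinline` attribute (GCC/Clang) is only used to mimic, in a single file, a helper that lives in a different .cc file and therefore cannot be inlined without link time optimization.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Stand-in for a small helper defined in another translation unit.
// Without -flto the caller sees only a declaration and must emit a real call;
// the noinline attribute (GCC/Clang) reproduces that boundary in one file.
__attribute__((noinline)) double mulAddOpaque(double a, double b, double c)
{
  return a * b + c;
}

// Same helper, but with its body visible to the optimizer at the call site,
// which is roughly the situation -flto creates across files.
inline double mulAddVisible(double a, double b, double c)
{
  return a * b + c;
}

// Hot loop calling the opaque helper: one function call per element.
double sumOpaque(const std::vector<double>& x)
{
  double acc = 0;
  for (std::size_t i = 0; i < x.size(); ++i) acc = mulAddOpaque(x[i], 2.0, acc);
  return acc;
}

// Hot loop calling the visible helper: the compiler is free to inline it.
double sumVisible(const std::vector<double>& x)
{
  double acc = 0;
  for (std::size_t i = 0; i < x.size(); ++i) acc = mulAddVisible(x[i], 2.0, acc);
  return acc;
}

// Sanity check: both variants must compute the same value.
void selfCheck()
{
  const std::vector<double> x = { 1.0, 2.0, 3.0 };
  assert(sumOpaque(x) == sumVisible(x));
}
```

Building this with `g++ -O3 -c` and running `objdump -d` on the object file should show a `call` in `sumOpaque` and none in `sumVisible`. For a real multi-file build with -flto the comparison is probably better done on the final linked binary, since by default the object files then hold the compiler's intermediate representation rather than machine code (unless -ffat-lto-objects is used). Whether inlining alone explains the factor 4 seen on Power9 is an open question; this sketch only illustrates the mechanism.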

Labels: good first issue, performance