-
Notifications
You must be signed in to change notification settings - Fork 37
Open
Labels
good first issuewanna join the project? This may be a good starting pointwanna join the project? This may be a good starting pointperformanceHow fast is it? Make it go faster!How fast is it? Make it go faster!
Description
This is a spinoff of the Power9 issue #223.
I realised that adding -flto there (gcc link time optimizations) gains almost a factor 4 for scalar C++ code (and more than a factor 2 for simd code). I am using gcc8 there. Compare:
Amongst the things to be understood:
- what does lto really involve in our case
- does it also provide a benefit for x86 (or is it only on power9)
- disadvantages? keep in mind the LHC experiments have their own linking ways, eg fpic etc
- try out a newer gcc compiler (eg is the lto-dump tool only available with gcc10?)
- how to interpret objdump when -flto is switcehed on
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
good first issuewanna join the project? This may be a good starting pointwanna join the project? This may be a good starting pointperformanceHow fast is it? Make it go faster!How fast is it? Make it go faster!