Closed
Description
It seems we forgot to measure the compile time impact of the new MT combiner. When it was enabled for AArch64 in July it resulted in a large -Os 9% regression on CTMark/sqlite3 (and others too but I'm just using sqlite3 as a test case). Some time soon afterwards that regression went down to 3-4% but I haven't identified the commit responsible for that improvement.
However, even 3% is a very large CT regression.
The measurements were taken using a release + noasserts build of clang, without LTO/PGO on trunk.
Unfortunately some quick profiling doesn't show much except that ExecuteMatchTable()
is taking most of the time in the combiner, which isn't surprising.