Does link-time optimization help
Please include in the benchmarks, whether compiling with Link-Time-Optimization does help (and is measurable). Or whether switching gcc⇔clang does help. If you have problems compiling with LTO I can help you on the compilation.