Some tests use compilation flags that are not used in production (such as -d:danger, -mbranches-within-32B-boundaries, or -fbounds-check=off), so their results are not realistic. The README should be updated with instructions to avoid non-production flags, and the flags used by the tests themselves should be updated as well.
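
One possible way to keep such flags from creeping back in is a small check over the build commands. The sketch below is only an illustration: the function name find_non_production_flags and the sample commands are hypothetical, and the flag list is simply the three flags mentioned above, not an exhaustive list.

```python
import shlex
from typing import List

# Flags named in the text above as not used in production builds.
# This list is an assumption for illustration; extend it as needed.
NON_PRODUCTION_FLAGS = (
    "-d:danger",
    "-mbranches-within-32B-boundaries",
    "-fbounds-check=off",
)


def find_non_production_flags(command: str) -> List[str]:
    """Return the non-production flags that appear in a build command."""
    tokens = shlex.split(command)
    found = []
    for flag in NON_PRODUCTION_FLAGS:
        # Substring match, so wrapped forms such as
        # --passC:-mbranches-within-32B-boundaries are caught too.
        if any(flag in token for token in tokens):
            found.append(flag)
    return found


if __name__ == "__main__":
    # Hypothetical build commands, for illustration only.
    for cmd in ("nim c -d:release bench.nim", "nim c -d:danger bench.nim"):
        print(cmd, "->", find_non_production_flags(cmd))
```

A check like this could run in CI against the commands listed in the README, so that the documented instructions and the actual test builds stay aligned.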