Description
Background
I am currently trying to use memprof to optimize my program, but my tests have not shown any obvious performance gain. I would now like to develop a tool to check whether the optimization is actually taking effect.
My idea is to use `-fmemory-profile` and `-fmemory-profile-use` at the same time, so that memory accesses can be collected from the binary after the memprof optimization has been applied. This information could then be post-processed to determine whether the right clones were made. Furthermore, could a heat map be drawn from the memory access data, similar to what BOLT provides?
Question
- I made a first attempt and found that `-fmemory-profile` and `-fmemory-profile-use` cannot be used at the same time. Is this disallowed by design?
- Are there other tools for observing the result? I looked through the options and only found `memprof-export-to-dot`, but it can only show whether cloning happened, not whether the cloning result is good enough.