Add support for directly displaying FlameGraphs.jl graphs. #25

NHDaly · 2020-09-21T22:08:41Z

This makes PProf fit into the standard julia profiling infrastructure.

For example:

```julia
julia> using Profile, FlameGraphs, PProf, Serialization

julia> @profile peakflops()
1.1991880148053252e11

julia> fg = FlameGraphs.flamegraph()
Node(FlameGraphs.NodeData(ip:0x0, 0x00, 1:670))

julia> pprof(webport=11111, fg)
"profile.pb.gz"

julia> Main binary filename not available.
Serving web UI on http://localhost:11111
```

Unfortunately, PProf doesn't retain the ordering information that FlameGraphs have, so it's not as good a format for displaying FlameGraphs directly.

But the graph view and source views are still useful, even when building from FlameGraphs.

Before, starting from leaves up, we would miss the exclusive time for parent frames that take longer than their children. Now, we walk from top-down, and record a span in these four cases:

- If the node is a leaf
- If there is a gap _before_ first child
- If there is a gap _between_ children
- If there is a gap _after_ the last child

Then for each span, we walk *back* up the tree (😅) and record the stack trace.

We could maybe be algorithmically more efficient with some thought? We could certainly be a constant factor more efficient. My main concern is walking down each node and then back up each stack, this might be O(n*h) where h is the height of the tree. Probably would be better to build this up as we go? But 🤷 we have to emit the whole stack every time, so I don't think there's any avoiding O(n*h) time.
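As a rough illustration of the four cases above, here is a minimal sketch on a toy tree. `SpanNode` and `emit_samples!` are hypothetical names made up for this example; they are not the FlameGraphs.jl node type or the code in this PR. Unlike the commit, the sketch carries the current stack down the tree instead of walking back up, but copying the stack for each emitted span still costs O(h), so the O(n*h) bound is unchanged.

```julia
# Hypothetical stand-in for a flame graph node (not the FlameGraphs.jl type).
struct SpanNode
    name::String
    span::UnitRange{Int}          # sample indices covered by this frame
    children::Vector{SpanNode}    # ordered left to right, non-overlapping
end
SpanNode(name, span) = SpanNode(name, span, SpanNode[])

# Walk top-down. `stack` holds the frame names from the root to `node`.
# Every stretch of `node.span` not covered by a child is pushed to `out`
# as (copy of the stack, number of samples).
function emit_samples!(out, node::SpanNode, stack::Vector{String}=String[])
    push!(stack, node.name)
    cursor = first(node.span)             # next sample not yet accounted for
    for child in node.children
        gap = first(child.span) - cursor  # gap before the first child, or between children
        gap > 0 && push!(out, (copy(stack), gap))
        emit_samples!(out, child, stack)
        cursor = last(child.span) + 1
    end
    tail = last(node.span) - cursor + 1   # whole span for a leaf, else the gap after the last child
    tail > 0 && push!(out, (copy(stack), tail))
    pop!(stack)
    return out
end

root = SpanNode("root", 1:10, [
    SpanNode("f", 2:5, [SpanNode("g", 3:4)]),
    SpanNode("h", 7:9),
])
samples = emit_samples!(Tuple{Vector{String},Int}[], root)
```

The emitted counts add up to the length of the root span, which is a cheap sanity check that no exclusive time was dropped.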

NHDaly · 2020-10-06T03:31:13Z

Also, after my last rewrite, i have checked and confirmed that it does indeed produce almost exactly the same results whether building directly from Profile or going through FlameGraphs - so maybe we want to simplify our code and always go through FlameGraphs? I'm not sure! I kind of like having two separate implementations to validate in case either one gets an error, but ALSO i feel like FlameGraphs has many more eyes on it, and is therefore much more vetted 😅

I guess there are some inefficiencies of going through FlameGraphs, though, since we have to essentially deconstruct the aggregated view back into the samples view, so that's a bit disappointing. Oh, and also we lose individual samples, and can only report a span.

This reminds me:

TODO: the span in FlameGraphs.jl is in samples, not nanoseconds. So either we need to multiply by the period to get actual nanoseconds, or switch to reporting the duration in samples, if it will let us.
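For the first option in the TODO, the conversion is a single multiplication. A minimal sketch, assuming the sampling period is available in seconds (for instance the `delay` value returned by `Profile.init()`); `samples_to_ns` is a hypothetical helper, not something in PProf.jl:

```julia
# Hypothetical helper: convert a span measured in samples to nanoseconds,
# given the sampling period in seconds.
samples_to_ns(nsamples::Integer, delay_seconds::Real) =
    round(Int64, nsamples * delay_seconds * 1e9)

# The root node in the example session covers samples 1:670.
# Assuming a 1 ms sampling period for illustration:
ns = samples_to_ns(670, 0.001)
```

This only approximates wall time, since it assumes every sample was taken exactly one period apart.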

Here's the one weird difference i found when comparing data from peakflops() that went through FlameGraphs vs the data that was built directly off of Profile (it's probably easier to see if you pull up the images side-by-side):

pprof(fg):

pprof():

I'm not sure if this is another instance of us incorrectly duplicating a frame or something weird? But when we build off of Profile directly we duplicate the next frame down, instead of having the correct frame there.

vchuravy

I don't really have sufficient experience with FlameGraphs.jl. It's a shame that you have to "unprocess" the graph. So I would definitely have both versions.

src/flamegraphs.jl

NHDaly · 2020-10-12T22:45:38Z

> I don't really have sufficient experience with FlameGraphs.jl. It's a shame that you have to "unprocess" the graph. So I would definitely have both versions.

Yeah, agreed. But it does seem like this contains all of the same information we need, so it might be worth it to go this hop through FlameGraphs just to have a single, consistent transformation from Profile, which gets more eyes? 🤷

Also, I think the issue I showed in the images above is the only remaining issue before this is ready to merge!

vchuravy · 2020-10-12T23:45:16Z

I totally understand wanting to render FlameGraph data, especially since you can use it for other data sources.

In the long run I always wanted to include the JIT_PROFILE output for perf, and for that one likely would need to go from the raw form with the instruction pointers.

vchuravy

Also needs tests.

Project.toml

NHDaly · 2020-10-20T02:20:16Z

Okay, methinks this is good to go! :)

Thanks for the suggestions

NHDaly · 2020-10-20T02:21:20Z

After looking at the above discrepancy in the two images I posted, I think the FlameGraphs image is more likely to be right, and the PProf one is simply a dropped frame due to the inlining bug. It's maybe just that i ran this on an old commit (maybe on this branch) without merging in #27.

As of JuliaPerf/PProf.jl#25, and release v1.2.0, PProf.jl has supported displaying FlameGraphs directly, via:

```julia
using PProf
pprof(fg)
```

Add support for directly displaying FlameGraphs.jl graphs.

79f878d

This makes PProf fit into the standard julia profiling infrastructure.

NHDaly force-pushed the nhd-FlameGraphs branch from c067a71 to 79f878d on September 21, 2020 22:10

NHDaly added 7 commits September 22, 2020 12:41

Fix location IDs to be unique, and add C function support

f397549

Merge branch 'master' into nhd-FlameGraphs

beb7d9f

FlameGraphs: Robust against weird inputs.

c2d9db3

Handle case where input FlameGraph data contains 0x0 pointers Use a fallback which is more robust: hash the whole StackFrame object if it doesn't have a valid Instruction Pointer and/or linfo MethodInstance.
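A minimal sketch of that kind of fallback, for illustration only: `location_key` is a hypothetical name, and the sketch checks just the instruction pointer, leaving out the `linfo` check the commit message mentions. It relies on Base defining `hash` for `StackFrame` from the frame's function, file, line, and flags.

```julia
using Base.StackTraces: StackFrame

# Hypothetical helper: choose a key that identifies a frame's location.
# Prefer the instruction pointer; if it is 0x0, hash the whole frame.
function location_key(frame::StackFrame)
    if frame.pointer != 0
        return UInt64(frame.pointer)
    else
        return UInt64(hash(frame))
    end
end

# Frames built without pointer information have pointer == 0.
a = StackFrame("foo", "a.jl", 1)
b = StackFrame("bar", "a.jl", 2)
# A frame with a made-up, non-zero pointer.
c = StackFrame(:baz, Symbol("a.jl"), 3, nothing, false, false, 0x1234)
```

With this, `a` and `b` get different keys even though both have a `0x0` pointer, while two frames with identical fields share a key.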

FlameGraphs: Handle function names with strings in them (e.g. `var"#f…

e3f055c

…oo"`) Manually escape the strings before writing them to the Proto.
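The commit does not show how the escaping is done. One plausible reading, sketched here with Base's `escape_string`; `escape_name` is a hypothetical wrapper and not necessarily what PProf.jl does:

```julia
# Hypothetical wrapper: backslash-escape quotes (and backslashes) in a
# function name before it is written out.
escape_name(name::AbstractString) = escape_string(name)

# A function name containing double quotes, as in the commit title.
name = "var\"#foo\""
escaped = escape_name(name)
println(escaped)
```

`unescape_string(escaped)` gives back the original name, so the transformation is reversible on the reading side.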

Revert "FlameGraphs: Handle function names with strings in them (e.g.…

7ec85c2

… `var"#foo"`)" This reverts commit e3f055c. Undo this, and fix it in upstream `google/pprof` instead

Revert "Revert "FlameGraphs: Handle function names with strings in th…

f4ab2f9

…em (e.g. `var"#foo"`)"" This reverts commit 7ec85c2.

NHDaly force-pushed the nhd-FlameGraphs branch from 3d715b1 to 7811cbc on October 6, 2020 03:05

NHDaly requested a review from vchuravy October 6, 2020 05:01

NHDaly marked this pull request as ready for review October 6, 2020 05:01

NHDaly added 2 commits October 8, 2020 16:33

Fix FlameGraph printing to not skip root node in FlameGraph

ace2ce4

Switch FlameGraphs to report number of samples, not incorrect time.

238c229

it seemed to just work out of the box when i reported them as `event` `count`, just like normal.

vchuravy reviewed Oct 12, 2020

src/flamegraphs.jl (outdated)

Fix comment in src/flamegraphs.jl

2cb5782

vchuravy reviewed Oct 12, 2020

Project.toml

NHDaly added 4 commits October 19, 2020 22:16

Fix bug in Flamegraphs: infinite loop when with_c=false

c89ddc1

Add FlameGraph tests for PProf

807ab9f

Improve pprof empty profile test to verify output is written

c2c648e

Add [compat] for FlameGraphs = 0.2

66bb2c6

NHDaly requested a review from vchuravy October 20, 2020 02:20

vchuravy approved these changes Oct 20, 2020

vchuravy merged commit c5594d2 into master Oct 20, 2020

vchuravy deleted the nhd-FlameGraphs branch October 20, 2020 03:32

NHDaly added a commit that referenced this pull request Oct 20, 2020

Bump to v1.2.0 after merging #25

883f7db

NHDaly mentioned this pull request Nov 28, 2020

Add PProf to the list of packages that support FlameGraphs timholy/FlameGraphs.jl#32

Merged
