
Conversation


@jsdt jsdt commented Mar 27, 2025

Description of Changes

This adds endpoints to make heap profiling easier. By default, standalone will have jemalloc heap profiling enabled but not active, which showed no overhead in my benchmarks.

If you activate profiling with curl "localhost:3000/internal/heap/settings?enabled=true" -X POST, jemalloc will start sampling allocations. It will keep sampling until you call the endpoint again with enabled=false (or restart the process).

You can get a dump of the heap by GETing the /internal/heap endpoint. By default, the dump is in pprof format (see the pprof project for a tool to view it). If you pass the format=flame query parameter, you get a flame graph instead, which you can view directly in your browser at localhost:3000/internal/heap?format=flame
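Putting the endpoints together, a typical profiling session from the shell might look like this (assuming standalone is listening on localhost:3000; the output file names here are arbitrary):

```shell
# Activate jemalloc heap profiling (sampling starts now).
curl -X POST "localhost:3000/internal/heap/settings?enabled=true"

# ...exercise the workload you want to profile...

# Dump the heap in pprof format and open it with the pprof tool.
curl -o heap.pb.gz "localhost:3000/internal/heap"
pprof -http=:8080 heap.pb.gz

# Or fetch a flame graph instead (also viewable directly in a browser).
curl -o heap-flame.svg "localhost:3000/internal/heap?format=flame"

# Deactivate sampling when done.
curl -X POST "localhost:3000/internal/heap/settings?enabled=false"
```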

There is some additional overhead while profiling is active, since we are collecting samples and using memory to store them, but it should be safe to try out.

Note that the jemalloc configuration settings are set with the _rjem_malloc_conf variable in the main file of standalone, but they can be overridden by setting the _RJEM_MALLOC_CONF environment variable. You can learn more in the jemalloc documentation.
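For example, a sketch of overriding the compiled-in settings at launch (the binary invocation and option values here are illustrative, not taken from the PR; option names follow jemalloc's opt.* naming):

```shell
# Enable the profiling machinery and sample more often than the default
# (roughly every 2^16 bytes allocated).
_RJEM_MALLOC_CONF="prof:true,lg_prof_sample:16" ./standalone
```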

Extra background

Previously, it was possible to set up heap profiling by setting the environment variable _RJEM_MALLOC_CONF to something like prof:true,prof_active:true,lg_prof_sample:12,prof_prefix:/jemalloc-dumps/,lg_prof_interval:20, which made the program sample allocations and periodically dump a heap profile to disk. Using these dumps was a little annoying with Docker, because the pprof tool needs the original binary to produce readable symbols.
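Concretely, symbolizing one of those on-disk dumps meant handing pprof both the profile and the exact binary that produced it (the paths here are hypothetical):

```shell
# pprof resolves symbols from the unstripped original binary,
# so it must be the same build that wrote the dump.
pprof -http=:8080 ./standalone /jemalloc-dumps/jeprof.12345.0.i0.heap
```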

To make it easier, this uses the jemalloc-pprof crate, which takes care of attaching symbols/backtraces for us. This means we can parse the output without needing access to the binary. This adds some extra overhead, but we can always remove it if we decide it is too slow.
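For reference, a minimal sketch of how jemalloc-pprof is typically wired up, based on that crate's documented API rather than this PR's actual code (it assumes the tikv-jemallocator, jemalloc_pprof, anyhow, and tokio crates):

```rust
// Use jemalloc as the global allocator so its profiling hooks are in play.
#[global_allocator]
static ALLOC: tikv_jemallocator::Jemalloc = tikv_jemallocator::Jemalloc;

// Compile-time defaults: profiling machinery enabled, sampling inactive.
// (Overridable at runtime via the _RJEM_MALLOC_CONF environment variable.)
#[allow(non_upper_case_globals)]
#[export_name = "_rjem_malloc_conf"]
pub static malloc_conf: &[u8] = b"prof:true,prof_active:false,lg_prof_sample:19\0";

// Handler body for something like GET /internal/heap: returns a gzipped
// pprof protobuf with symbols already attached, so readers don't need
// access to the original binary.
async fn dump_heap_pprof() -> anyhow::Result<Vec<u8>> {
    let ctl = jemalloc_pprof::PROF_CTL
        .as_ref()
        .ok_or_else(|| anyhow::anyhow!("heap profiling not available"))?;
    let mut ctl = ctl.lock().await;
    ctl.dump_pprof()
}
```

The symbolization step is the source of the extra overhead mentioned above: the crate walks the sampled backtraces and embeds symbol information in the dump itself.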

API and ABI breaking changes

I added an /internal section of the HTTP API (outside of v1), with the idea that this is an undocumented feature that we may change or remove at any time.

Expected complexity level and risk

  1. The main risk here is around the overhead when using it. There is very little risk of performance issues while profiling is not active.

Testing

Tested manually and locally by curling the different endpoints, and by setting _RJEM_MALLOC_CONF to prof:false to disable profiling entirely.

@bfops bfops self-requested a review March 27, 2025 20:00

bfops commented Mar 28, 2025

Just to check with @jdetter as someone who ends up doing a lot of profiling - are you happy with the pprof choice here?


bfops commented Mar 28, 2025

Tested manually locally, by curling the different endpoints, and also by setting _RJEM_MALLOC_CONF to prof:false to totally disable profiling.

To confirm, you did all of these?

  • Set _RJEM_MALLOC_CONF to prof:false to totally disable profiling
  • Called /internal/heap/settings?enabled=true and then verified that samples were being collected by calling /internal/heap
  • Called /internal/heap/settings?enabled=false and verified that samples were no longer being collected by calling /internal/heap
  • Called /internal/heap/settings and verified that it returned the current profiling enabled/disabled status


bfops commented Mar 28, 2025

I left some questions but this broadly looks good to me. I can't really speak to the details of whether or not we're using the jemalloc-specific stuff correctly, but the fallout seems low if we've gotten something wrong.


jsdt commented Mar 30, 2025

Yes, I have done all of those tests.

@bfops bfops added the release-any To be landed in any release window label Mar 31, 2025
@bfops bfops left a comment


LGTM. I can't really speak to the details of whether or not we're using the jemalloc-specific stuff correctly, but the fallout seems low if we've gotten something wrong.

@jsdt jsdt added this pull request to the merge queue Mar 31, 2025
Merged via the queue into master with commit 64aef29 Mar 31, 2025
12 of 14 checks passed