Make trace:info(S, MFA, call_time | call_memory) not block all schedulers #10207

sverker · 2025-09-16T14:01:18Z

Another step in the quest of purging ERTS from BIFs that block all scheduler threads.

This affects both the new trace:info/3 and the old erlang:trace_info/2 when they are called to collect call_time and/or call_memory counters.

github-actions · 2025-09-16T14:02:18Z

CT Test Results

3 files 135 suites 49m 37s ⏱️
1 652 tests 1 595 ✅ 57 💤 0 ❌
2 290 runs 2 214 ✅ 76 💤 0 ❌

Results for commit 357949a.

♻️ This comment has been updated with latest results.

To speed up review, make sure that you have read Contributing to Erlang/OTP and that all checks pass.

See the TESTING and DEVELOPMENT HowTo guides for details about how to run test locally.

Artifacts

// Erlang/OTP Github Action Bot

Don't spend time and heap memory on items not asked for.

in preparation for block-less trace:info.

Save memory in most cases, but impose extra cpu (time) when tracing.

It should always be erts_active_bp_index ^ 1. Avoid tiny race when we had to change two atomics. We only call erts_staging_bp_ix() when we change trace settings.

Copilot

Pull Request Overview

This PR refactors the implementation of trace:info/3 and erlang:trace_info/2 to make them non-blocking when collecting call_time and call_memory counters, addressing the issue of these functions blocking all scheduler threads.

Refactored breakpoint data structures to eliminate the erts_staging_bp_index variable
Implemented non-blocking trace info collection using code barriers and process suspension/resumption
Reorganized hash table structures and functions for call time/memory tracing

Reviewed Changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
erts/emulator/test/trace_call_time_SUITE.erl	Added new test cases to verify non-blocking behavior and robustness against process termination
erts/emulator/nifs/common/prim_tty_nif.c	Fixed assertion to include zero in valid character range
erts/emulator/beam/erl_trace.h	Removed staging breakpoint index variable
erts/emulator/beam/erl_trace.c	Removed staging breakpoint index variable
erts/emulator/beam/erl_init.c	Moved trace initialization after emulator initialization
erts/emulator/beam/erl_bif_trace.c	Major refactor to implement non-blocking trace info collection with trapping mechanisms
erts/emulator/beam/beam_bp.h	Refactored breakpoint data structures and function signatures
erts/emulator/beam/beam_bp.c	Implemented new hash table management and non-blocking collection algorithms
erts/emulator/beam/atom.names	Added new atom for trace info finish export

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

erts/emulator/beam/erl_bif_trace.c

Instead use this strategy 0. Seize code mod permission (as before) 1. Allocate temporary zeroed hashes for any traced calls that may happen during the call to trace:info. 2. Thread progress 3. Switch bp index to make the temp hashes active. 4. Thread progress. 5. Collect stats from the real hashes that are now unused and stable. 6. Switch back bp index to make the real hashes active again. 7. Thread progress. 8. Consolidate by collecting stats from the temp hashes into the active generation. 9. Deallocate the temp hashes and make the two halves of the breakpoint identical again using the same real hashes. 10. Build result from stats collected in step 5 11. Release code mod permission

with some talk about sessions and call_time and call_memory.

Copilot

Pull Request Overview

Copilot reviewed 11 out of 11 changed files in this pull request and generated 3 comments.

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

Copilot · 2025-10-01T15:50:15Z

erts/emulator/beam/erl_bif_trace.c

    default:
-	goto error;
+        erts_exit(ERTS_ABORT_EXIT, "Invalid key\n");
    }


The error message 'Invalid key' is not helpful for debugging. It should specify what constitutes a valid key or include the actual invalid key value that was encountered.

Copilot · 2025-10-01T15:50:16Z

erts/emulator/beam/beam_bp.c

    erts_refc_init(&bdt->refc, 1);
    for (Uint i = 0; i < n; i++) {
-        bp_hash_init(&(bdt->hash[i]), 32);
+        bdt->threads[i] = NULL;  // allocate on demand


C++ style comment should be replaced with C style comment /* allocate on demand */ to maintain consistency with the C codebase.

Suggested change

bdt->threads[i] = NULL; // allocate on demand

bdt->threads[i] = NULL; /* allocate on demand */

Copilot · 2025-10-01T15:50:16Z

erts/emulator/beam/beam_bp.c

+    sys_memzero(hash, size);
    hash->n    = n;
    hash->used = 0;

-    hash->item = (bp_data_trace_item_t *)Alloc(size);
-    sys_memzero(hash->item, size);
-
-    for(i = 0; i < n; ++i) {
-	hash->item[i].pid = NIL;
+    for(Uint i = 0; i < n; ++i) {
+	hash->buckets[i].pid = NIL;


The hash table allocation uses sys_memzero to zero the entire structure including buckets, but then immediately sets hash->buckets[i].pid = NIL in a loop. This double initialization is inefficient - either use calloc-equivalent allocation or skip the memzero and only initialize necessary fields.

sverker added this to the OTP-29.0 milestone Sep 16, 2025

sverker self-assigned this Sep 16, 2025

sverker added team:VM Assigned to OTP team VM enhancement labels Sep 16, 2025

sverker added 5 commits September 16, 2025 17:40

erts: Make trace:info(S, MFA, Item) less wasteful

a9d9880

Don't spend time and heap memory on items not asked for.

erts: Refactor call time/memory trace pid hash structs

e0f7abd

in preparation for block-less trace:info.

erts: Fix faulty assert in prim_tty_nif:isprint/1

15c8722

erts: Create thread hashes for time/memory tracing on demand

62cfd55

Save memory in most cases, but impose extra cpu (time) when tracing.

erts: Remove erts_staging_bp_index

e09af26

It should always be erts_active_bp_index ^ 1. Avoid tiny race when we had to change two atomics. We only call erts_staging_bp_ix() when we change trace settings.

sverker force-pushed the sverker/erts/trace-no-block branch from 84b9e53 to 5fee623 Compare September 16, 2025 16:14

sverker added the testing currently being tested, tag is used by OTP internal CI label Sep 17, 2025

sverker requested a review from Copilot September 17, 2025 17:13

Copilot AI reviewed Sep 17, 2025

View reviewed changes

erts/emulator/beam/erl_bif_trace.c Show resolved Hide resolved

erts/emulator/beam/erl_bif_trace.c Outdated Show resolved Hide resolved

sverker added testing currently being tested, tag is used by OTP internal CI and removed testing currently being tested, tag is used by OTP internal CI labels Sep 29, 2025

sverker force-pushed the sverker/erts/trace-no-block branch 2 times, most recently from baded75 to f41cc8d Compare October 1, 2025 12:07

sverker requested a review from frazze-jobb October 1, 2025 12:13

sverker force-pushed the sverker/erts/trace-no-block branch 2 times, most recently from c9cfc5e to c8cb8dd Compare October 1, 2025 13:27

erts: Improve internal Tracing.md

357949a

with some talk about sessions and call_time and call_memory.

sverker force-pushed the sverker/erts/trace-no-block branch from c8cb8dd to 357949a Compare October 1, 2025 13:37

sverker requested a review from Copilot October 1, 2025 15:49

Copilot AI reviewed Oct 1, 2025

View reviewed changes

frazze-jobb approved these changes Oct 2, 2025

View reviewed changes

sverker added testing currently being tested, tag is used by OTP internal CI and removed testing currently being tested, tag is used by OTP internal CI labels Oct 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make trace:info(S, MFA, call_time | call_memory) not block all schedulers #10207

Make trace:info(S, MFA, call_time | call_memory) not block all schedulers #10207

sverker commented Sep 16, 2025

Uh oh!

github-actions bot commented Sep 16, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Oct 1, 2025

Uh oh!

Copilot AI Oct 1, 2025

Uh oh!

Copilot AI Oct 1, 2025

Uh oh!

Uh oh!

	bdt->threads[i] = NULL; // allocate on demand
	bdt->threads[i] = NULL; /* allocate on demand */

Make trace:info(S, MFA, call_time | call_memory) not block all schedulers #10207

Are you sure you want to change the base?

Make trace:info(S, MFA, call_time | call_memory) not block all schedulers #10207

Conversation

sverker commented Sep 16, 2025

Uh oh!

github-actions bot commented Sep 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CT Test Results

Artifacts

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Copilot AI Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented Sep 16, 2025 •

edited

Loading