perf: improve islock / llmq signing latency #6896

PastaPastaPasta · 2025-10-15T19:48:59Z

Issue being fixed or feature implemented

InstantSend / LLMQ signing is currently wait / loop driven. This PR converts it to a notification-driven model, so worker threads can begin new signing or other operations immediately instead of waiting out a sleep interval. Under very high load this is unlikely to change throughput significantly, but it should improve best-case latency at low throughput.

📊 Mempool Performance Improvement (local functional test with 15 nodes, measuring latency via ZMQ from the mempool notification to the islock / recsig notification; recsig should reflect input lock time)

Metric	Before (avg of 2 runs)	After (avg of 2 runs)	Improvement (%)
lock - min	307.26 ms	53.30 ms	−82.7%
lock - p50	537.28 ms	235.15 ms	−56.2%
lock - p90	718.17 ms	388.75 ms	−45.9%
lock - p99	813.85 ms	443.19 ms	−45.6%
lock - max	846.09 ms	561.26 ms	−33.6%
recsig - min	111.26 ms	46.55 ms	−58.2%
recsig - p50	367.82 ms	145.53 ms	−60.4%
recsig - p90	491.30 ms	246.39 ms	−49.9%
recsig - p99	562.76 ms	313.32 ms	−44.3%
recsig - max	603.32 ms	359.99 ms	−40.3%

On mainnet we currently see an average of ~1.5-2 seconds to lock a transaction. Based on the ~250 ms average reduction measured locally, we may see an average time to lock of 1.25-1.75 seconds with this change, although the reduction may be larger or smaller with significantly larger quorums.

Checklist:

Go over all the following points, and put an x in all the boxes that apply.

I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have added or updated relevant unit/integration/functional/e2e tests
I have made corresponding changes to the documentation
I have assigned this pull request to a milestone (for repository code-owners and collaborators only)

…nsiveness Adjusted the timing intervals in the WorkThreadMain function of CSigSharesManager to 10 milliseconds for both message sending and work interruption checks. This change enhances the responsiveness of the signing shares manager by allowing more frequent processing of pending signature shares and message sending. This results in ~33% latency improvements in a contrived local latency functional test / benchmark from ~500ms to ~333ms

… CSigSharesManager Added a NotifyWorker function to both CInstantSendManager and CSigSharesManager to signal the worker thread when new work is available. This change improves the responsiveness of the worker threads by allowing them to wake up promptly when there are pending tasks, thus enhancing overall performance and reducing latency in processing instant send locks and signature shares.

github-actions · 2025-10-15T19:49:31Z

✅ No Merge Conflicts Detected

This PR currently has no conflicts with other open PRs.

coderabbitai · 2025-10-15T19:49:48Z

Walkthrough

Introduces epoch/condition-variable wake-up mechanisms across InstantSend, LLMQ signing, and sigshares: adds workMutex, workCv, workEpoch, and a NotifyWorker() helper; InterruptWorkerThread now notifies the CV. InstantSend replaces pair-based pending entries with instantsend::PendingISLockFromPeer (fields node_id, islock) and updates all insertion/lookup/removal call sites. Many state-mutating methods now call NotifyWorker(). Worker threads were refactored from sleep-based polling to epoch-aware condition-variable waits with deadline-based periodic wakeups and steady_clock timing for cleanup/retry scheduling.
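As a reading aid, here is a minimal sketch of the epoch plus condition-variable pattern the walkthrough describes. It is an illustration under assumptions, not the PR's code: `WorkerSignal`, `CurrentEpoch`, and `WaitForWork` are hypothetical names, and `std::mutex` / `std::condition_variable` stand in for the project's own `Mutex` wrapper.

```cpp
#include <atomic>
#include <chrono>
#include <condition_variable>
#include <cstdint>
#include <mutex>

// Hypothetical stand-in for the worker wake-up state (workMutex, workCv, workEpoch).
class WorkerSignal
{
public:
    // Producer side: record that state changed, then wake the worker.
    void NotifyWorker()
    {
        workEpoch.fetch_add(1, std::memory_order_release);
        // Briefly taking the mutex orders the bump against a worker that has
        // already evaluated its predicate but has not blocked yet.
        { std::lock_guard<std::mutex> lock(workMutex); }
        workCv.notify_one();
    }

    // Worker side: snapshot the epoch before scanning the queues for work.
    uint64_t CurrentEpoch() const { return workEpoch.load(std::memory_order_acquire); }

    // Worker side: block until the epoch moves past startEpoch or the timeout elapses.
    // Returns true if a notification arrived, false on timeout.
    bool WaitForWork(uint64_t startEpoch, std::chrono::milliseconds timeout)
    {
        std::unique_lock<std::mutex> lock(workMutex);
        return workCv.wait_for(lock, timeout, [&] {
            return workEpoch.load(std::memory_order_acquire) != startEpoch;
        });
    }

private:
    std::mutex workMutex;
    std::condition_variable workCv;
    std::atomic<uint64_t> workEpoch{0};
};
```

Because the worker compares against an epoch it read before looking for work, a notification that lands between the scan and the wait still changes the epoch, so the wait returns immediately rather than sleeping through it.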

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title Check	✅ Passed	The PR title "perf: improve islock / llmq signing latency" directly and clearly summarizes the main objective of the changeset. The raw summary confirms that the changes convert InstantSend/LLMQ signing from a wait/loop-driven model to a notification-driven model to improve responsiveness and latency. The title is concise, specific, and appropriately highlights the primary change from the developer's perspective without unnecessary noise.
Description Check	✅ Passed	The PR description is directly related to the changeset and accurately describes the conversion from wait/loop-driven to notification-driven architecture. It explains the motivation (allowing immediate work instead of waiting), expected impact (improved best-case latency), and provides performance benchmarks from local testing showing significant latency reductions across all metrics. The description aligns well with the technical changes detailed in the raw summary, such as the addition of NotifyWorker() calls, condition variables, and epoch-based synchronization across multiple files.

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (11)

src/llmq/signing.cpp (3)

434-436: Move NotifyWorker() outside cs_pending to avoid waking into lock contention

You notify while still holding cs_pending. Wake-ups can immediately contend on the same lock the worker needs. Release the lock before notifying.

Apply:

-    LOCK(cs_pending);
-    if (pendingReconstructedRecoveredSigs.count(recoveredSig->GetHash())) {
+    {
+        LOCK(cs_pending);
+        if (pendingReconstructedRecoveredSigs.count(recoveredSig->GetHash())) {
             // no need to perform full verification
             LogPrint(BCLog::LLMQ, "CSigningManager::%s -- already pending reconstructed sig, signHash=%s, id=%s, msgHash=%s, node=%d\n", __func__,
                      recoveredSig->buildSignHash().ToString(), recoveredSig->getId().ToString(), recoveredSig->getMsgHash().ToString(), from);
-        return ret;
-    }
-
-    pendingRecoveredSigs[from].emplace_back(recoveredSig);
-    workEpoch.fetch_add(1, std::memory_order_acq_rel);
-    NotifyWorker();
+            return ret;
+        }
+        pendingRecoveredSigs[from].emplace_back(recoveredSig);
+    }
+    workEpoch.fetch_add(1, std::memory_order_release);
+    NotifyWorker();
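
The "mutate under the lock, notify after releasing it" shape can be sketched in isolation as follows. This is a simplified, hypothetical example: `PendingQueue` and `WaitAndDrain` are made-up names, and unlike the PR it uses a single mutex for both the container and the condition variable.

```cpp
#include <condition_variable>
#include <deque>
#include <mutex>
#include <utility>

// Hypothetical queue; cs_pending guards only the container.
class PendingQueue
{
public:
    void Push(int item)
    {
        {
            std::lock_guard<std::mutex> lock(cs_pending);
            pending.push_back(item);
        } // cs_pending is released here
        // Notify outside the critical section so the woken worker does not
        // immediately block on a mutex the notifier still holds.
        cv.notify_one();
    }

    // Blocks until at least one item is queued, then takes everything queued so far.
    std::deque<int> WaitAndDrain()
    {
        std::unique_lock<std::mutex> lock(cs_pending);
        cv.wait(lock, [this] { return !pending.empty(); });
        return std::exchange(pending, {});
    }

private:
    std::mutex cs_pending;
    std::condition_variable cv;
    std::deque<int> pending;
};
```

The early-return path in the suggestion above keeps the same property: the scoped block ends before any notification is issued.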

651-653: Also notify after releasing cs_pending in PushReconstructedRecoveredSig

Same contention pattern; release cs_pending before bumping epoch/notify.

-    LOCK(cs_pending);
-    pendingReconstructedRecoveredSigs.emplace(std::piecewise_construct, std::forward_as_tuple(recoveredSig->GetHash()), std::forward_as_tuple(recoveredSig));
-    workEpoch.fetch_add(1, std::memory_order_acq_rel);
-    NotifyWorker();
+    {
+        LOCK(cs_pending);
+        pendingReconstructedRecoveredSigs.emplace(std::piecewise_construct, std::forward_as_tuple(recoveredSig->GetHash()), std::forward_as_tuple(recoveredSig));
+    }
+    workEpoch.fetch_add(1, std::memory_order_release);
+    NotifyWorker();

837-863: Epoch-based wait + deadline is solid; optional steady_clock clean-up

The wait logic is correct. Minor optional: store lastCleanupTime as a steady_clock timestamp (or compute deadline fully in steady_clock) to avoid system clock jumps influencing the cadence.

src/instantsend/instantsend.h (1)

66-70: LGTM; minor nits

Adding workMutex/workCv/workEpoch and waking on interrupt is correct.

Optional: mark NotifyWorker() noexcept and use memory_order_release for fetch_add (loads already use acquire).

Also applies to: 112-112, 140-143

src/instantsend/instantsend.cpp (2)

164-167: Notify after releasing locks to avoid waking into contention

These NotifyWorker() calls run while cs_pendingLocks (ProcessMessage), cs_nonLocked (RemoveNonLockedTx), or cs_pendingLocks (TryEmplacePendingLock) are still held. Wake-ups can contend immediately on the same locks.

Move notifications after the lock scopes.

@@
-    LOCK(cs_pendingLocks);
-    pendingInstantSendLocks.emplace(hash, std::make_pair(from, islock));
-    NotifyWorker();
+    {
+        LOCK(cs_pendingLocks);
+        pendingInstantSendLocks.emplace(hash, std::make_pair(from, islock));
+    }
+    NotifyWorker();
@@
-    LogPrint(BCLog::INSTANTSEND, "CInstantSendManager::%s -- txid=%s, pindexMined=%s\n", __func__,
-             tx->GetHash().ToString(), pindexMined ? pindexMined->GetBlockHash().ToString() : "");
-    NotifyWorker();
+    LogPrint(BCLog::INSTANTSEND, "CInstantSendManager::%s -- txid=%s, pindexMined=%s\n", __func__,
+             tx->GetHash().ToString(), pindexMined ? pindexMined->GetBlockHash().ToString() : "");
+    NotifyWorker();
@@
-    LOCK(cs_nonLocked);
+    LOCK(cs_nonLocked);
     ...
-    LogPrint(BCLog::INSTANTSEND, "CInstantSendManager::%s -- txid=%s, retryChildren=%d, retryChildrenCount=%d\n",
-             __func__, txid.ToString(), retryChildren, retryChildrenCount);
-    NotifyWorker();
+    LogPrint(BCLog::INSTANTSEND, "CInstantSendManager::%s -- txid=%s, retryChildren=%d, retryChildrenCount=%d\n",
+             __func__, txid.ToString(), retryChildren, retryChildrenCount);
+    // cs_nonLocked is released here (end of function scope) in current layout;
+    // if kept, wrap the critical section in its own block and notify after.
+    NotifyWorker();
@@
-    LOCK(cs_pendingLocks);
-    if (!pendingInstantSendLocks.count(hash)) {
-        pendingInstantSendLocks.emplace(hash, std::make_pair(id, islock));
-    }
-    NotifyWorker();
+    {
+        LOCK(cs_pendingLocks);
+        if (!pendingInstantSendLocks.count(hash)) {
+            pendingInstantSendLocks.emplace(hash, std::make_pair(id, islock));
+        }
+    }
+    NotifyWorker();

Also applies to: 561-562, 602-603, 633-634

934-967: Epoch-based wait loop is correct; optional deadline for periodic work

Looping while there’s pending work and otherwise waiting on epoch/interrupt is good. If InstantSend ever needs periodic timeouts without external triggers, consider a bounded wait (like signing.cpp’s next-deadline) to guarantee periodic wakeups.

Please confirm there are no time-based maintenance tasks in InstantSend that require periodic wakeups.

src/llmq/signing.h (1)

241-246: Worker wake-up API is consistent; minor polish

Adding workMutex/workCv/workEpoch and exposing NotifyWorker is consistent with usage in signing.cpp.

Optional: declare NotifyWorker noexcept and consider memory_order_release on fetch_add for clarity.

Also applies to: 251-251

src/llmq/signing_shares.cpp (4)

1502-1556: Worker loop: event-driven with deadlines; consider two small tweaks

The epoch/CV wait with deadline calculation is sound. Two optional improvements:

If no deadline is computed, prefer an indefinite wait (with predicate) over 10ms polling to reduce wakeups.
Use a single time source for recovery timing (either util::GetTime<std::chrono::milliseconds>() everywhere or TicksSinceEpoch) for consistency.

Example for the first point:

-        if (next_deadline == std::chrono::steady_clock::time_point::max()) {
-            workCv.wait_for(l, std::chrono::milliseconds(10), [this, startEpoch]{
-                return bool(workInterrupt) || workEpoch.load(std::memory_order_acquire) != startEpoch;
-            });
-        } else {
+        if (next_deadline == std::chrono::steady_clock::time_point::max()) {
+            workCv.wait(l, [this, startEpoch]{
+                return bool(workInterrupt) || workEpoch.load(std::memory_order_acquire) != startEpoch;
+            });
+        } else {
             workCv.wait_until(l, next_deadline, [this, startEpoch]{
                 return bool(workInterrupt) || workEpoch.load(std::memory_order_acquire) != startEpoch;
             });
-        }
+        }
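
One way to make the "no deadline means wait indefinitely" branch explicit is to carry the deadline as an optional instead of a `time_point::max()` sentinel. The sketch below is only an illustration of that idea; `ConsiderDeadline`, `WaitForWorkOrDeadline`, and `WakeReason` are hypothetical helpers, not names from the PR.

```cpp
#include <chrono>
#include <condition_variable>
#include <mutex>
#include <optional>

using SteadyTime = std::chrono::steady_clock::time_point;

enum class WakeReason { Notified, Deadline };

// Fold a candidate deadline into the running minimum.
inline void ConsiderDeadline(std::optional<SteadyTime>& next, SteadyTime candidate)
{
    if (!next || candidate < *next) next = candidate;
}

// Wait for the predicate; only use a timed wait when a deadline actually exists.
template <typename Pred>
WakeReason WaitForWorkOrDeadline(std::condition_variable& cv, std::unique_lock<std::mutex>& lock,
                                 const std::optional<SteadyTime>& deadline, Pred pred)
{
    if (!deadline) {
        cv.wait(lock, pred); // nothing scheduled: sleep until notified
        return WakeReason::Notified;
    }
    // wait_until returns the predicate's value, so false means the deadline passed.
    return cv.wait_until(lock, *deadline, pred) ? WakeReason::Notified : WakeReason::Deadline;
}
```

The caller would pass the same interrupt-or-epoch-changed predicate shown in the diff above.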

1563-1565: Notify outside cs_pendingSigns to minimize contention

Notify while not holding cs_pendingSigns, so the worker can grab it immediately.

 void CSigSharesManager::AsyncSign(const CQuorumCPtr& quorum, const uint256& id, const uint256& msgHash)
 {
-    LOCK(cs_pendingSigns);
-    pendingSigns.emplace_back(quorum, id, msgHash);
-    // Wake worker to handle new pending sign immediately
-    NotifyWorker();
+    {
+        LOCK(cs_pendingSigns);
+        pendingSigns.emplace_back(quorum, id, msgHash);
+    }
+    // Wake worker to handle new pending sign immediately
+    NotifyWorker();
 }

1690-1692: Release cs before notifying

Avoid notifying while holding cs; release first to reduce contention.

 void CSigSharesManager::ForceReAnnouncement(const CQuorumCPtr& quorum, Consensus::LLMQType llmqType, const uint256& id, const uint256& msgHash)
 {
     if (IsAllMembersConnectedEnabled(llmqType, m_sporkman)) {
         return;
     }
-
-    LOCK(cs);
-    auto signHash = SignHash(llmqType, quorum->qc->quorumHash, id, msgHash).Get();
-    if (const auto *const sigs = sigShares.GetAllForSignHash(signHash)) {
-        for (const auto& [quorumMemberIndex, _] : *sigs) {
-            // re-announce every sigshare to every node
-            sigSharesQueuedToAnnounce.Add(std::make_pair(signHash, quorumMemberIndex), true);
-        }
-    }
-    for (auto& [_, nodeState] : nodeStates) {
-        auto* session = nodeState.GetSessionBySignHash(signHash);
-        if (session == nullptr) {
-            continue;
-        }
-        session->knows.SetAll(false);
-        session->sendSessionId = UNINITIALIZED_SESSION_ID;
-    }
-    // Wake worker so announcements are sent promptly
-    NotifyWorker();
+    {
+        LOCK(cs);
+        auto signHash = SignHash(llmqType, quorum->qc->quorumHash, id, msgHash).Get();
+        if (const auto *const sigs = sigShares.GetAllForSignHash(signHash)) {
+            for (const auto& [quorumMemberIndex, _] : *sigs) {
+                sigSharesQueuedToAnnounce.Add(std::make_pair(signHash, quorumMemberIndex), true);
+            }
+        }
+        for (auto& [_, nodeState] : nodeStates) {
+            auto* session = nodeState.GetSessionBySignHash(signHash);
+            if (session == nullptr) continue;
+            session->knows.SetAll(false);
+            session->sendSessionId = UNINITIALIZED_SESSION_ID;
+        }
+    }
+    // Wake worker so announcements are sent promptly
+    NotifyWorker();
 }

1698-1701: Release cs before notifying

Same rationale as above; notify after releasing cs.

 MessageProcessingResult CSigSharesManager::HandleNewRecoveredSig(const llmq::CRecoveredSig& recoveredSig)
 {
-    LOCK(cs);
-    RemoveSigSharesForSession(recoveredSig.buildSignHash().Get());
-    // Cleaning up a session can free resources; wake worker to proceed
-    NotifyWorker();
+    {
+        LOCK(cs);
+        RemoveSigSharesForSession(recoveredSig.buildSignHash().Get());
+    }
+    // Cleaning up a session can free resources; wake worker to proceed
+    NotifyWorker();
     return {};
 }

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between f170aed and 2f03126.

📒 Files selected for processing (6)

src/instantsend/instantsend.cpp (11 hunks)
src/instantsend/instantsend.h (3 hunks)
src/llmq/signing.cpp (5 hunks)
src/llmq/signing.h (2 hunks)
src/llmq/signing_shares.cpp (8 hunks)
src/llmq/signing_shares.h (3 hunks)

🧰 Additional context used

📓 Path-based instructions (1)

src/**/*.{cpp,h,cc,cxx,hpp}

📄 CodeRabbit inference engine (CLAUDE.md)

src/**/*.{cpp,h,cc,cxx,hpp}: Dash Core C++ codebase must be written in C++20 and require at least Clang 16 or GCC 11.1
Dash uses unordered_lru_cache for efficient caching with LRU eviction

Files:

src/llmq/signing.cpp
src/llmq/signing.h
src/instantsend/instantsend.h
src/llmq/signing_shares.h
src/instantsend/instantsend.cpp
src/llmq/signing_shares.cpp

🔇 Additional comments (11)

src/llmq/signing.cpp (3)

526-533: Early-continue on pending queues is correct

Returning true when either queue still has items prevents unnecessary sleeping. Good.

585-592: Good: report more work when queues non-empty or batch threshold hit

This avoids idle sleeps under load. Looks correct.

831-832: Interrupt path properly wakes the worker

Calling workCv.notify_all() on interrupt is appropriate. LGTM.

src/instantsend/instantsend.cpp (1)

462-463: LGTM: timely worker wake-ups at key state transitions

Waking after mempool adds/removals, block connects/disconnects, ChainLock notifications, and tip updates looks correct.

Also applies to: 480-481, 511-512, 518-519, 639-640, 657-658

src/llmq/signing_shares.h (1)

383-387: Worker wake-up primitives look good; keep annotations or document; verify implementation exists

New workMutex/workCv/workEpoch and NotifyWorker are appropriate.

You removed thread-safety annotations from SignPendingSigShares/WorkThreadMain. Consider restoring them (or documenting internal locking) to keep static checks useful.

Ensure CSigSharesManager::NotifyWorker is defined in the .cpp and used at all state-mutating sites (message handlers, queue inserts, timers).

Run to verify definition and usages:

Also applies to: 494-496, 497-497

src/llmq/signing_shares.cpp (6)

218-220: Prompt shutdown wakeup is correct

Notifying the worker during interrupt ensures timely exit. Looks good.

301-304: Wake worker after inbound message processing

Good call to notify; inbound messages can create immediate work.

735-736: Batching announce + deferred notify is solid

Queuing under cs and notifying after unlock minimizes contention and spurious wakeups. Nice.

Also applies to: 744-745, 766-769

1496-1497: Notify after ban to reroute work

Waking the worker after marking a node banned is appropriate.

1589-1591: Post-sign notify to progress announcements/recovery

Good to wake after enqueuing new work.

1703-1709: Epoch + CV notify is appropriate for lossless wakeups

Atomic increment plus notify_one is correct for a single worker.

knst · 2025-10-16T08:12:13Z

src/instantsend/instantsend.cpp

-        }
+        if (fMoreWork) continue;
+        std::unique_lock<Mutex> l(workMutex);
+        workCv.wait(l, [this, startEpoch]{


Is there any chance that a notification could be missed and the thread would wait forever? For example, when the app is terminating.
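
For context on this question, an untimed wait is usually kept safe at shutdown by putting the interrupt flag into the wait predicate and having the interrupt path notify the condition variable. The sketch below illustrates that with hypothetical names (`InterruptibleWait`, `Interrupt`, `WaitForWork`); it is not the PR's code, and whether the PR covers every path has to be checked against the actual diff.

```cpp
#include <atomic>
#include <condition_variable>
#include <cstdint>
#include <mutex>

class InterruptibleWait
{
public:
    void NotifyWorker()
    {
        workEpoch.fetch_add(1, std::memory_order_release);
        { std::lock_guard<std::mutex> lock(workMutex); }
        workCv.notify_one();
    }

    // Shutdown path: set the flag first, then wake any waiter.
    void Interrupt()
    {
        interrupted.store(true, std::memory_order_release);
        // Synchronize with a waiter that checked the predicate but has not blocked yet.
        { std::lock_guard<std::mutex> lock(workMutex); }
        workCv.notify_all();
    }

    uint64_t CurrentEpoch() const { return workEpoch.load(std::memory_order_acquire); }

    // Blocks with no timeout. Returns true if woken for new work,
    // false if the worker should exit because of an interrupt.
    bool WaitForWork(uint64_t startEpoch)
    {
        std::unique_lock<std::mutex> lock(workMutex);
        workCv.wait(lock, [&] {
            return interrupted.load(std::memory_order_acquire) ||
                   workEpoch.load(std::memory_order_acquire) != startEpoch;
        });
        return !interrupted.load(std::memory_order_acquire);
    }

private:
    std::mutex workMutex;
    std::condition_variable workCv;
    std::atomic<uint64_t> workEpoch{0};
    std::atomic<bool> interrupted{false};
};
```

With this shape, an interrupt issued before the worker reaches the wait is seen by the first predicate check, and one issued afterwards is delivered by `notify_all()`.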

…nager Adjusted the formatting of the NotifyWorker function in CInstantSendManager for improved readability. This change includes consistent spacing and indentation, enhancing code maintainability without altering functionality.

…ingSigShares Modified the WorkThreadMain function in signing.h and SignPendingSigShares in signing_shares.h to specify exclusive lock requirements. This change enhances thread safety by ensuring proper locking mechanisms are in place, preventing potential race conditions during concurrent operations.

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (2)

src/llmq/signing_shares.h (2)
383-387: Concurrency primitives look right; please document lock ordering

Using Mutex + condition_variable_any is appropriate.

To avoid deadlocks, define/order: never block on workCv while holding cs or cs_pendingSigns; avoid taking workMutex while holding cs/cs_pendingSigns (or document a single consistent order if needed).

496-498: Restore thread-safety annotation on WorkThreadMain; mark NotifyWorker noexcept

The removed EXCLUSIVE_LOCKS_REQUIRED(!cs_pendingSigns) weakens static analysis. If WorkThreadMain still must not be called with cs_pendingSigns held, keep the attribute; otherwise use LOCKS_EXCLUDED(cs_pendingSigns).

NotifyWorker should be noexcept (only increments and notifies).

Apply this diff:
-    void WorkThreadMain(CConnman& connman, PeerManager& peerman);
-    void NotifyWorker();
+    void WorkThreadMain(CConnman& connman, PeerManager& peerman)
+        EXCLUSIVE_LOCKS_REQUIRED(!cs_pendingSigns);
+    void NotifyWorker() noexcept;

📜 Review details

📥 Commits

Reviewing files that changed from the base of the PR and between dda0aef and 708fa0d.

📒 Files selected for processing (2)

src/llmq/signing.h (2 hunks)
src/llmq/signing_shares.h (3 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

src/llmq/signing.h

🧰 Additional context used

Files:

src/llmq/signing_shares.h

🔇 Additional comments (2)

src/llmq/signing_shares.h (2)

29-29: Include OK for condition_variable_any

Correct header for std::condition_variable_any with custom Mutex.

494-495: Good: explicit “not-held” requirement

EXCLUSIVE_LOCKS_REQUIRED(!cs_pendingSigns) on SignPendingSigShares matches AsyncSign and prevents calls under that lock.

…hares.h Added exclusive lock requirements to the WorkThreadMain function in signing_shares.h to enhance thread safety. This change ensures that the function operates correctly under concurrent conditions, preventing potential race conditions.

UdjinM6 · 2025-10-17T11:33:02Z

running feature_llmq_signing.py, monitoring CPU levels:
develop: 5-10%
this PR: 100%

UdjinM6

119b81f should fix CPU issue but there is more, pls see below.

UdjinM6 · 2025-10-17T20:25:21Z

src/llmq/signing_shares.cpp

-            SendMessages(connman);
-            lastSendTime = TicksSinceEpoch<std::chrono::milliseconds>(SystemClock::now());
-        }
+        bool didSend = SendMessages(connman);


I think we had a cooldown for SendMessages() for a reason, but I don't remember exactly why 🙈 All qsig* messages contain vectors, so maybe waiting a bit allowed us to utilise them better. Or maybe we simply did not want to push messages too often. Anyway, we need to confirm that there is no regression in this part.

src/llmq/signing_shares.cpp

…antSendManager Updated the CInstantSendManager to use a new struct, PendingISLockFromPeer, for better clarity and type safety. This change replaces the use of std::pair for storing node ID and InstantSendLockPtr, enhancing code readability and maintainability across multiple functions handling instant send locks.

The 'pend' local variable in ProcessPendingInstantSendLocks was previously using the same data structure as pendingInstantSendLocks (a hash map). However, once we're in the processing step, we only iterate sequentially through the locks - there are no hash-based lookups.

This commit changes 'pend' to use std::vector for better performance:
- Improved cache locality with contiguous memory layout
- Better CPU prefetching during iteration (3x through the data)
- Eliminates hash map overhead (bucket allocation, pointer chasing)
- Filtering step uses build-new-vector approach to maintain O(n)

The typical case processes up to 32 locks, making the vector's sequential access pattern ideal for modern CPU cache hierarchies.
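
A rough sketch of the approach this commit message describes, under assumptions: `PendingItem`, `TakeBatch`, and `KeepIf` are hypothetical names, the key type is simplified to `int`, and the real code operates on InstantSend lock objects rather than strings.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical stand-in for a pending lock received from a peer.
struct PendingItem {
    int64_t node_id;
    std::string payload;
};

// Move up to maxCount entries out of the keyed pending map into a flat vector.
std::vector<PendingItem> TakeBatch(std::unordered_map<int, PendingItem>& pending, size_t maxCount)
{
    std::vector<PendingItem> batch;
    batch.reserve(std::min(maxCount, pending.size()));
    auto it = pending.begin();
    while (it != pending.end() && batch.size() < maxCount) {
        batch.push_back(std::move(it->second));
        it = pending.erase(it); // erase returns the next valid iterator
    }
    return batch;
}

// Build-new-vector filter: a single O(n) pass that preserves order,
// instead of erasing from the middle of the existing vector.
template <typename Pred>
std::vector<PendingItem> KeepIf(std::vector<PendingItem>&& batch, Pred keep)
{
    std::vector<PendingItem> filtered;
    filtered.reserve(batch.size());
    for (auto& item : batch) {
        if (keep(item)) filtered.push_back(std::move(item));
    }
    return filtered;
}
```

The keyed map stays in place for deduplicating incoming items; only the per-iteration working set becomes a vector.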

Fixes time-mixing bugs where mocked time (controllable in tests) was being used to compute steady_clock deadlines. Since mocked time and system time move independently, this caused incorrect wait behavior in tests.

Changes:
- Use steady_clock::time_point for all wait deadlines (nextAttemptTime, lastCleanupTimeSteady)
- Keep mocked time (GetTime<>()) for business logic only (timeouts, session tracking)
- Remove redundant workEpoch increments (NotifyWorker already does this)
- Move NotifyWorker() calls to individual message handlers for better control

This ensures that:
1. In production: steady_clock provides monotonic, reliable timing
2. In tests: mocked time controls business logic while steady_clock handles waits
3. No double-incrementing of workEpoch that could cause busy-wait issues
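
The separation this commit message describes can be sketched roughly as below. Everything here is hypothetical: `MockableClock` merely imitates a test-controllable clock, and `SessionExpired` / `NextCleanupDeadline` are illustrative helpers rather than functions from the codebase.

```cpp
#include <chrono>

// Hypothetical mockable clock: tests can pin it, production reads the system clock.
struct MockableClock {
    std::chrono::seconds mockTime{0}; // zero means "not mocked"

    std::chrono::seconds Now() const
    {
        if (mockTime.count() != 0) return mockTime;
        return std::chrono::duration_cast<std::chrono::seconds>(
            std::chrono::system_clock::now().time_since_epoch());
    }
};

// Business logic: session timeouts follow the mockable clock, so tests can fast-forward them.
inline bool SessionExpired(std::chrono::seconds now, std::chrono::seconds started,
                           std::chrono::seconds timeout)
{
    return now - started >= timeout;
}

// Wait scheduling: deadlines are derived only from steady_clock values,
// so changing the mocked time cannot move them.
inline std::chrono::steady_clock::time_point NextCleanupDeadline(
    std::chrono::steady_clock::time_point lastCleanupSteady, std::chrono::milliseconds interval)
{
    return lastCleanupSteady + interval;
}
```

Keeping the two clocks in separate types also makes accidental mixing harder, since a `seconds` value from the mockable clock cannot be passed where a `steady_clock::time_point` is expected.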

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

src/instantsend/instantsend.cpp (1)
1-1: Fix clang-format differences.

The pipeline reports clang-format differences in this file. Please run the formatting script to ensure code style consistency.

Run the following command to fix formatting:
clang-format -i src/instantsend/instantsend.cpp

♻️ Duplicate comments (1)

src/llmq/signing_shares.cpp (1)
1510-1517: Tight loop when didSend/fMoreWork → likely cause of 100% CPU. Add a micro coalescing wait.

Continuing immediately after sending or detecting more work can hot‑loop under load. Add a very short wait with the same epoch predicate to batch messages and avoid CPU peg, preserving sub‑ms latency.
-        // If there is more work or we just sent something, iterate again without waiting
-        if (fMoreWork || didSend) {
-            continue;
-        }
+        // If there is more work or we just sent something, briefly coalesce to avoid hot looping
+        if (fMoreWork || didSend) {
+            std::unique_lock<Mutex> l(workMutex);
+            workCv.wait_for(l, std::chrono::milliseconds(1), [this, startEpoch]{
+                return bool(workInterrupt) || workEpoch.load(std::memory_order_acquire) != startEpoch;
+            });
+            continue;
+        }
This also addresses the previously raised concern about removing the SendMessages cooldown.

🧹 Nitpick comments (4)

src/llmq/signing.cpp (1)
805-806: Nit: thread name mismatch.

CSigningManager worker is named "sigshares". Prefer a specific name like "recsigs" for profiling.
-    workThread = std::thread(&util::TraceThread, "sigshares", [this, &peerman] { WorkThreadMain(peerman); });
+    workThread = std::thread(&util::TraceThread, "recsigs", [this, &peerman] { WorkThreadMain(peerman); });

src/llmq/signing_shares.cpp (2)

1260-1281: Message batching cadence: validate no regression without explicit cooldown.

Given frequent immediate loops, ensure QSIG* batching still groups efficiently under typical traffic. If not, consider a small per-peer send window (1–2ms) similar to the coalescing wait above.

1472-1498: Wake on BanNode is good, but double-check request re-routing pressure.

After banning, mass re-requests can cascade. Consider limiting per-iteration re-requests to avoid bursts.

src/instantsend/instantsend.cpp (1)
222-229: Optional: Remove trailing blank line.

Line 229 contains a trailing blank line that could be removed for consistency.

Apply this diff:
             filteredPend.push_back(std::move(p));
         }
     }
-    
+
     // Now check against the previous active set and perform banning if this fails

📜 Review details

📥 Commits

Reviewing files that changed from the base of the PR and between 5177f80 and ae2dc7a.

📒 Files selected for processing (6)

src/instantsend/instantsend.cpp (20 hunks)
src/instantsend/instantsend.h (5 hunks)
src/llmq/signing.cpp (6 hunks)
src/llmq/signing.h (3 hunks)
src/llmq/signing_shares.cpp (13 hunks)
src/llmq/signing_shares.h (5 hunks)

🧰 Additional context used

Files:

src/llmq/signing.cpp
src/llmq/signing_shares.h
src/llmq/signing.h
src/llmq/signing_shares.cpp
src/instantsend/instantsend.h
src/instantsend/instantsend.cpp

🧬 Code graph analysis (6)

src/llmq/signing.cpp (3)

src/llmq/signing.h (1)

NotifyWorker (252-252)

src/llmq/signing_shares.cpp (7)

NotifyWorker (1699-1704)

NotifyWorker (1699-1699)

WorkThreadMain (1500-1553)

WorkThreadMain (1500-1500)

Cleanup (1297-1434)

Cleanup (1297-1297)

l (1541-1541)

src/instantsend/instantsend.cpp (3)

WorkThreadMain (933-970)

WorkThreadMain (933-933)

l (965-965)

src/llmq/signing_shares.h (1)

src/llmq/signing_shares.cpp (6)

SignPendingSigShares (1563-1587)

SignPendingSigShares (1563-1563)

WorkThreadMain (1500-1553)

WorkThreadMain (1500-1500)

NotifyWorker (1699-1704)

NotifyWorker (1699-1699)

src/llmq/signing.h (4)

src/llmq/signing.cpp (8)

WorkThreadMain (826-856)

WorkThreadMain (826-826)

StartWorkerThread (798-806)

StartWorkerThread (798-798)

StopWorkerThread (808-818)

StopWorkerThread (808-808)

InterruptWorkerThread (820-824)

InterruptWorkerThread (820-820)

src/llmq/signing_shares.cpp (10)

WorkThreadMain (1500-1553)

WorkThreadMain (1500-1500)

StartWorkerThread (182-191)

StartWorkerThread (182-182)

StopWorkerThread (193-203)

StopWorkerThread (193-193)

InterruptWorkerThread (215-220)

InterruptWorkerThread (215-215)

NotifyWorker (1699-1704)

NotifyWorker (1699-1699)

src/instantsend/instantsend.cpp (2)

WorkThreadMain (933-970)

WorkThreadMain (933-933)

src/instantsend/instantsend.h (2)

InterruptWorkerThread (117-117)

NotifyWorker (146-149)

src/llmq/signing_shares.cpp (5)

src/llmq/signing.h (1)

NotifyWorker (252-252)

src/instantsend/instantsend.h (1)

NotifyWorker (146-149)

src/util/time.h (1)

now (21-55)

src/llmq/signing.cpp (5)

WorkThreadMain (826-856)

WorkThreadMain (826-826)

Cleanup (651-665)

Cleanup (651-651)

l (835-835)

src/instantsend/instantsend.cpp (3)

WorkThreadMain (933-970)

WorkThreadMain (933-933)

l (965-965)

src/instantsend/instantsend.h (5)

src/instantsend/lock.h (1)

instantsend (18-41)

src/llmq/signing.cpp (2)

InterruptWorkerThread (820-824)

InterruptWorkerThread (820-820)

src/llmq/signing_shares.cpp (4)

InterruptWorkerThread (215-220)

InterruptWorkerThread (215-215)

NotifyWorker (1699-1704)

NotifyWorker (1699-1699)

src/instantsend/instantsend.cpp (4)

ProcessPendingInstantSendLocks (170-235)

ProcessPendingInstantSendLocks (170-170)

ProcessPendingInstantSendLocks (237-347)

ProcessPendingInstantSendLocks (237-240)

src/llmq/signing.h (1)

NotifyWorker (252-252)

src/instantsend/instantsend.cpp (2)

src/llmq/signing.h (1)

NotifyWorker (252-252)

src/instantsend/instantsend.h (1)

NotifyWorker (146-149)

🪛 GitHub Actions: Clang Diff Format Check

src/llmq/signing.cpp