Skip to content

Conversation

kaikulimu
Copy link
Collaborator

@kaikulimu kaikulimu commented Sep 25, 2025

In Partition FSM, we only call do_startWatchDog when we transition into a primary or replica. At this time, we should always clear any existing watchdog and start the timer anew.

This fixes error of form:

ERROR mqbc_storagemanager.cpp:1198 Cluster (clusterName) Partition [5]: Failed to cancel WatchDog, rc: 1

due to negligence to start watchdog.

Testing

Did manual testing to ensure watchdog starts anew every time.

Signed-off-by: Yuan Jing Vincent Yan <yyan82@bloomberg.net>
Signed-off-by: Yuan Jing Vincent Yan <yyan82@bloomberg.net>
Copy link

@bmq-oss-ci bmq-oss-ci bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Build 3050 of commit 925591c has completed with FAILURE

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants