HBASE-29386: SnapshotProcedure and EnableTableProcedure can cause a deadlock #7084

hgromer · 2025-06-10T01:39:29Z

No description provided.

hgromer · 2025-06-10T01:40:46Z

@rmdmattingly @ndimiduk @charlesconnell @sidkhillon @krconv

hgromer · 2025-06-10T01:43:21Z

...rc/test/java/org/apache/hadoop/hbase/master/procedure/TestSnapshotProcedureConcurrently.java

@@ -136,4 +138,48 @@ public void run() {
    SnapshotTestingUtils.confirmSnapshotValid(TEST_UTIL, snapshotProto, TABLE_NAME, CF);
    SnapshotTestingUtils.confirmSnapshotValid(TEST_UTIL, snapshotOnSameTableProto, TABLE_NAME, CF);
  }
+
+  @Test
+  public void testItCanEnableTableWhileSnapshotProcedureIsRunning() throws Exception {


This test fails without my changes, both procedures will run forever. This also functions as the reproduction of the phenomenon.

For reproducibility, I manually set the table state to ENABLING, and then kick off the EnableTableProcedure at the stage it gets stuck on (after subprocesses finish)

hgromer · 2025-06-10T01:47:13Z

...-procedure/src/main/java/org/apache/hadoop/hbase/procedure2/ProcedureSuspendedException.java

+    this.shouldForceLockRelease = shouldForceLockRelease;
+  }
+
+  public boolean shouldForceLockRelease() {


The Procedure class has a holdLock method which returns true for SnapshotProcedure. This prevents us from releasing the lock even on suspension. I believe that at the stage where the deadlock could happen, it's safe to release the lock. We haven't done any snapshotting yet, so I believe there shouldn't be any weird data inconsistencies from region splits/merges.

Shouldn't we just modify the SnapshotProcedure.holdLock() behaviour? Afterall, this is contrary to what we say in that method comment:

protected boolean holdLock(MasterProcedureEnv env) { // In order to avoid enabling/disabling/modifying/deleting table during snapshot, // we don't release lock during suspend return true; }
And you are already modifying SnapshotProcedure execution logic to reflect the special condition when the lock should be released, so better keep this there and avoid changing ProcedureSuspendedException?

I don't think we should. If we did, we'd have potential inconsistencies. As I mention in my comment, this bypass allows us to release the lock prior to us actually snapshotting any data. Once we start snapshotting (anything in at or past SNAPSHOT_SNAPSHOT_ONLINE_REGIONS) it is unsafe to release the lock.

I think the comment still holds. We want to avoid enabling/disabling/modifying/deleting a table during a snapshot. The procedure suspension occurs prior to us starting to snapshot any data, at SNAPSHOT_WRITE_SNAPSHOT_INFO.

Alternatively, maybe a less confusing option would be to return a non-static value for holdLock in SnapshotProcedure

This forces you to manage additional state at each stage though, which I why I opted for this implementation which I thought was safer

I think the comment still holds. We want to avoid enabling/disabling/modifying/deleting a table during a snapshot.

We may want to avoid, but now we are allowing a case where we suspend and release the lock.

Alternatively, maybe a less confusing option would be to return a non-static value for holdLock in SnapshotProcedure

That's what I meant by "modify the SnapshotProcedure.holdLock() behaviour". You are already modifying SnapshotProcedure behaviour to cause a release lock suspension via a flag in the ProcedureSuspendedException. We could keep this "flag" internally in SnapshotProcedure and use it to define what SnapshotProcedure.holdLock() returns.

This forces you to manage additional state at each stage though, which I why I opted for this implementation which I thought was safer

True. Yet, it seems more intuitive to me. But that's a personal opinion, just wanted to make sure it was also considered. Up to you to decide on the "final" solution. If going with the original, just please amend the comment in holdLock to explain the scenario where we release the lock.

hgromer · 2025-06-10T01:50:31Z

hbase-server/src/main/java/org/apache/hadoop/hbase/master/procedure/SnapshotProcedure.java

          TableState tableState =
            env.getMasterServices().getTableStateManager().getTableState(snapshotTable);
          if (tableState.isEnabled()) {
+            SnapshotDescriptionUtils.writeSnapshotInfo(snapshot, workingDir, workingDirFS);


Move inside the if statements to avoid unnecessary I/O in case we need to suspend

charlesconnell · 2025-06-10T13:27:06Z

Another option, which is simpler to reason about, is to simply fail the SnapshotProcedure if the table isn't in a state it can handle. This was my approach in HBASE-29315, for the exact same reasons you're encountering here. I couldn't usefully "sleep" a SplitTableRegionProcedure because it used holdLock=true. I think it would be good to agree on a standard of what to do in these situations.

Failing a procedure is simpler and doesn't introduce more edge cases in the procedure executor state machine. However, obviously, it's a better user experience if your procedures get executed eventually.

hgromer · 2025-06-10T13:41:19Z

Another option, which is simpler to reason about, is to simply fail the SnapshotProcedure if the table isn't in a state it can handle. This was my approach in HBASE-29315, for the exact same reasons you're encountering here. I couldn't usefully "sleep" a SplitTableRegionProcedure because it used holdLock=true. I think it would be good to agree on a standard of what to do in these situations.

Failing a procedure is simpler and doesn't introduce more edge cases in the procedure executor state machine. However, obviously, it's a better user experience if your procedures get executed eventually.

Agreed. The reason I opted for supsending the procedure is because this is already something we do if we are splitting or merging regions. So this procedure will already suspend and resume in the case the table state isn't optimal.

I don't think I have a very strong opinion one way or another, though. Enabling/disabling tables happens often enough that it might be fine to simply fail and force the user to manually handle the failure.

cc @Apache9 Don't know if you have any strong opinions here either way.

rmdmattingly · 2025-06-10T15:53:56Z

Enabling/disabling tables happens often enough that it might be fine to simply fail and force the user to manually handle the failure.

Do you mean "happens ~~often~~ infrequently enough"? If so, I agree that snapshotting a disabled table should be a pretty exceptional request, and so it is fine to err on the side of simplicity and just fail the procedure

hgromer · 2025-06-10T16:28:06Z

Enabling/disabling tables happens often enough that it might be fine to simply fail and force the user to manually handle the failure.

Do you mean "happens ~~often~~ infrequently enough"? If so, I agree that snapshotting a disabled table should be a pretty exceptional request, and so it is fine to err on the side of simplicity and just fail the procedure

Yes I did; I am happy to fail the procedure if that's the general consensus. That being said, I do think modern backups start to stress this system out. Any call to the BackupAdmin triggers an enable table procedure, so we're more likely to get into this state.

If we are okay with failing, then the user will have to manually kick off another snapshot on failure.

Another possible solution would be to reset the state of the snapshot procedure, so that it needs to run from the beginning

if (tableState.isEnabled()) {
  // action 
} else if (tableState.isDisabled) {
  // action
} else {
  // set up suspension timeout/persistence
  setNextState(SnapshotState.SNAPSHOT_PREPARE);
}

hgromer · 2025-06-10T18:14:22Z

going to proceed with the strategy of simply failing

charlesconnell

lgtm

Apache9

When implementing this SnapshotProcedure, we decided to use shared lock and holdLock = true to prevent other table procedure jumps in in the middle of our execution while not hurting the availability because exclusive lock will also prevent region assigning.

In HBASE-28683, we introduced a new way to only allow one procedure to run at the same time for the same table, so maybe a possible way is to make SnapshotProcedure also acquire exclusive lock and set holdLock = false, so it will not be executed at the same time with Enable and Disable procedure.

Thanks.

Apache9 · 2025-06-11T01:35:22Z

hbase-server/src/main/java/org/apache/hadoop/hbase/master/procedure/SnapshotProcedure.java

            setNextState(SnapshotState.SNAPSHOT_SNAPSHOT_CLOSED_REGIONS);
+          } else {
+            setState(ProcedureState.FAILED);
+            throw new ProcedureAbortedException(


Just use setFailure and then return Flow.NO_MORE_STATE.

Will do, thank you for the suggestion

hgromer · 2025-06-11T12:30:36Z

When implementing this SnapshotProcedure, we decided to use shared lock and holdLock = true to prevent other table procedure jumps in in the middle of our execution while not hurting the availability because exclusive lock will also prevent region assigning.

In HBASE-28683, we introduced a new way to only allow one procedure to run at the same time for the same table, so maybe a possible way is to make SnapshotProcedure also acquire exclusive lock and set holdLock = false, so it will not be executed at the same time with Enable and Disable procedure.

Thanks.

That's good to know, thank you for the context. The issue here is that the EnableTableProcedure will release the lock after it's subprocedures finish, which allows the SnapshotProcedure to execute on a table that is enabling. The SnapshotProcedure then gets stuck continuously re-running the same state over again, and will refuse to release the lock, creating a deadlock

With all this said, I think it makes the most sense, for simplicity's sake, to fail the procedure if the table is in an invalid state.

Apache9 · 2025-06-13T02:01:41Z

When implementing this SnapshotProcedure, we decided to use shared lock and holdLock = true to prevent other table procedure jumps in in the middle of our execution while not hurting the availability because exclusive lock will also prevent region assigning.
In HBASE-28683, we introduced a new way to only allow one procedure to run at the same time for the same table, so maybe a possible way is to make SnapshotProcedure also acquire exclusive lock and set holdLock = false, so it will not be executed at the same time with Enable and Disable procedure.
Thanks.

That's good to know, thank you for the context. The issue here is that the EnableTableProcedure will release the lock after it's subprocedures finish, which allows the SnapshotProcedure to execute on a table that is enabling. The SnapshotProcedure then gets stuck continuously re-running the same state over again, and will refuse to release the lock, creating a deadlock

With all this said, I think it makes the most sense, for simplicity's sake, to fail the procedure if the table is in an invalid state.

Using the mechanism in HBASE-28683, SnapshotProcedure can not be executed together with EnableTableProcedure/DisabledTableProcedure together, which could also solve the problem, and I think it is more stable. I'm not sure whether ModifyTableProcedure could also affect SnapshotProcedure if it jumps in just in the middle of the execution...

hgromer · 2025-06-13T16:13:17Z

When implementing this SnapshotProcedure, we decided to use shared lock and holdLock = true to prevent other table procedure jumps in in the middle of our execution while not hurting the availability because exclusive lock will also prevent region assigning.
In HBASE-28683, we introduced a new way to only allow one procedure to run at the same time for the same table, so maybe a possible way is to make SnapshotProcedure also acquire exclusive lock and set holdLock = false, so it will not be executed at the same time with Enable and Disable procedure.
Thanks.

That's good to know, thank you for the context. The issue here is that the EnableTableProcedure will release the lock after it's subprocedures finish, which allows the SnapshotProcedure to execute on a table that is enabling. The SnapshotProcedure then gets stuck continuously re-running the same state over again, and will refuse to release the lock, creating a deadlock
With all this said, I think it makes the most sense, for simplicity's sake, to fail the procedure if the table is in an invalid state.

Using the mechanism in HBASE-28683, SnapshotProcedure can not be executed together with EnableTableProcedure/DisabledTableProcedure together, which could also solve the problem, and I think it is more stable. I'm not sure whether ModifyTableProcedure could also affect SnapshotProcedure if it jumps in just in the middle of the execution...

Yes, I'm able to avoid the deadlock if the SnapshotProcedure both sets holdLock = true and also yields after procedure execution. Essentially allowing both SnapshotProcedure and EnableTableProcedure to execute without the need to throw any sort of yield / suspended exception (which was my initial implementation).

This is certainly a cleaner implementation, though it means we can interleave other table procedures after cycles of the SnapshotProcedure.

You hint at this here

I'm not sure whether ModifyTableProcedure could also affect SnapshotProcedure if it jumps in just in the middle of the execution...

and I believe that it would be problematic because you may snapshot regions multiple times, regions may move or go offline, so you may miss out on snapshotting regions.

My initial implementation prevented this by only allowing SNAPSHOT_WRITE_SNAPSHOT_INFO to release the lock (when it's still safe to do so). However, I don't think it's safe to yield the lock after any cycle.

After some thought, I think we still have two solutions here:

My initial implementation, which throws an exception that indicates we should release the lock from within SNAPSHOT_WRITE_SNAPSHOT_INFO
Fail the procedure

It seems the consensus is to avoid complicating the procedure and fail if we encounter the table in an invalid state, so I'm leaning towards 2 thought admittedly I'm a bit torn

Apache9 · 2025-06-14T09:40:03Z

This is certainly a cleaner implementation, though it means we can interleave other table procedures after cycles of the SnapshotProcedure.

FWIW, if a SnapshotProcedure starts, no other table procedures which need a exclusive lock can be executed...

And what I mean is that, we can reuse the mechanism introduced in HBASE-28683 to simply fix the problem.

Just change the code here

hbase/hbase-server/src/main/java/org/apache/hadoop/hbase/master/procedure/TableQueue.java

Line 62 in 64c582f

case SNAPSHOT:

Make Snapshot also return true, and change SnapshotProcedure's acquireLock method to also require exclusive lock, change holdLock to return false, then we are safe.

Thanks.

hgromer · 2025-06-14T16:00:58Z

This is certainly a cleaner implementation, though it means we can interleave other table procedures after cycles of the SnapshotProcedure.

FWIW, if a SnapshotProcedure starts, no other table procedures which need a exclusive lock can be executed...

And what I mean is that, we can reuse the mechanism introduced in HBASE-28683 to simply fix the problem.

Just change the code here

hbase/hbase-server/src/main/java/org/apache/hadoop/hbase/master/procedure/TableQueue.java

Line 62 in 64c582f

case SNAPSHOT:

Make Snapshot also return true, and change SnapshotProcedure's acquireLock method to also require exclusive lock, change holdLock to return false, then we are safe.

Thanks.

Ah okay, I understand. This does prevent us from being able to take multiple snapshots of the table, as mentioned in this comment but that seems ok

Apache9 · 2025-06-18T15:01:38Z

TestSnapshotProcedureRIT.testTableInMergeWhileTakingSnapshot failed, I guess this is related to our changes here?

hgromer · 2025-06-18T17:15:34Z

TestSnapshotProcedureRIT.testTableInMergeWhileTakingSnapshot failed, I guess this is related to our changes here?

Yes must be related I am taking a look

Apache-HBase · 2025-06-20T18:26:35Z

🎊 +1 overall

Vote	Subsystem	Runtime	Logfile	Comment
+0 🆗	reexec	0m 32s		Docker mode activated.
			_ Prechecks _
+1 💚	dupname	0m 0s		No case conflicting files found.
+0 🆗	codespell	0m 0s		codespell was not available.
+0 🆗	detsecrets	0m 0s		detect-secrets was not available.
+1 💚	@author	0m 0s		The patch does not contain any @author tags.
+1 💚	hbaseanti	0m 0s		Patch does not have any anti-patterns.
			_ master Compile Tests _
+1 💚	mvninstall	3m 54s		master passed
+1 💚	compile	3m 24s		master passed
+1 💚	checkstyle	0m 38s		master passed
+1 💚	spotbugs	1m 40s		master passed
+1 💚	spotless	0m 49s		branch has no errors when running spotless:check.
			_ Patch Compile Tests _
+1 💚	mvninstall	3m 7s		the patch passed
+1 💚	compile	3m 17s		the patch passed
+1 💚	javac	3m 17s		the patch passed
+1 💚	blanks	0m 0s		The patch has no blanks issues.
+1 💚	checkstyle	0m 37s		the patch passed
+1 💚	spotbugs	1m 42s		the patch passed
+1 💚	hadoopcheck	12m 16s		Patch does not cause any errors with Hadoop 3.3.6 3.4.0.
+1 💚	spotless	0m 45s		patch has no errors when running spotless:check.
			_ Other Tests _
+1 💚	asflicense	0m 10s		The patch does not generate ASF License warnings.
		40m 38s

Subsystem	Report/Notes
Docker	ClientAPI=1.43 ServerAPI=1.43 base: https://ci-hbase.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-7084/9/artifact/yetus-general-check/output/Dockerfile
GITHUB PR	#7084
JIRA Issue	HBASE-29386
Optional Tests	dupname asflicense javac spotbugs checkstyle codespell detsecrets compile hadoopcheck hbaseanti spotless
uname	Linux 4e14747c481e 5.4.0-1103-aws #111~18.04.1-Ubuntu SMP Tue May 23 20:04:10 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	dev-support/hbase-personality.sh
git revision	master / `f14a7cf`
Default Java	Eclipse Adoptium-17.0.11+9
Max. process+thread count	84 (vs. ulimit of 30000)
modules	C: hbase-server U: hbase-server
Console output	https://ci-hbase.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-7084/9/console
versions	git=2.34.1 maven=3.9.8 spotbugs=4.7.3
Powered by	Apache Yetus 0.15.0 https://yetus.apache.org

This message was automatically generated.

Apache-HBase · 2025-06-20T21:48:37Z

💔 -1 overall

Vote	Subsystem	Runtime	Logfile	Comment
+0 🆗	reexec	0m 31s		Docker mode activated.
-0 ⚠️	yetus	0m 3s		Unprocessed flag(s): --brief-report-file --spotbugs-strict-precheck --author-ignore-list --blanks-eol-ignore-file --blanks-tabs-ignore-file --quick-hadoopcheck
			_ Prechecks _
			_ master Compile Tests _
+1 💚	mvninstall	3m 53s		master passed
+1 💚	compile	0m 58s		master passed
+1 💚	javadoc	0m 28s		master passed
+1 💚	shadedjars	6m 5s		branch has no errors when building our shaded downstream artifacts.
			_ Patch Compile Tests _
+1 💚	mvninstall	3m 11s		the patch passed
+1 💚	compile	0m 57s		the patch passed
+1 💚	javac	0m 57s		the patch passed
+1 💚	javadoc	0m 27s		the patch passed
+1 💚	shadedjars	6m 2s		patch has no errors when building our shaded downstream artifacts.
			_ Other Tests _
-1 ❌	unit	214m 55s	/patch-unit-hbase-server.txt	hbase-server in the patch failed.
		242m 39s

Subsystem	Report/Notes
Docker	ClientAPI=1.43 ServerAPI=1.43 base: https://ci-hbase.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-7084/9/artifact/yetus-jdk17-hadoop3-check/output/Dockerfile
GITHUB PR	#7084
JIRA Issue	HBASE-29386
Optional Tests	javac javadoc unit compile shadedjars
uname	Linux 9a40d0fa1370 5.4.0-1103-aws #111~18.04.1-Ubuntu SMP Tue May 23 20:04:10 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	dev-support/hbase-personality.sh
git revision	master / `f14a7cf`
Default Java	Eclipse Adoptium-17.0.11+9
Test Results	https://ci-hbase.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-7084/9/testReport/
Max. process+thread count	5312 (vs. ulimit of 30000)
modules	C: hbase-server U: hbase-server
Console output	https://ci-hbase.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-7084/9/console
versions	git=2.34.1 maven=3.9.8
Powered by	Apache Yetus 0.15.0 https://yetus.apache.org

This message was automatically generated.

hgromer commented Jun 10, 2025

View reviewed changes

hgromer force-pushed the HBASE-29386 branch from be9f2b4 to 18bd20f Compare June 10, 2025 01:48

hgromer commented Jun 10, 2025

View reviewed changes