e2e: Increase all ANR timeouts to 2m to ensure CI reliability. #1733

marun · 2023-07-19T05:47:53Z

Why this should be merged

CI seems to be exceeding many ANR-related timeouts. Rather than bumping timeouts piecemeal, all ANR timeouts are set to the same constant of 2 minutes.
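As a rough illustration of the "one constant" approach described above, here is a minimal Go sketch. The names `DefaultANRTimeout` and `newANRContext` are hypothetical and are not taken from this PR's diff; only the 2-minute value comes from the description.

```go
package main

import (
	"context"
	"fmt"
	"time"
)

// DefaultANRTimeout is a hypothetical name for the single shared timeout.
// The 2-minute value is the one stated in the PR description.
const DefaultANRTimeout = 2 * time.Minute

// newANRContext returns a context that expires after DefaultANRTimeout.
// In this sketch, every ANR call in a test would derive its context here
// instead of choosing its own duration.
func newANRContext() (context.Context, context.CancelFunc) {
	return context.WithTimeout(context.Background(), DefaultANRTimeout)
}

func main() {
	ctx, cancel := newANRContext()
	defer cancel()

	deadline, _ := ctx.Deadline()
	fmt.Println("time until deadline:", time.Until(deadline).Round(time.Second))
}
```

With a single constant, a later adjustment for slower CI runners would be a one-line change rather than a hunt through individual call sites.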

How this works

How this was tested

abi87

Thanks for spotting this!

StephenButtolph · 2023-07-19T16:43:18Z

tests/e2e/e2e.go

-		// start is async, so wait some time for cluster health
-		time.Sleep(time.Minute)


Was this sleep just never needed? A minute-long sleep is a long time.

Maybe it was needed in the past, but it doesn't seem to be needed now.

Not needed anymore.

And to be clear, this sleep ensured each e2e job was wasting 30-40s. At least with a timeout the check can complete as soon as the nodes are ready, but this sleep will never exit early if the nodes are healthy earlier than expected.

The missing piece here is understanding that Health actually blocks until Healthy.

Yup, that was very old ANR tech debt. With the first gRPC server implementation, start (with blockchain creation; I know that is not the case here, but this was probably copied) broke the subsequent health call unless the test slept for some time first.
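The difference between a fixed sleep and a blocking health check with a timeout, as discussed in this thread, can be sketched in Go as follows. This is a generic polling illustration, not the ANR client API: `waitUntilHealthy` and its parameters are hypothetical, and the real Health call is only described here as blocking until the cluster is healthy.

```go
package main

import (
	"context"
	"errors"
	"fmt"
	"time"
)

// waitUntilHealthy polls isHealthy every interval and returns nil as soon as
// it reports true, or an error once timeout elapses. The function and its
// parameters are hypothetical; they are not the ANR client API.
func waitUntilHealthy(timeout, interval time.Duration, isHealthy func() bool) error {
	ctx, cancel := context.WithTimeout(context.Background(), timeout)
	defer cancel()

	ticker := time.NewTicker(interval)
	defer ticker.Stop()

	for {
		if isHealthy() {
			return nil // exit early: no time is wasted once nodes are ready
		}
		select {
		case <-ctx.Done():
			return errors.New("cluster not healthy before timeout")
		case <-ticker.C:
			// Interval elapsed; loop around and check again.
		}
	}
}

func main() {
	start := time.Now()
	readyAt := start.Add(50 * time.Millisecond) // simulated cluster startup

	err := waitUntilHealthy(2*time.Minute, 10*time.Millisecond, func() bool {
		return time.Now().After(readyAt)
	})
	fmt.Println("err:", err, "waited:", time.Since(start).Round(10*time.Millisecond))
}
```

Unlike `time.Sleep(time.Minute)`, the timeout here is only an upper bound: the call returns as soon as the check succeeds, so a generous 2-minute limit costs nothing when the nodes come up quickly.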

e2e: Increase all ANR timeouts to 2m to ensure CI reliability.

f9a71be

marun requested review from abi87 and gyuho as code owners July 19, 2023 05:47

abi87 approved these changes Jul 19, 2023

abi87 assigned marun Jul 19, 2023

abi87 requested a review from StephenButtolph July 19, 2023 07:07

dhrubabasu approved these changes Jul 19, 2023

marun mentioned this pull request Jul 19, 2023

Write process context on node start to simplify test orchestration #1729

Merged

StephenButtolph reviewed Jul 19, 2023

Merge branch 'dev' into e2e-increase-timeouts

6dea4e6

StephenButtolph merged commit 79e59d2 into dev Jul 19, 2023

StephenButtolph deleted the e2e-increase-timeouts branch July 19, 2023 17:36

StephenButtolph added this to the v1.10.5 milestone Jul 19, 2023

StephenButtolph added the ci and testing labels Jul 19, 2023

marun mentioned this pull request Jul 20, 2023

upgrade: Increase all ANR timeouts to 2m to ensure CI reliability #1737

Merged

marun commented Jul 19, 2023

abi87 left a comment

StephenButtolph Jul 19, 2023

marun Jul 19, 2023

felipemadero Jul 19, 2023

marun Jul 19, 2023

StephenButtolph Jul 19, 2023

felipemadero Jul 19, 2023
