Automated cherry pick of #4491: Make memberlist cluster rejoin dead nodes periodically #4527

@xliuxu xliuxu commented Jan 3, 2023

Cherry pick of #4491 on release-1.9.

#4491: Make memberlist cluster rejoin dead nodes periodically

For details on the cherry pick process, see the cherry pick requests page.

@xliuxu xliuxu added the kind/cherry-pick Categorizes issue or PR as related to the cherry-pick of a bug fix from the main branch to a release label Jan 3, 2023

codecov bot commented Jan 3, 2023

Codecov Report

Merging #4527 (015aaec) into release-1.9 (85bc745) will increase coverage by 0.18%.
The diff coverage is 85.50%.

@@               Coverage Diff               @@
##           release-1.9    #4527      +/-   ##
===============================================
+ Coverage        64.39%   64.57%   +0.18%     
===============================================
  Files              393      393              
  Lines            55570    55600      +30     
===============================================
+ Hits             35783    35904     +121     
+ Misses           17178    17092      -86     
+ Partials          2609     2604       -5     
| Flag | Coverage Δ |
|---|---|
| e2e-tests | 39.40% <76.81%> (?) |
| integration-tests | 34.54% <ø> (-0.01%) ⬇️ |
| kind-e2e-tests | 49.01% <76.81%> (+0.17%) ⬆️ |
| unit-tests | 48.31% <50.72%> (-0.02%) ⬇️ |

| Impacted Files | Coverage Δ |
|---|---|
| pkg/agent/memberlist/cluster.go | 78.39% <85.50%> (+1.86%) ⬆️ |

... and 30 files with indirect coverage changes

xliuxu commented Jan 3, 2023

/test-all

tnqn commented Jan 17, 2023

@xliuxu The upgrade test still failed after the image was updated. It may be because the PR has not been rebased. Could you rebase it and the other one?

The patch periodically rejoins Nodes that were removed from the member
list by memberlist because they were unreachable for more than 15
seconds (the GossipToTheDeadTime we are using). Without it, once there
is a network downtime lasting more than 15 seconds, the agent wouldn't
try to reach any other Node and would think it's the only alive Node
until it's restarted.

Signed-off-by: Quan Tian <qtian@vmware.com>
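
The behaviour described in the commit message can be pictured with a small sketch. This is not the code in pkg/agent/memberlist/cluster.go; it is a minimal illustration under stated assumptions. The `joiner` interface, `missingNodes`, `rejoinPeriodically`, and the `interval` parameter are all hypothetical names. `joiner.Join` is only shaped like the `Join(existing []string) (int, error)` method of hashicorp/memberlist, so that a real member list could presumably be plugged in.

```go
package main

import (
	"context"
	"log"
	"time"
)

// joiner is a hypothetical stand-in for the one memberlist call this sketch
// needs. Its signature is modelled on (*memberlist.Memberlist).Join.
type joiner interface {
	Join(existing []string) (int, error)
}

// missingNodes returns the addresses in expected that do not appear in alive,
// preserving the order of expected.
func missingNodes(expected, alive []string) []string {
	aliveSet := make(map[string]struct{}, len(alive))
	for _, a := range alive {
		aliveSet[a] = struct{}{}
	}
	var missing []string
	for _, e := range expected {
		if _, ok := aliveSet[e]; !ok {
			missing = append(missing, e)
		}
	}
	return missing
}

// rejoinPeriodically checks once per interval which expected Nodes are absent
// from the member list and asks j to join them again. It returns when ctx is
// cancelled. The expected and alive callbacks are assumptions of this sketch:
// they supply the Nodes known to the cluster and the current members.
func rejoinPeriodically(ctx context.Context, j joiner, interval time.Duration,
	expected, alive func() []string) {
	ticker := time.NewTicker(interval)
	defer ticker.Stop()
	for {
		select {
		case <-ctx.Done():
			return
		case <-ticker.C:
			dead := missingNodes(expected(), alive())
			if len(dead) == 0 {
				continue
			}
			// A failed attempt is only logged; the next tick retries.
			if _, err := j.Join(dead); err != nil {
				log.Printf("failed to rejoin %v: %v", dead, err)
			}
		}
	}
}
```

The point of the loop is that a Node dropped after a long outage is retried on every tick rather than forgotten, so the agent no longer needs a restart to discover its peers again.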
@xliuxu xliuxu force-pushed the automated-cherry-pick-of-#4491-upstream-release-1.9 branch from ea17a54 to 015aaec Compare January 17, 2023 05:04

xliuxu commented Jan 17, 2023

> @xliuxu The upgrade test still failed after the image was updated. It may be because the PR has not been rebased. Could you rebase it and the other one?

Rebased on release-1.9.
/test-all

tnqn commented Mar 27, 2023

/skip-all

@tnqn tnqn merged commit 114b63b into antrea-io:release-1.9 Mar 27, 2023