-
Notifications
You must be signed in to change notification settings - Fork 366
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Antrea Agent sometimes (rarely) fails to query supported OVS datapath features #6471
Comments
We wait (for a maximum of 5s) for the datapath_id of the br-int OVS bridge to be reported in OVSDB, after creating the bridge and before checking supported datapath features. This prevents errors when querying the supported features before the ofproto-dpif provider has been initialized. Fixes antrea-io#6471 Signed-off-by: Antonin Bas <antonin.bas@broadcom.com>
Some more details here: As you can see from the logs, the CLI call to check supported DP features (error is reported at This would indicate that the I believe that the solution proposed in #6472 will resolve the issue: it waits for the bridge (br-int) datapath ID to become available in OVSDB before querying supported datapath features.
I think that's a better solution that checking the OVS datapath features in a loop until it succeeds, even though that's debatable :P |
We wait (for a maximum of 5s) for the datapath_id of the br-int OVS bridge to be reported in OVSDB, after creating the bridge and before checking supported datapath features. This prevents errors when querying the supported features before the ofproto-dpif provider has been initialized. Fixes antrea-io#6471 Signed-off-by: Antonin Bas <antonin.bas@broadcom.com>
Describe the bug
On 2 separate occasions, I have observed the antrea-agent container crashing shortly after starting.
In both cases, this was on a new Kind cluster, and Antrea was being installed for the first time in the cluster.
The contents of the logs showed that the Agent had failed to query the supported datapath features from OVS, and hence had exited early with an error:
After the automatic restart, the Agent was running correctly as expected. So this is not a major issue in any way, just a small inconvenience.
To Reproduce
It doesn't happen very often, but this was observed during a standard install:
You may notice that the logs above show that the Agent was installed in noEncap mode. However, I have observed this same issue once in encap mode and once in noEncap mode, during 2 separate installations.
Versions:
Antrea v2.0.0
Additional context
The matching ovs-vswitchd logs:
The text was updated successfully, but these errors were encountered: