Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ebpf agent cannot run successfully in okd 4.14 cluster #516

Open
kaolaaz163 opened this issue Dec 12, 2023 · 7 comments
Open

ebpf agent cannot run successfully in okd 4.14 cluster #516

kaolaaz163 opened this issue Dec 12, 2023 · 7 comments

Comments

@kaolaaz163
Copy link

I am using a cluster of OKD 4.14,Both loki operator and network observability operator have been successfully installed. When running ebpf agent, its status is shown as CrashLoopBackOff.

1702371290124

The pod of ebpf agent has the following error log. Can anyone help me figure out what the problem is?

1702371351324

Copy link

Congratulations for contributing your first netobserv-operator issue

@jotak
Copy link
Member

jotak commented Dec 12, 2023

Hi @kaolaaz163
Which version of netobserv are you using? Did you install a release from operatorhub, or directly from operator repo / main branch ? Can you also tell what's the docker image used in the agent pods?

@jotak
Copy link
Member

jotak commented Dec 12, 2023

cc @msherif1234

@kaolaaz163
Copy link
Author

kaolaaz163 commented Dec 12, 2023

@jotak The environment information I use is as follows.I installed operator directly through OperatorHub.

OpenShift: OKD 4.14.0-0.okd-2023-11-14-101924
Network Observability Operator: 1.4.2
Loki Operator: 5.8.0
ebpf agent image:registry.redhat.io/network-observability/network-observability-ebpf-agent-rhel9@sha256:5dee0d22ca62ab56c1b3d07147a31602d9106a24c423740a75405a82eece8a7f

@kaolaaz163
Copy link
Author

Can anyone help me figure out what the problem is?

@msherif1234
Copy link
Contributor

msherif1234 commented Dec 15, 2023

This is a known kernel bug in 6.5.5 that I filed and fix is in review
https://lore.kernel.org/bpf/20231030132145.20867-1-shung-hsi.yu@suse.com/

@jotak
Copy link
Member

jotak commented Dec 15, 2023

As a side comment, we don't systematically test OKD fcos (based on fedora) so for better guarantees of stability I'd rather suggest to use OKD scos (based on centos stream), which is on par with the rhel-based distros that we support. Cf for instance the scos-stable list here: https://amd64.origin.releases.ci.openshift.org/#4-scos-stable

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants