You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
When passing down "invalid" remote configuration from the otel controller to the supervisor, the supervisor doesn't report back in the RemoteConfigStatus status == failed. It does report back Unhealthy in the ComponentHealth with a LastError, but relying on that seems to break the opamp specification and it doesn't specify any details.
What is happening:
Pushed down valid yaml but with invalid collector config:
receive ComponentHealth.Healthy == false every 5 seconds with ComponentHealth.LastError:
Agent process PID={*} exited unexpectedly, exit code=1. Will restart in a bit...
agent.log file gets rewritten every 5 seconds with:
Error: failed to get config: cannot unmarshal the configuration: decoding failed due to the following error(s):
'' has invalid keys: force_invalid
2024/08/21 13:01:42 collector server run finished with error: failed to get config: cannot unmarshal the configuration: decoding failed due to the following error(s):
'' has invalid keys: force_invalid
Describe the solution you'd like
Call the collector validate command before starting and the agent. if that fails report the error message back in the RemoteConfigStatus.ErrorMessage with the correct status of Failed.
Describe alternatives you've considered
"Reuse" the ComponentHealth as the RemoteConfigStatus for now, but in my opinion that's a bad implementation of the opamp spec from both the controller as the supervisor.
Additional context
No response
The text was updated successfully, but these errors were encountered:
Component(s)
cmd/opampsupervisor
Is your feature request related to a problem? Please describe.
When passing down "invalid" remote configuration from the otel controller to the supervisor, the supervisor doesn't report back in the RemoteConfigStatus status == failed. It does report back Unhealthy in the ComponentHealth with a LastError, but relying on that seems to break the opamp specification and it doesn't specify any details.
What is happening:
Pushed down valid yaml but with invalid collector config:
First message send by supervisor has RemoteConfigStatus: (with corresponding LastRemoteConfigHash)
receive ComponentHealth.Healthy == false every 5 seconds with ComponentHealth.LastError:
agent.log file gets rewritten every 5 seconds with:
Describe the solution you'd like
Call the collector validate command before starting and the agent. if that fails report the error message back in the RemoteConfigStatus.ErrorMessage with the correct status of Failed.
Describe alternatives you've considered
"Reuse" the ComponentHealth as the RemoteConfigStatus for now, but in my opinion that's a bad implementation of the opamp spec from both the controller as the supervisor.
Additional context
No response
The text was updated successfully, but these errors were encountered: