Skip to content

Conversation

truthbk
Copy link
Member

@truthbk truthbk commented Jan 3, 2025

What does this PR do?

This PR backports:

Motivation

Previous RC's showed an issue by which badly resolved secrets caused a CPU leak in the otel-agent. By properly resolving secrets with a new resolution pattern, we can now prevent such behavior.

Describe how you validated your changes

This was validated in gizmo.

Should also be validated with the correct and healthy behavior of an RC with the otel-agent enabled through-out staging with no noticeable increase in cpu or memory over time for the deployed agents.

Possible Drawbacks / Trade-offs

Additional Notes

This does not solve the root cause, which is in the datadog exporter and is caused by an accumulation of failed transactions. While a CPU increase can be caused by failed transaction and their reattempt, we need to make sure any resource impact is bounded.

@truthbk truthbk added changelog/no-changelog qa/rc-required Only for a PR that requires validation on the Release Candidate labels Jan 3, 2025
@truthbk truthbk added this to the 7.61.0 milestone Jan 3, 2025
@truthbk truthbk requested a review from a team as a code owner January 3, 2025 14:11
@truthbk truthbk requested a review from a team January 3, 2025 14:11
@truthbk truthbk requested review from a team as code owners January 3, 2025 14:11
@truthbk truthbk requested a review from a team January 3, 2025 14:11
@truthbk truthbk requested a review from a team as a code owner January 3, 2025 14:11
@truthbk truthbk requested review from songy23 and jeremy-hanna and removed request for a team January 3, 2025 14:11
@github-actions github-actions bot added long review PR is complex, plan time to review it [deprecated] team/agent-shared-components Deprecated. Use team/agent-configuration or team/agent-runtimes labels instead. labels Jan 3, 2025
@hush-hush hush-hush removed the request for review from jeremy-hanna January 3, 2025 14:27
cfg, err := fetchConfig(cs.ctx, cs.client, cs.Authtoken.Get(), cs.url.String())
if err != nil {
if cs.connected {
cs.Log.Warnf("Failed to fetch config from core agent: %v", err)
cs.Log.Warnf("Loosed connectivity to core-agent to fetch config: %v", err)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Better grammar..

Suggested change
cs.Log.Warnf("Loosed connectivity to core-agent to fetch config: %v", err)
cs.Log.Warnf("Lost connectivity to core-agent to fetch config: %v", err)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Interesting, I made exactly the same comment in the original PR and thought that had been fixed.

Let's fix it in the main branch, and let that get picked up later. I'd rather not change the backported PRs at all.

Copy link
Contributor

@ajgajg1134 ajgajg1134 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good for agent-apm!

@agent-platform-auto-pr
Copy link
Contributor

Test changes on VM

Use this command from test-infra-definitions to manually test this PR changes on a VM:

inv create-vm --pipeline-id=52099698 --os-family=ubuntu

Note: This applies to commit 414ac31

@AliDatadog AliDatadog closed this Mar 4, 2025
@dd-devflow dd-devflow bot deleted the jaime/backport-configsync branch July 6, 2025 00:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
changelog/no-changelog [deprecated] team/agent-shared-components Deprecated. Use team/agent-configuration or team/agent-runtimes labels instead. long review PR is complex, plan time to review it qa/rc-required Only for a PR that requires validation on the Release Candidate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants