Skip to content

[chore] k8s e2e test enhancement #26068

Open
@fatsheep9146

Description

Component(s)

internal/k8stest

Is your feature request related to a problem? Please describe.

The k8s e2e testing module encountered a critical bug #25849 a few days ago, which caused many CI processes to fail and significantly impacted the release schedule of the entire project. Although the root cause of the problem has been identified #25889 (though it is not a solution to all issues), this incident highlighted the need for enhancements in the e2e module.

  • On one hand, we need to reduce the probability of similar issues occurring again,
  • On the other hand, we should preserve more context to troubleshoot flaky tests if they occur again.

So I open this issue to track some follow-ups needed to enhance the e2e test, some of them is already discussed in slack channel

here are some follow-ups I came up

  • add e2e test for telemetrygen
  • add more error log or event when collector could not start in time
  • add more error log or event when telemetrygen could not start in time

Describe alternatives you've considered

No response

Additional context

No response

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

No one assigned

    Labels

    cmd/telemetrygentelemetrygen commandenhancementNew feature or requestinternal/k8stestnever staleIssues marked with this label will be never staled and automatically removed

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions