Open
Description
Component(s)
internal/k8stest
Is your feature request related to a problem? Please describe.
The k8s e2e testing module encountered a critical bug #25849 a few days ago, which caused many CI processes to fail and significantly impacted the release schedule of the entire project. Although the root cause of the problem has been identified #25889 (though it is not a solution to all issues), this incident highlighted the need for enhancements in the e2e module.
- On one hand, we need to reduce the probability of similar issues occurring again,
- On the other hand, we should preserve more context to troubleshoot flaky tests if they occur again.
So I open this issue to track some follow-ups needed to enhance the e2e test, some of them is already discussed in slack channel
here are some follow-ups I came up
- add e2e test for telemetrygen
- add more error log or event when collector could not start in time
- add more error log or event when telemetrygen could not start in time
Describe alternatives you've considered
No response
Additional context
No response
Activity