Add nvidia-bug-report to eks-logs-collector #1864
Merged
Issue #, if available:
N/A
Description of changes:
This PR adds execution of `nvidia-bug-report.sh` to the eks-logs-collector. This executable ships with the NVIDIA drivers and is useful for debugging. The script is also mentioned in the NVIDIA GPU debug guidelines: https://docs.nvidia.com/deploy/gpu-debug-guidelines/index.html

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.
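For readers who want a picture of what such a step can look like, here is a minimal sketch rather than the code from this PR. The function name `collect_nvidia_bug_report` and the output directory argument are hypothetical. It assumes the script's default behavior of writing `nvidia-bug-report.log.gz` into the current working directory, and it skips quietly on hosts where the NVIDIA driver is not installed.

```shell
#!/usr/bin/env bash
# Hypothetical helper: the name and directory layout are illustrative and
# are not taken from the eks-logs-collector source.
collect_nvidia_bug_report() {
  local out_dir="$1"

  # Hosts without the NVIDIA driver do not have the script; skip, do not fail.
  if ! command -v nvidia-bug-report.sh >/dev/null 2>&1; then
    echo "nvidia-bug-report.sh not found, skipping"
    return 0
  fi

  mkdir -p "${out_dir}" || return 1

  # Assumption: with no arguments the script writes nvidia-bug-report.log.gz
  # into the current directory, so run it from the output directory.
  # The subshell keeps the caller's working directory unchanged.
  if ! (cd "${out_dir}" && nvidia-bug-report.sh >/dev/null 2>&1); then
    echo "nvidia-bug-report.sh failed, continuing"
  fi
  return 0
}
```

A caller could invoke it as `collect_nvidia_bug_report "/tmp/collect/gpu"` (an illustrative path). Returning 0 in every non-fatal case is a deliberate choice so that a missing or failing GPU report never aborts the rest of the log collection.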
Testing Done
I tested this script on a g4dn instance, which has an NVIDIA GPU, and verified that the log.gz file created by `nvidia-bug-report.sh` is included in the log collector archive. I also ran the script on a t3.large to make sure it doesn't break on an instance without a GPU.
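The archive check described above can be repeated with a small helper. This is a sketch under assumptions: the function name `archive_has_nvidia_report` is hypothetical, the archive is assumed to be a gzip-compressed tarball, and the report is assumed to keep the default file name `nvidia-bug-report.log.gz` inside it.

```shell
#!/usr/bin/env bash
# Hypothetical check: succeeds if the given .tar.gz archive lists a file
# named nvidia-bug-report.log.gz at any depth.
archive_has_nvidia_report() {
  local archive="$1"
  # tar -tzf lists the archive's entries without extracting them.
  tar -tzf "${archive}" | grep -q 'nvidia-bug-report\.log\.gz$'
}
```

On a GPU instance the function should return success for the collector's archive. On an instance without the driver it should return failure, since no report is produced there.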