Add aarch64-Compatible Base Image for MLPerf Inference & Fix TensorRT Version Matching #478
Abstract
This pull request updates the `base_image` fields in multiple `_cm.yaml` configuration files within the MLPerf inference pipeline to ensure compatibility with the `aarch64` Grace Hopper architecture. The existing configurations are hardcoded to use `x86_64`-compatible MLPerf containers, which are not optimized for `aarch64` systems.

Technical Details
Scope of Changes:
- `_cm.yaml` files where the `base_image` is set to an x86_64 MLPerf inference container.
- `app-mlperf-inference` and other relevant script directories within the repository.

Automation via Patch:
- A `.patch` file to automate the replacement process.
- `base_image` entries pointing to MLPerf `x86_64` images are replaced with the `aarch64` Grace Hopper-compatible image.

Updated `base_image` Value:

Validation
The patch has been tested using the following command, which successfully pulled the Grace Hopper-compatible container after applying the patch:
This patch enables smooth MLPerf inference on `aarch64` systems, eliminating the need for users to manually update variables across scripts. It offers a streamlined, automated and tested solution for Grace Hopper compatibility.

Additionally, we corrected a minor bug in `get-tensorrt/customize.py` to improve regex matching, allowing it to recognize version tags with multiple digits.
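
To illustrate the kind of regex issue described for `get-tensorrt/customize.py`, here is a minimal sketch. The two patterns and the archive names are hypothetical and are not taken from the repository. The sketch only shows why a pattern that allows exactly one digit per version component misses a tag such as `10.2.0.19`, while a pattern using `\d+` accepts it.

```python
import re

# Hypothetical patterns for illustration; not the ones used in customize.py.
# Exactly one digit per version component.
SINGLE_DIGIT = re.compile(r"TensorRT-(\d\.\d\.\d\.\d)")
# One or more digits per version component.
MULTI_DIGIT = re.compile(r"TensorRT-(\d+(?:\.\d+)+)")


def extract_version(filename, pattern):
    """Return the version captured by `pattern`, or None if it does not match."""
    match = pattern.search(filename)
    return match.group(1) if match else None


# Hypothetical archive names.
old_style = "TensorRT-8.6.1.6.Linux.aarch64-gnu.tar.gz"
new_style = "TensorRT-10.2.0.19.Linux.aarch64-gnu.tar.gz"

single_old = extract_version(old_style, SINGLE_DIGIT)  # matches
single_new = extract_version(new_style, SINGLE_DIGIT)  # no match: "10" has two digits
multi_old = extract_version(old_style, MULTI_DIGIT)    # matches
multi_new = extract_version(new_style, MULTI_DIGIT)    # matches
```

Using `\d+` for each component keeps the pattern working when any part of the version grows past nine.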
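
The `base_image` replacement described under "Automation via Patch" could also be expressed as a small script. The sketch below is an alternative illustration, not the `.patch` file shipped with this pull request. The function names, the `x86_64` marker used to detect old entries, and the image names are all assumptions made for the example.

```python
import re
from pathlib import Path

# Matches a YAML line of the form "<indent>base_image: <value>".
BASE_IMAGE_LINE = re.compile(r"^(\s*base_image:\s*)(\S+)\s*$")


def replace_base_image(yaml_text, old_marker, new_image):
    """Replace every base_image value that contains old_marker with new_image."""
    out = []
    for line in yaml_text.splitlines(keepends=True):
        match = BASE_IMAGE_LINE.match(line.rstrip("\n"))
        if match and old_marker in match.group(2):
            newline = "\n" if line.endswith("\n") else ""
            line = match.group(1) + new_image + newline
        out.append(line)
    return "".join(out)


def update_tree(root, old_marker, new_image):
    """Rewrite every _cm.yaml under root in place; return the paths that changed."""
    changed = []
    for path in Path(root).rglob("_cm.yaml"):
        original = path.read_text()
        updated = replace_base_image(original, old_marker, new_image)
        if updated != original:
            path.write_text(updated)
            changed.append(path)
    return changed


# Hypothetical image names, used only to exercise the function.
sample = "docker:\n  base_image: example.invalid/mlperf:x86_64-tag\n  other: 1\n"
result = replace_base_image(sample, "x86_64", "example.invalid/mlperf:aarch64-tag")
```

Working line by line rather than through a YAML parser keeps comments and formatting in the files untouched, at the cost of ignoring quoted or multi-line values.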