Optimum amd support #464

michaelfeil · 2024-11-14T05:41:02Z

Related Issue

Checklist

I have read the CONTRIBUTING guidelines.
I have added tests to cover my changes.
I have updated the documentation (docs folder) accordingly.

Additional Notes

Add any other context about the PR here.

Rocm Onnxruntime

greptile-apps

PR Summary

Added comprehensive AMD GPU support to Infinity, enabling model deployment and optimization on AMD MI200/MI300 series GPUs with architecture-specific configurations.

Added AMD-specific Docker configurations in /libs/infinity_emb/Docker.template.yaml with separate build paths for gfx90a/gfx942 (source build) and gfx1100 (pre-built wheel)
Integrated MIGraphX and ROCm execution providers in /libs/infinity_emb/infinity_emb/transformer/utils_optimum.py for AMD GPU optimization
Added AMD deployment documentation in /docs/docs/deploy.md with specific run commands and flags for AMD GPUs
Disabled BetterTransformer for AMD platforms due to compatibility issues
Added CHECK_OPTIMUM_AMD optional import for AMD-specific optimizations

_{6 file(s) reviewed, 10 comment(s)}
_{Edit PR Review Bot Settings | Greptile}

docs/docs/deploy.md

greptile-apps · 2024-11-14T05:42:12Z

libs/infinity_emb/Docker.template.yaml

+    # GPU architecture specific installations
+    RUN cd /opt/rocm/share/amd_smi && python -m pip wheel . --wheel-dir=/install
+    RUN apt update -y && apt install migraphx -y
+    RUN if [ "$GPU_ARCH" = "gfx90a" ] || [ "$GPU_ARCH" = "gfx942" ]; then \


logic: Consider adding error handling and logging for failed installations. The script continues silently if any of the installation steps fail.

libs/infinity_emb/Docker.template.yaml

libs/infinity_emb/Dockerfile.amd_auto

greptile-apps · 2024-11-14T05:43:38Z

libs/infinity_emb/infinity_emb/transformer/utils_optimum.py

+        elif "MIGraphXExecutionProvider" in available:
+            return "MIGraphXExecutionProvider"  # swapped order of ROCM and MIGraphX
        elif "ROCMExecutionProvider" in available:


logic: MIGraphX is prioritized over ROCM here but reversed in the CUDA section above. Consider using consistent ordering.

greptile-apps · 2024-11-14T05:43:38Z

libs/infinity_emb/infinity_emb/transformer/utils_optimum.py

    if files_optimized:
-        file_optimized = files_optimized[0]
+        file_optimized = files_optimized[-1]


style: Using files_optimized[-1] could be unstable if multiple optimized versions exist. Consider using version sorting or timestamps.

codecov-commenter · 2024-11-14T05:52:33Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

Attention: Patch coverage is 60.00000% with 6 lines in your changes missing coverage. Please review.

Project coverage is 78.92%. Comparing base (d9050a7) to head (e6d3fa4).

Files with missing lines	Patch %	Lines
...nity_emb/infinity_emb/transformer/utils_optimum.py	57.14%	6 Missing ⚠️

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #464      +/-   ##
==========================================
- Coverage   79.04%   78.92%   -0.13%     
==========================================
  Files          42       42              
  Lines        3408     3417       +9     
==========================================
+ Hits         2694     2697       +3     
- Misses        714      720       +6

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

tjtanaa and others added 13 commits October 26, 2024 14:02

add optimum-amd support

6e47aef

update documentation

be7cd61

add commit with onnxruntime

8ceeab0

update template

987d8c5

remove source build pipeline

8f88be1

docker build and push v3

129fd4a

Merge pull request #2 from michaelfeil/mf-optimum-amd-support

0e8b81b

Rocm Onnxruntime

complete and tested on radeon 7900 xtx

16bb74f

add migraphx compilation

93d7c27

add migraphx support to mi300x build

83ec2f1

finalize the amd docker setup command

30da7fb

update the dockerfile.amd

ae76a0c

Merge branch 'main' into optimum-amd-support

829c8a4

greptile-apps bot reviewed Nov 14, 2024

View reviewed changes

michaelfeil and others added 3 commits November 14, 2024 08:11

update docker image

d204b8a

add to readme

018cbb0

Merge branch 'main' into optimum-amd-support

e6d3fa4

michaelfeil merged commit f59df4f into main Nov 16, 2024
35 of 36 checks passed

michaelfeil mentioned this pull request Nov 16, 2024

[AMD] [ROCm] [Optimum] Add optimum-amd support #443

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimum amd support #464

Optimum amd support #464

Uh oh!

michaelfeil commented Nov 14, 2024

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot Nov 14, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot Nov 14, 2024

Uh oh!

greptile-apps bot Nov 14, 2024

Uh oh!

codecov-commenter commented Nov 14, 2024 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Optimum amd support #464

Optimum amd support #464

Uh oh!

Conversation

michaelfeil commented Nov 14, 2024

Related Issue

Checklist

Additional Notes

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

PR Summary

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot Nov 14, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot Nov 14, 2024

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Nov 14, 2024

Choose a reason for hiding this comment

Uh oh!

codecov-commenter commented Nov 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

codecov-commenter commented Nov 14, 2024 •

edited

Loading