Dockerfile: Remove fixed rccl version in rocm5.1.x docker file #476

Merged: 2 commits, Feb 7, 2023

Conversation

pnunna93
Contributor

@pnunna93 pnunna93 commented Feb 6, 2023

Description
The commit (e08b6d3) installs a rccl version that causes "undefined symbol: ncclGetLastError" when importing torch. Revert it to avoid the error.
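
One way to confirm this kind of mismatch before importing torch is to ask the dynamic loader whether the installed rccl library exports the symbol at all. The following is a minimal sketch, not part of this PR: the helper name `exports_symbol` is hypothetical, and the library path `/opt/rocm/lib/librccl.so` is an assumption about where the ROCm image places rccl, so adjust it for the actual container.

```python
import ctypes
from typing import Optional


def exports_symbol(library: Optional[str], symbol: str) -> bool:
    """Return True if the shared library exports `symbol`.

    Passing None inspects the symbols already loaded into the current process.
    Raises OSError if the library itself cannot be loaded.
    """
    handle = ctypes.CDLL(library)
    # ctypes resolves the symbol lazily on attribute access and raises
    # AttributeError when the loader cannot find it, so hasattr is enough.
    return hasattr(handle, symbol)


if __name__ == "__main__":
    # Assumed location of the rccl library inside the ROCm image (hypothetical).
    rccl_path = "/opt/rocm/lib/librccl.so"
    try:
        found = exports_symbol(rccl_path, "ncclGetLastError")
    except OSError as err:
        print(f"could not load {rccl_path}: {err}")
    else:
        print(f"ncclGetLastError exported: {found}")
```

If the check reports that the symbol is missing, the installed rccl is likely older than the one the torch build expects, which would be consistent with the import error described above.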

@pnunna93 pnunna93 requested a review from a team as a code owner February 6, 2023 18:48
@codecov

codecov bot commented Feb 6, 2023

Codecov Report

Merging #476 (b0a5600) into main (121a5dd) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##             main     #476   +/-   ##
=======================================
  Coverage   87.79%   87.79%           
=======================================
  Files          86       86           
  Lines        5564     5564           
=======================================
  Hits         4885     4885           
  Misses        679      679           
Flag                      Coverage Δ
cpu-python3.6-unit-test   74.82% <ø> (ø)
cpu-python3.7-unit-test   74.82% <ø> (ø)
cuda-unit-test            87.72% <ø> (ø)

@pnunna93
Contributor Author

pnunna93 commented Feb 6, 2023

Updated the PR to remove the custom rccl build. Please use the rccl package from the ROCm release.

@cp5555 cp5555 changed the title Revert "Dockerfile: Update rccl version and fix issue in rocm5.1.1 do… Dockerfile: Remove fixed rccl version in rocm5.1.x docker file Feb 7, 2023
@cp5555 cp5555 added the containers SuperBench Containers label Feb 7, 2023
@cp5555 cp5555 mentioned this pull request Feb 7, 2023
21 tasks
@yukirora
Contributor

yukirora commented Feb 7, 2023

/azp run

@azure-pipelines

Azure Pipelines successfully started running 3 pipeline(s).

@yukirora yukirora merged commit f21bfef into microsoft:main Feb 7, 2023
@cp5555 cp5555 mentioned this pull request Mar 28, 2023
23 tasks
@yukirora yukirora mentioned this pull request Mar 29, 2023
21 tasks
@yukirora yukirora mentioned this pull request Jul 12, 2023
18 tasks
Labels
containers SuperBench Containers
3 participants