Add deepspeed test to amd scheduled CI #27633
Merged
Commits
35 commits
1e8ce66 add deepspeed scheduled test for amd (echarlaix)
bf276ed fix image (echarlaix)
2cfb53d add dockerfile (echarlaix)
5a9a529 add comment (echarlaix)
af46e87 enable tests (echarlaix)
c29d249 trigger (echarlaix)
a0c3daf remove trigger for this branch (echarlaix)
4cb9d6f trigger (echarlaix)
a703349 change runner env to trigger the docker build image test (echarlaix)
a47ac2c use new docker image (echarlaix)
233bd7f remove test suffix from docker image tag (echarlaix)
971ba80 replace test docker image with original image (echarlaix)
da4774c push new image (echarlaix)
cbe995f Trigger (echarlaix)
e16c271 add back amd tests (echarlaix)
70c3580 fix typo (echarlaix)
090b88e add amd tests back (echarlaix)
508ae29 fix (echarlaix)
09fee9e comment until docker image build scheduled test fix (echarlaix)
407cfe9 remove deprecated deepspeed build option (echarlaix)
f846b80 upgrade torch (echarlaix)
785b63a update docker & make tests pass
ba8cc9f Merge branch 'main' into run_amd_scheduled_ci_caller_deepspeed_test (fxmarty)
f0f931e Update docker/transformers-pytorch-deepspeed-amd-gpu/Dockerfile (fxmarty)
40398b9 fix (echarlaix)
3332cd2 tmp disable test (echarlaix)
9696cc4 precompile deepspeed to avoid timeout during tests (echarlaix)
84a7a33 fix comment (echarlaix)
fc6d890 Merge branch 'main' into run_amd_scheduled_ci_caller_deepspeed_test (echarlaix)
df00cff trigger deepspeed tests with new image (echarlaix)
92c402d comment tests (echarlaix)
fa82a9c trigger (echarlaix)
ecb9239 add sklearn dependency to fix slow tests (echarlaix)
cfcc312 enable back other tests (echarlaix)
ae82b3f final update (ydshieh)
add deepspeed scheduled test for amd
commit 1e8ce6607b49ca32c009e4e86467df8ae11a84b6
maybe add
to be the same as in other workflow file.
I'm not sure I understand why we need to uninstall and reinstall deepspeed here. What issue does it solve?
I don't remember exactly; it was added a year or more ago. I can try to find the reason in the history if you would like to have that information.
In our case we don't need it at the moment, so does it work if we keep it that way? If you want me to uninstall / reinstall it in the tests, I can directly update and use huggingface/transformers-pytorch-amd-gpu and just install deepspeed in the tests directly (to avoid this step).
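For reference, a rough sketch of the two options discussed in this thread: installing deepspeed at test time on top of the base image, with or without first uninstalling a copy already present in the image. The helper name and its parameters are hypothetical and are not part of this PR; the commands are only built here, not executed, and the actual workflow step may well look different.

```python
import sys


# Hypothetical helper (not from the PR): builds the pip command lines
# for a test-time deepspeed install. It does not run anything.
def deepspeed_install_commands(reinstall=False, python=sys.executable):
    """Return a list of argv lists for installing deepspeed at test time."""
    commands = []
    if reinstall:
        # Remove any copy already baked into the image before installing again.
        commands.append([python, "-m", "pip", "uninstall", "-y", "deepspeed"])
    # Plain install; any build options are left out because the thread
    # does not specify them.
    commands.append([python, "-m", "pip", "install", "deepspeed"])
    return commands


if __name__ == "__main__":
    for argv in deepspeed_install_commands(reinstall=True):
        print(" ".join(argv))
```

With reinstall=False only the install command is produced, which corresponds to the "just install deepspeed in the tests directly" option above.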