trainer.save_checkpoint
doesn't work after trainer.test
with deepspeed strategy
#15247
Labels
Milestone
Bug description
Reported here: #14944 (comment)
Reason? Read the thread: #14944 (comment)
in short
does not work.
Either we need to update the strategy somehow or improve the support in the deepspeed package itself to allow saving the checkpoint without any optimizer.
Full repro:
Environment
More info
Issue on DeepSpeed GitHub: microsoft/DeepSpeed#3601
The text was updated successfully, but these errors were encountered: