Skip to content

Commit

Permalink
[deepspeed] offload + non-cpuadam optimizer exception doc (huggingfac…
Browse files Browse the repository at this point in the history
…e#22044)

* [deepspeed] offload + non-cpuadam optimizer exception doc

* deps
  • Loading branch information
stas00 authored Mar 22, 2023
1 parent 5990743 commit 89a0a9e
Show file tree
Hide file tree
Showing 3 changed files with 13 additions and 4 deletions.
13 changes: 11 additions & 2 deletions docs/source/en/main_classes/deepspeed.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -1293,8 +1293,17 @@ If you want to use another optimizer which is not listed above, you will have to
}
```

Similarly to `AdamW`, you can configure other officially supported optimizers. Just remember that may have different
config values. e.g. for Adam you will want `weight_decay` around `0.01`.
Similarly to `AdamW`, you can configure other officially supported optimizers. Just remember that those may have different config values. e.g. for Adam you will want `weight_decay` around `0.01`.

Additionally, offload works the best when it's used with Deepspeed's CPU Adam optimizer. If you want to use a different optimizer with offload, since `deepspeed==0.8.3` you need to also add:


```json
{
"zero_force_ds_cpu_optimizer": false
}
```
to the top level configuration.



Expand Down
2 changes: 1 addition & 1 deletion setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -106,7 +106,7 @@
"dataclasses",
"datasets!=2.5.0",
"decord==0.6.0",
"deepspeed>=0.6.5",
"deepspeed>=0.8.3",
"dill<0.3.5",
"evaluate>=0.2.0",
"fairscale>0.3",
Expand Down
2 changes: 1 addition & 1 deletion src/transformers/dependency_versions_table.py
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@
"dataclasses": "dataclasses",
"datasets": "datasets!=2.5.0",
"decord": "decord==0.6.0",
"deepspeed": "deepspeed>=0.6.5",
"deepspeed": "deepspeed>=0.8.3",
"dill": "dill<0.3.5",
"evaluate": "evaluate>=0.2.0",
"fairscale": "fairscale>0.3",
Expand Down

0 comments on commit 89a0a9e

Please sign in to comment.