Skip to content

Commit

Permalink
fix-step3-readme (microsoft#286)
Browse files Browse the repository at this point in the history
* fix-step3-readme

* Update README.md

---------

Co-authored-by: Zhewei Yao <zheweiy@berkeley.edu>
  • Loading branch information
zhangfanTJU and yaozhewei authored Apr 28, 2023
1 parent 354fc7f commit a3e4857
Showing 1 changed file with 1 addition and 4 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -7,9 +7,6 @@ If you don't have step 1 and step 2 models. You may simply try
``` bash
--actor_model_name_or_path facebook/opt-1.3b --critic_model_name_or_path facebook/opt-350m
```
⚡⚡⚡ When you use above script, please make sure you comment out the following such that it won't load the model weight from previous paths.
```bash
applications/DeepSpeed-Chat/training/utils/model/model_utils.py#L60
```
⚡⚡⚡ When you use above script, please make sure you modify parameter `rlhf_training` to False when calling the `create_critic_model` function twice in [rlhf_engine.py](./../../step3_rlhf_finetuning/rlhf_engine.py) such that it won't load the model weight from previous paths.

For the models we support, please see [our landing page](./../../../README.md#-supported-models-)

0 comments on commit a3e4857

Please sign in to comment.