Skip to content

Commit a50b66b

Browse files
authored
Update README.md
1 parent cc58ee3 commit a50b66b

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

ML tips/NLP/README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -899,4 +899,5 @@ https://github.com/OpenLMLab/MOSS-RLHF/tree/main
899899

900900
- reward model: https://github.com/OpenLMLab/MOSS-RLHF/blob/main/train_ppo.py#L113
901901
- note need to train own reward model, but can use hf trainer or something, use above as guide
902+
- see `reward_model` folder
902903
- PPO train https://github.com/OpenLMLab/MOSS-RLHF/blob/main/run_en.sh

0 commit comments

Comments
 (0)