Support infer n parameter #2893

tastelikefeet · 2025-01-09T10:19:38Z

…9/files

* commit 'a0d0351400d522392fb4535567bab83d8b9d45b2':
  Support infer n parameter (modelscope#2893)
  support multi round dpo (modelscope#2884)
  fix docs (modelscope#2882)
  update qlora shell (modelscope#2880)
  fix bugs (modelscope#2876)
  fix citest (modelscope#2873)
  Support ppo (modelscope#2783)
  fix bugs (modelscope#2869)
  Update agent demo (modelscope#2867)
  support mps (modelscope#2866)
  fix vllm video (modelscope#2864)
  support reward model train (modelscope#2862)
  fix jsonl writer (modelscope#2860)
  Support quant bert reward (modelscope#2859)

# Conflicts:
#   examples/train/rlhf/ppo.sh
#   swift/trainers/__init__.py
#   swift/trainers/mixin.py
#   swift/trainers/rlhf_trainer/ppo_trainer.py

support infer-n from: https://github.com/modelscope/ms-swift/pull/288…

a8a13b8

…9/files

Jintao-Huang approved these changes Jan 9, 2025

fix comments

acb6387

tastelikefeet merged commit a0d0351 into modelscope:main Jan 9, 2025
1 of 2 checks passed