-
Notifications
You must be signed in to change notification settings - Fork 148
Open
Labels
questionFurther information is requestedFurther information is requested
Description
Problem Description / 问题描述
我用OpenDCAI/dataflow-instruct-10k数据集,base模型为Qwen2.5-7B-base,learning rate为2e-5,per_device_train_batch_size为1,batch size为128,训练了3个epoch,在C-Eval(评测温度为0)的上:1个epoch准确率为74.81,第2个epoch是75.85,第三个epoch是74.96,和报告中的80.2差距较大;求助论文采用的训练配置,感谢!
System Info (dataflow env) / 系统信息(dataflow env)
None
Minimal Reproducible Example / 最小可复现示例
# e.g.
from dataflow.operators.generate import PromptedVQA
...
Additional Information / 其他补充
No response
Metadata
Metadata
Assignees
Labels
questionFurther information is requestedFurther information is requested