Question about training configuration #414

@caihuaiguang

Problem Description

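I fine-tuned the base model Qwen2.5-7B-base on the OpenDCAI/dataflow-instruct-10k dataset with a learning rate of 2e-5, per_device_train_batch_size of 1, and a batch size of 128, for 3 epochs. On C-Eval (evaluation temperature 0), accuracy was 74.81 after epoch 1, 75.85 after epoch 2, and 74.96 after epoch 3, which is well below the 80.2 given in the report. Could you share the training configuration used in the paper? Thanks!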
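For anyone comparing setups, here is a small sketch of how a per-device batch size of 1 relates to a batch size of 128, assuming that 128 is the global batch size reached through gradient accumulation. The device count (8) and the dataset size (10,000 examples, guessed from the "10k" in the dataset name) are assumptions for illustration, not values from my run or from the paper, and the helper names are hypothetical.

```python
import math


def grad_accum_steps(global_batch_size: int,
                     per_device_batch_size: int,
                     num_devices: int) -> int:
    """Gradient accumulation steps needed to reach the target global batch size."""
    samples_per_forward = per_device_batch_size * num_devices
    if global_batch_size % samples_per_forward != 0:
        raise ValueError(
            "global_batch_size must be a multiple of "
            "per_device_batch_size * num_devices"
        )
    return global_batch_size // samples_per_forward


def optimizer_steps_per_epoch(num_examples: int, global_batch_size: int) -> int:
    """Optimizer updates per epoch, counting a final partial batch as one step."""
    return math.ceil(num_examples / global_batch_size)


if __name__ == "__main__":
    # Assumed values: 8 devices and 10,000 training examples.
    accum = grad_accum_steps(global_batch_size=128,
                             per_device_batch_size=1,
                             num_devices=8)
    steps = optimizer_steps_per_epoch(num_examples=10_000,
                                      global_batch_size=128)
    print(f"gradient accumulation steps: {accum}")
    print(f"optimizer steps per epoch:   {steps}")
    print(f"optimizer steps in 3 epochs: {3 * steps}")
```

Under these assumptions each epoch contains fewer than 100 optimizer updates, so details such as the warmup ratio and learning-rate schedule could matter, which is part of why I am asking for the exact configuration.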

System Info (dataflow env)

None

Minimal Reproducible Example

# e.g.
from dataflow.operators.generate import PromptedVQA
...

Additional Information

No response

Labels: question (Further information is requested)