How is the FP16 model trained? #1985

icestoneking · 2024-08-05T05:41:30Z

Notice: To help resolve issues more efficiently, please follow the template when raising an issue and include the relevant details.

❓How is the FP16 model trained? Can I save the FP16 model as a normal model after training?

Training with DDP:
With ++train_conf.use_fp16=true, the final saved model is still FP32.
Training with DeepSpeed fails with:
FunASR/funasr/models/sanm/attention.py", line 518, in forward
[rank0]: inputs = inputs * mask
[rank0]: ~~~~~~~^~~~~~
[rank0]: RuntimeError: The size of tensor a (0) must match the size of tensor b (74) at non-singleton dimension 1
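
To make the error message easier to read, here is a minimal pure-Python sketch of the broadcasting rule that an elementwise product such as `inputs * mask` has to satisfy. The helper `broadcast_shape` and the example shapes are hypothetical and only for illustration. The traceback reports just the sizes at dimension 1 (0 and 74), not the full tensor shapes.

```python
def broadcast_shape(a, b):
    """Return the broadcast shape of two shapes, or raise ValueError.

    Follows the usual rule: dimensions are aligned from the right, and two
    sizes are compatible when they are equal or when one of them is 1.
    """
    ndim = max(len(a), len(b))
    # Missing leading dimensions are treated as size 1.
    a = (1,) * (ndim - len(a)) + tuple(a)
    b = (1,) * (ndim - len(b)) + tuple(b)
    result = []
    for dim, (x, y) in enumerate(zip(a, b)):
        if x == y or y == 1:
            result.append(x)
        elif x == 1:
            result.append(y)
        else:
            raise ValueError(
                f"size {x} must match size {y} at non-singleton dimension {dim}"
            )
    return tuple(result)


# Hypothetical shapes: a size-1 axis broadcasts against 74 without trouble.
print(broadcast_shape((4, 1, 512), (4, 74, 1)))

# A size-0 axis against 74 is rejected, as in the traceback above.
try:
    broadcast_shape((4, 0, 512), (4, 74, 1))
except ValueError as err:
    print("incompatible:", err)
```

Under this rule a size of 0 only broadcasts against 0 or 1, so the message suggests that `inputs` arrived with an empty axis at dimension 1 while `mask` kept length 74.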

icestoneking added the "question" label (Further information is requested) on Aug 5, 2024