Open
Description
There was previously a default learning rate for MADLAD, but all the experiments run to determine the value used LoRA models. Since the learning rate experiments for NLLB and LoRA yielded the same value, it makes sense to reevaluate the default value for MADLAD once we are able to test MADLAD without using LoRA.
Metadata
Metadata
Assignees
Type
Projects
Status
📋 Backlog