I wanted to know why the `prepare_input_labels_for_multimodal` function in `llava_arch.py` is designed to throw an exception during pretraining if the `mm_use_im_start_end` option is enabled:
# TODO: image start / end is not implemented here to support pretraining.
if getattr(self.config, 'tune_mm_mlp_adapter', False) and getattr(self.config, 'mm_use_im_start_end', False):
    raise NotImplementedError
Shouldn't the function work fine even if there are `<im_start>` and `<im_end>` tokens around the image tokens?
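To make the question concrete, here is a minimal sketch of the case being described: an image placeholder wrapped by start and end tokens, with the placeholder replaced by image features. It is not the code from `llava_arch.py`. The function `splice_image`, the stand-in `embed`, and all token ids below are hypothetical and chosen only for illustration.

```python
# Hypothetical ids for illustration only; not taken from llava_arch.py.
IMAGE_PLACEHOLDER = -200          # assumed id marking where image features go
IM_START, IM_END = 32001, 32002   # assumed ids for <im_start> / <im_end>
IGNORE_INDEX = -100               # label value that the loss is assumed to skip


def splice_image(input_ids, labels, image_features, embed):
    """Replace the placeholder with image features; keep every other token."""
    new_embeds, new_labels = [], []
    for tok, lab in zip(input_ids, labels):
        if tok == IMAGE_PLACEHOLDER:
            # Image positions carry no text target, so mask their labels.
            new_embeds.extend(image_features)
            new_labels.extend([IGNORE_INDEX] * len(image_features))
        else:
            # Ordinary tokens, including <im_start>/<im_end>, are embedded as text.
            new_embeds.append(embed(tok))
            new_labels.append(lab)
    return new_embeds, new_labels


def embed(tok):
    """Stand-in for a text embedding lookup."""
    return ("text", tok)


input_ids = [1, IM_START, IMAGE_PLACEHOLDER, IM_END, 42, 43]
labels = [IGNORE_INDEX] * 4 + [42, 43]
image_features = [("img", i) for i in range(3)]

embeds, new_labels = splice_image(input_ids, labels, image_features, embed)
```

In this toy version the start and end tokens simply pass through as text embeddings on either side of the image features. Whether the real function has an additional constraint during pretraining is exactly what the question asks.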