This method serves as the easiest entry point into using QEfficient. The interface is designed to be similar to transformers.AutoModel.
Once the model is initialized, you can use other methods such as export, compile, and generate on the same object.
+ As an exception, this API can also be used for VLM models: because transformers supports loading InternChatVL models via the AutoModel API, we support them via the AutoModelForCausalLM API.
Args:
:pretrained_name_or_path (str): Model card name from HuggingFace or local path to model directory.
:args, kwargs: Additional arguments to pass to transformers.AutoModel.
This method serves as the easiest entry point into using QEfficient. The interface is designed to be similar to transformers.AutoModelForCausalLM.
Once the model is initialized, you can use other methods such as export, compile, and generate on the same object.
+ As an exception, this API can also be used for VLM models: because transformers supports loading InternChatVL models via the AutoModel API, we support them via the AutoModelForCausalLM API.
Args:
:pretrained_name_or_path (str): Model card name from HuggingFace or local path to model directory.
:continuous_batching (bool): Whether the model will later be used for continuous batching. If this is not set to True here, the model cannot be exported or compiled for continuous batching later.
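
The `continuous_batching` constraint described above can be illustrated with a small, self-contained sketch. This is not the QEfficient implementation: `SketchCausalLM`, its `compile` signature, and the `full_batch_size` parameter are hypothetical stand-ins, chosen only to show a flag fixed at load time gating a later step.

```python
class SketchCausalLM:
    """Hypothetical stand-in for illustration; NOT the QEfficient class."""

    def __init__(self, name_or_path: str, continuous_batching: bool = False):
        self.name_or_path = name_or_path
        # The flag is fixed at load time and consulted by later steps.
        self.continuous_batching = continuous_batching

    @classmethod
    def from_pretrained(cls, pretrained_name_or_path: str,
                        continuous_batching: bool = False) -> "SketchCausalLM":
        # A real loader would fetch weights here; this sketch only records arguments.
        return cls(pretrained_name_or_path, continuous_batching)

    def compile(self, full_batch_size=None) -> str:
        # Assumption: passing full_batch_size requests continuous batching.
        if full_batch_size is not None and not self.continuous_batching:
            raise ValueError(
                "continuous_batching must be set to True in from_pretrained "
                "before compiling for continuous batching"
            )
        mode = "continuous" if full_batch_size is not None else "regular"
        return f"{self.name_or_path}:{mode}"


def try_compile(model: SketchCausalLM, full_batch_size=None) -> str:
    """Return the compile result, or the error message if compilation is refused."""
    try:
        return model.compile(full_batch_size=full_batch_size)
    except ValueError as err:
        return f"error: {err}"
```

In this sketch, a model loaded with `continuous_batching=True` compiles in either mode, while one loaded without the flag is refused as soon as `full_batch_size` is passed.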