Is it possible to add support for Kosmos-2 Model #2123

Open
VishalBCA-ops opened this issue Jan 25, 2024 · 2 comments
Labels
question Response providing clarification needed. Will not be assigned to a release. (type)

Comments

@VishalBCA-ops

Hello Coremltools Team,

Is it possible to convert the Kosmos-2 model (https://huggingface.co/microsoft/kosmos-2-patch14-224) to Core ML so that I can run it on my Mac?

It's an image-to-text (image captioning) model.

I have tried, but I could not find anything helpful. Is there any way I, or we, could add support for this?

Thanks!!!!

❓Question

@VishalBCA-ops VishalBCA-ops added the question Response providing clarification needed. Will not be assigned to a release. (type) label Jan 25, 2024
@YifanShenSZ
Collaborator

YifanShenSZ commented Jan 26, 2024

Hi @VishalBCA-ops, many thanks for bringing the awesome Kosmos model to our attention! I believe this model is in PyTorch format, so you could probably start by trying our PyTorch converter.
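For reference, a minimal sketch of what trying the PyTorch converter might look like: trace a module with `torch.jit.trace`, then pass the traced module to `coremltools.convert`. The helper names `example_input_shape` and `convert_traced_module`, the input name `pixel_values`, and the 3-channel 224x224 input (guessed from the `patch14-224` checkpoint name) are assumptions for illustration. Whether the full Kosmos-2 model traces and converts cleanly is not confirmed in this thread, so it may be necessary to convert submodules one at a time.

```python
def example_input_shape(image_size=224, channels=3, batch=1):
    """Shape of a dummy image batch (assumed layout: batch, channels, height, width)."""
    return (batch, channels, image_size, image_size)


def convert_traced_module(module, input_shape, input_name="pixel_values"):
    """Trace a torch.nn.Module and convert it with the coremltools PyTorch converter.

    `module` is any torch.nn.Module that takes a single image tensor; using it
    for a Kosmos-2 submodule is an assumption, not a tested recipe.
    """
    # Imported lazily so this file can be loaded without the heavy dependencies.
    import torch
    import coremltools as ct

    module.eval()  # disable dropout and similar training-only behavior
    example = torch.rand(*input_shape)  # dummy input used only for tracing
    traced = torch.jit.trace(module, example)

    # Convert the traced TorchScript module to an ML Program.
    return ct.convert(
        traced,
        inputs=[ct.TensorType(name=input_name, shape=input_shape)],
        convert_to="mlprogram",
    )
```

The returned model can then be saved with `.save("Name.mlpackage")`. If tracing fails on a particular operation, the error message usually names the unsupported op, which is a useful detail to report back in this issue.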

@Suraj209211

@VishalBCA-ops, hello. I looked at the Kosmos model: it is basically a multimodal large language model (MLLM) that predicts what comes next once the user submits a request. It is Transformer-based and generates text from an image.

So I think it is possible to add Kosmos support to Core ML, but the whole model would need to be converted using the onnx-coreml package.

Let me try with the Hugging Face code, and I'll let you know about this soon.
