@rkazants rkazants commented Dec 18, 2025

What does this PR do?

Note: currently, the model is a remote-code model for the transformers "4.x.y" release line.

Example of conversion cmd-line for arcee-ai/Trinity-Nano-Preview:

optimum-cli export openvino -m arcee-ai/Trinity-Nano-Preview Trinity-Nano-Preview --trust-remote-code

Example of inference for arcee-ai/Trinity-Nano-Preview using OpenVINO backend:

from transformers import AutoTokenizer
from optimum.intel.openvino import OVModelForCausalLM

# load from the directory produced by the export command above
model_path = "Trinity-Nano-Preview"

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = OVModelForCausalLM.from_pretrained(model_path, trust_remote_code=True)

# change input text as desired
input_text = "The capital of France is"
# tokenize the text
input_tokens = tokenizer(input_text, return_tensors="pt")
# generate output tokens (max_new_tokens bounds the generated text,
# not the total sequence length including the prompt)
output = model.generate(**input_tokens, max_new_tokens=10)
# decode output tokens into text
output = tokenizer.batch_decode(output, skip_special_tokens=True)
print(output[0])

Before submitting

  • [N/A] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

@rkazants rkazants changed the title [OpenVINO] Support Arcee Trinity models collection [OpenVINO] Support Arcee Trinity (aka Afmoe) models collection Dec 18, 2025
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@nikita-savelyevv nikita-savelyevv added the openvino-slow Runs OpenVINO slow tests with different versions of transformers label Dec 18, 2025
@nikita-savelyevv nikita-savelyevv left a comment (Contributor)

I observe that the export and inference take a very long time. Could you please add the same warning as the one we recently added for Zamba2?

Another issue is that weight quantization is very slow because there are 20k weight constants to quantize. I believe we had a similar issue for Qwen/Qwen3-30B-A3B until MoE merging was added to OpenVINO. Do you know whether the same approach is expected to work for this model?
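For reference, the quantization discussion above concerns the default weight-compression path at export time; a 4-bit export can be requested explicitly instead. A minimal sketch (flags as in current optimum-cli releases; the output directory name is illustrative, not part of this PR):

```shell
# Sketch: request 4-bit weight compression at export time instead of
# the default path; adjust flags as needed for this model.
optimum-cli export openvino \
    -m arcee-ai/Trinity-Nano-Preview \
    --weight-format int4 \
    --trust-remote-code \
    Trinity-Nano-Preview-int4
```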

Also, please take a look at the failing tests.

    library_name="transformers",
)
class AfmoeOpenVINOConfig(LlamaOpenVINOConfig):
    MIN_TRANSFORMERS_VERSION = "4.55.4"
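A `MIN_TRANSFORMERS_VERSION` gate like the one above is typically enforced with a version comparison. A minimal self-contained sketch (the helper names are hypothetical, not optimum-intel's actual check, and plain "X.Y.Z" versions only):

```python
# Sketch of enforcing a minimum transformers version for an export config.
# parse_version / is_transformers_supported are hypothetical helpers; they
# do not handle pre-release tags like "5.0.0rc0".

MIN_TRANSFORMERS_VERSION = "4.55.4"

def parse_version(v: str) -> tuple:
    # "4.55.4" -> (4, 55, 4), so tuples compare component-wise
    return tuple(int(part) for part in v.split("."))

def is_transformers_supported(installed: str) -> bool:
    # True when the installed transformers version meets the minimum
    # required by the export config
    return parse_version(installed) >= parse_version(MIN_TRANSFORMERS_VERSION)

print(is_transformers_supported("4.55.4"))  # True
print(is_transformers_supported("4.54.0"))  # False
```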
Collaborator

Since this architecture is already supported in transformers (https://github.com/huggingface/transformers/blob/v5.0.0rc0/src/transformers/models/afmoe/modeling_afmoe.py), I think we shouldn't add support for the remote-code version. We would also need to add support for the latest transformers version (I will open a PR for this soon).

Collaborator Author

Hi @echarlaix, we have an urgent request from a customer for this model, and optimum-intel doesn't support transformers v5 yet. I propose proceeding with remote-code support for v4 for now.

@rkazants rkazants removed the openvino-slow Runs OpenVINO slow tests with different versions of transformers label Jan 8, 2026
@IlyasMoutawwakil (Member)

Thanks for the addition, @rkazants!
There are still some failures in export; could you please check those?


5 participants