@abukhoy (Contributor) commented Oct 14, 2025

This pull request updates the ONNX opset version from 13 to 17.
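For reviewers who want to see what the opset bump amounts to at the export call, here is a minimal sketch. It is not the code changed in this PR: the constant `ONNX_OPSET_VERSION`, the helpers `build_export_kwargs` and `export_and_read_opset`, and the tiny `torch.nn.Linear` model are all hypothetical and used only for illustration. The only real APIs assumed are `torch.onnx.export` (with its `opset_version` argument) and `onnx.load`.

```python
# Hypothetical constant; the real name and location in the repository
# are not shown in this PR.
ONNX_OPSET_VERSION = 17  # previously 13


def build_export_kwargs(opset_version: int = ONNX_OPSET_VERSION) -> dict:
    """Collect the keyword arguments passed to torch.onnx.export."""
    return {
        "opset_version": opset_version,
        "input_names": ["x"],
        "output_names": ["y"],
    }


def export_and_read_opset(path: str) -> int:
    """Export a tiny module and return the default-domain opset in the file."""
    # Imported lazily so the helper above works without torch/onnx installed.
    import onnx
    import torch

    model = torch.nn.Linear(4, 2).eval()
    example = torch.randn(1, 4)
    torch.onnx.export(model, (example,), path, **build_export_kwargs())

    exported = onnx.load(path)
    # The default ONNX operator set uses the empty domain string (or "ai.onnx").
    return next(
        entry.version
        for entry in exported.opset_import
        if entry.domain in ("", "ai.onnx")
    )
```

With `torch` and `onnx` installed, `export_and_read_opset("linear.onnx")` can be used to check which opset an exported file actually declares. Depending on the PyTorch version and exporter path, the declared opset may not match the requested one exactly, so treat this as a sanity check rather than a guarantee.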

Testing

Below are the models I have tested:

Causal Models

  • TinyLlama/TinyLlama-1.1B-Chat-v1.0
  • gpt2
  • Salesforce/codegen-350M-mono
  • microsoft/Phi-3-mini-4k-instruct
  • tiiuae/falcon-7b
  • Qwen/Qwen2-0.5B
  • Qwen/Qwen3-0.6B
  • bigcode/starcoder2-3b
  • Qwen/Qwen3-30B-A3B-Instruct-2507
  • Felladrin/Minueza-32M-Base
  • wtang06/mpt-125m-c4
  • hakurei/gpt-j-random-tinier
  • mistralai/Mixtral-8x7B-Instruct-v0.1
  • meta-llama/Llama-3.2-1B
  • unsloth/gemma-2b
  • unsloth/gemma-2-2b
  • TheBloke/TinyLlama-1.1B-Chat-v0.3-AWQ
  • TheBloke/Llama-2-7B-GPTQ
  • ibm-granite/granite-20b-code-base
  • neuralmagic/Llama-3.2-3B-Instruct-FP8
  • neuralmagic/Qwen2-0.5B-Instruct-FP8
  • ibm-granite/granite-3.1-2b-instruct
  • ibm-granite/granite-guardian-3.1-2b
  • hpcai-tech/grok-1
  • Snowflake/Llama-3.1-SwiftKV-8B-Instruct
  • allenai/OLMo-2-0425-1B

Embedding Models

  • BAAI/bge-base-en-v1.5
  • BAAI/bge-large-en-v1.5
  • BAAI/bge-small-en-v1.5
  • intfloat/e5-large-v2
  • sentence-transformers/multi-qa-mpnet-base-cos-v1
  • ibm-granite/granite-embedding-30m-english
  • ibm-granite/granite-embedding-125m-english
  • BAAI/bge-reranker-v2-m3
  • ibm-granite/granite-embedding-107m-multilingual
  • ibm-granite/granite-embedding-278m-multilingual

Vision Models

  • llava-hf/llava-1.5-7b-hf
  • OpenGVLab/InternVL2_5-1B
  • meta-llama/Llama-3.2-11B-Vision-Instruct
  • ibm-granite/granite-vision-3.2-2b
  • meta-llama/Llama-4-Scout-17B-16E-Instruct
  • google/gemma-3-4b-it

Audio Models

  • openai/whisper-tiny
  • openai/whisper-base
  • openai/whisper-small
  • openai/whisper-medium
  • openai/whisper-large
  • openai/whisper-large-v3-turbo

Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
@ochougul ochougul merged commit ed965fd into quic:main Nov 13, 2025
5 checks passed
abhishek-singh591 pushed a commit to quic-rishinr/efficient-transformers that referenced this pull request Nov 25, 2025

Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>
Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>