arm64 quantized model not working with ort. #345

narendra9079 · 2025-02-04T07:13:46Z

I am trying to run the quantized model https://huggingface.co/sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2/blob/main/onnx/model_qint8_arm64.onnx on an aarch64 machine, but it fails to generate embeddings. The failure occurs when I call session.run(). The avx2.onnx version runs fine on Linux. I am using these versions for ONNX Runtime:
ort = { version = "=2.0.0-rc.0", default-features = false, features=["ndarray"], optional = true }
ort-sys = { version = "=2.0.0-rc.0", optional = true }

Could you please share any suggestions?

Replies: 0 comments
