
[TFLite] Enable int64 biases for int16 quantized operators #12042

Merged: 1 commit merged into apache:main from the int16_ops_dense_requantize branch on Nov 15, 2022

Conversation

@leandron (Contributor) commented Jul 8, 2022

This enables int64 biases for quantized fully connected, requantize and transpose convolution in TFLite networks. It builds on the existing int16 support in the TFLite frontend.

cc @areusch for reviews
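As a minimal sketch of how an int16-quantized model exercises this path, the snippet below imports a TFLite flatbuffer into Relay; the file name, input name and input shape are placeholders for illustration, not part of this PR:

```python
import tflite  # flatbuffer bindings for .tflite files (pip install tflite)
from tvm import relay

# Load an int16-quantized TFLite model from disk (placeholder path).
with open("ds_cnn_int16.tflite", "rb") as f:
    tflite_model = tflite.Model.GetRootAsModel(f.read(), 0)

# With this change, quantized fully connected, requantize and transpose
# convolution accept the int64 biases that the TFLite int16 flow produces.
mod, params = relay.frontend.from_tflite(
    tflite_model,
    shape_dict={"input": (1, 49, 10, 1)},  # placeholder input shape
    dtype_dict={"input": "int16"},
)
print(mod)
```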

@github-actions bot requested a review from areusch on July 8, 2022 16:04
@Mousius (Member) left a comment


Can we add some test cases for this please @leandron 😸

@kuladeep2706 commented

Hello @leandron,

I'm working along similar lines and have a model with conv2d_transpose; all the other ops are already supported by your previously merged commit. I've made the same changes you made for conv2d_transpose in this patch, but the dequantize layer at the end is getting int64 input, which isn't right. Am I missing something that needs to be changed?

Thanks in advance!

@areusch added and then removed the needs-triage label (PRs or issues that need to be investigated by maintainers to find the right assignees) on Oct 19, 2022
@leandron force-pushed the int16_ops_dense_requantize branch from 7eb64a3 to a940412 on November 8, 2022 16:20
@leandron (Contributor, Author) commented Nov 8, 2022

> Hello @leandron,
>
> I'm working along similar lines and have a model with conv2d_transpose; all the other ops are already supported by your previously merged commit. I've made the same changes you made for conv2d_transpose in this patch, but the dequantize layer at the end is getting int64 input, which isn't right. Am I missing something that needs to be changed?
>
> Thanks in advance!

In TFLite, as of now, biases are set by default to int64 when int16 quantisation is used.

I have this model, which was created using the default int16 flow, and it can be used to check these internal data types with e.g. Netron.
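To illustrate where those int64 biases come from, here is a sketch of the TensorFlow 16x8 quantisation flow; `model` and `rep_data` are placeholders standing in for any small Keras model and a matching representative dataset:

```python
import tensorflow as tf

# Convert a Keras model (placeholder) with the 16x8 quantisation scheme:
# int16 activations, int8 weights.
converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = rep_data  # placeholder generator
converter.target_spec.supported_ops = [
    tf.lite.OpsSet.EXPERIMENTAL_TFLITE_BUILTINS_ACTIVATIONS_INT16_WEIGHTS_INT8
]
tflite_bytes = converter.convert()

# Inspecting the tensors shows that bias tensors come out as int64.
interpreter = tf.lite.Interpreter(model_content=tflite_bytes)
interpreter.allocate_tensors()
for detail in interpreter.get_tensor_details():
    print(detail["name"], detail["dtype"])
```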

@leandron force-pushed the int16_ops_dense_requantize branch 3 times, most recently from 5b67c87 to 1846d00 on November 9, 2022 10:54
@tvm-bot (Collaborator) commented Nov 9, 2022

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

This enables int64 biases for quantized fully connected, requantize
and transpose convolution in TFLite networks. It goes on top of existing
int16 support for TFLite frontend.

Add a test case using DS_CNN int16 quantized.

Change-Id: I3006ee76f5037fb6f915818358c9aada2faf40bf
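As a hedged sketch of what such an end-to-end check can look like (not necessarily the exact test added here), one can compile the imported Relay module and compare TVM's output against the TFLite interpreter on the same input; all names below are illustrative:

```python
import numpy as np
import tvm
from tvm import relay
from tvm.contrib import graph_executor

def run_tvm(mod, params, input_name, data):
    """Compile the imported Relay module for CPU and run one inference."""
    with tvm.transform.PassContext(opt_level=3):
        lib = relay.build(mod, target="llvm", params=params)
    module = graph_executor.GraphModule(lib["default"](tvm.cpu(0)))
    module.set_input(input_name, data)
    module.run()
    return module.get_output(0).numpy()

# `tvm_out` and `tflite_out` would come from run_tvm() and the TFLite
# interpreter respectively; small tolerances absorb rounding differences.
# np.testing.assert_allclose(tvm_out, tflite_out, rtol=1e-5, atol=1e-5)
```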
@leandron force-pushed the int16_ops_dense_requantize branch from 1846d00 to 98568b2 on November 9, 2022 15:23
@leandron (Contributor, Author) commented Nov 9, 2022

Please have another look.

@ashutosh-arm (Contributor) left a comment


Overall looks good to me. Do you know of any links to an int16 spec similar to https://www.tensorflow.org/lite/performance/quantization_spec (which covers int8 only)?

Review thread on src/relay/qnn/op/dense.cc (resolved)
@ekalda (Contributor) left a comment


Thanks @leandron, looks good to me!

@ashutosh-arm (Contributor) left a comment


Thanks @leandron. LGTM 😄

@Mousius merged commit 034dc67 into apache:main on Nov 15, 2022
@Mousius (Member) commented Nov 15, 2022

Sorry for the delay - thanks @leandron 😸

xinetzone pushed a commit to daobook/tvm that referenced this pull request on Nov 25, 2022: [TFLite] Enable int64 biases for int16 quantized operators (apache#12042)
7 participants