Masking is an intra-layer behavior in TF LSTM [1], but it is not an intra-op behavior in ONNX LSTM [2]. When converted to ONNX, a masked TF LSTM layer is converted to a Loop op. This over-complicates the ONNX model and hurts inference performance in ORT, which cannot apply its LSTM optimizations to the Loop. (issue #1871)
This commit adds support to convert masked LSTM correctly, under the important assumption that the input must be post-padded, which is the most common use case. The masking info is conveyed to the ONNX LSTM op as `sequence_lens`, which is dynamically computed by counting the non-padded timesteps per batch entry for each LSTM (see the sketch after the references). This behavior is implemented with reference to keras2onnx PR#386 [3]. Additional logic is added for backward LSTM so that the input sequence is reversed correctly given `sequence_lens`.

Note that if masking is enabled and the LSTM input is pre- or randomly padded, the converted ONNX model will produce incorrect inference results. Unless ONNX adds a new attribute, e.g. `mask_enabled`, to the RNN ops, the converter alone may not be able to handle generic masking while keeping the RNN ops, since masking alters intra-op behavior. Given this limitation, I'd like to share this PR for further comments and suggestions.

[1] https://www.tensorflow.org/guide/keras/masking_and_padding#masking
[2] https://github.com/onnx/onnx/blob/main/docs/Operators.md#LSTM
[3] onnx/keras-onnx#386
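
For illustration, here is a minimal NumPy sketch of the equivalent `sequence_lens` computation (the converter emits the corresponding ONNX ops in the graph; the function name and the all-zero-padding assumption are illustrative, matching the `mask_zero=True` embedding case below):

```python
import numpy as np

def compute_sequence_lens(x):
    """Per-batch count of non-padded timesteps for a post-padded input.

    x: float array of shape [batch, time, features], where padded
    timesteps are all-zero vectors. Returns int32 [batch].
    """
    # A timestep counts as real data if any feature in it is non-zero.
    nonpad = np.any(x != 0, axis=-1)             # [batch, time] bool
    # With post-padding, summing the mask gives the true length.
    return nonpad.sum(axis=-1).astype(np.int32)  # [batch]

# Example: batch of 2, max length 4; second entry padded after step 2.
x = np.zeros((2, 4, 3), dtype=np.float32)
x[0, :4] = 1.0
x[1, :2] = 1.0
print(compute_sequence_lens(x))  # [4 2]
```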
Details:
Forward LSTM
Here's a minimal example with an embedded LSTM (`mask_zero=True`):
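
A minimal sketch of such a model, with illustrative layer sizes and token ids (not taken from this PR):

```python
import numpy as np
import tensorflow as tf

# Embedding with mask_zero=True marks token id 0 as padding, so the
# downstream LSTM skips those timesteps (intra-layer masking in TF).
model = tf.keras.Sequential([
    tf.keras.layers.Embedding(input_dim=100, output_dim=8, mask_zero=True),
    tf.keras.layers.LSTM(16),
])

# Post-padded input: real tokens first, zeros (padding) at the end.
tokens = np.array([[5, 12, 7, 0, 0],
                   [3, 0, 0, 0, 0]], dtype=np.int32)
print(model(tokens).shape)  # (2, 16)
```

With this change, the conversion maps the Keras mask to the ONNX LSTM's `sequence_lens` input instead of falling back to a Loop op.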
Reverse LSTM

For backward LSTM, `tf.raw_ops.ReverseV2` is converted to `ReverseSequence` so that the LSTM input is reversed correctly given `sequence_lens`:
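
To see why a plain reverse is wrong for padded batches, here is an illustrative NumPy sketch (not the converter's actual code) contrasting `ReverseV2`-style full reversal with `ReverseSequence` semantics driven by `sequence_lens`:

```python
import numpy as np

def reverse_sequence(x, seq_lens):
    """Reverse only the first seq_lens[b] timesteps of each batch entry,
    leaving the trailing padding in place (ONNX ReverseSequence semantics
    with batch_axis=0, time_axis=1)."""
    out = x.copy()
    for b, n in enumerate(seq_lens):
        out[b, :n] = x[b, :n][::-1]
    return out

x = np.array([[1, 2, 3, 0, 0]], dtype=np.float32)  # post-padded, length 3
print(x[:, ::-1])                # [[0. 0. 3. 2. 1.]] padding moves to front
print(reverse_sequence(x, [3]))  # [[3. 2. 1. 0. 0.]] padding stays at end
```

Keeping the padding at the end is what lets the backward ONNX LSTM still honor `sequence_lens` after the reversal.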