Data verify_out_shape not intuitive when there are implicit dims #1153

albertz · 2022-10-19T10:17:25Z

(I use returnn-common code here but nothing is really specific about returnn-common. nn.zeros is just ConstantLayer.)

Consider this code:

x = nn.zeros((dim1, dim2))
x.verify_out_shape({dim1, dim2})

This is correct when dim1 and dim2 are static dims.

Once they are dynamic dims and introduce implicit dims, the verify_out_shape fails because it does not cover the implicit dims. E.g. when dim1 has a dyn_size_ext of shape [B], then currently you actually need:

x.verify_out_shape({dim1, dim2, batch_dim})

Or:

x.verify_out_shape({dim1, dim2, ImplicitDynSizeDim(batch_dim)})

Because the batch dim is an implicit dim.

Why do we have implicit dims? What are those? They were introduced mostly for CumConcatLayer. See #391, #589.

Example of such error: rwth-i6/returnn_common#226

See also the earlier discussion on the introduction of out_shape and verify_out_shape: #706, #757

The text was updated successfully, but these errors were encountered:

albertz · 2022-10-19T10:28:22Z

I'm thinking about whether this automatic introduction of implicit dims was a good idea. This original code really should work, as it would be quite confusing otherwise:

x = nn.zeros((dim1, dim2))
x.verify_out_shape({dim1, dim2})

Maybe these implicit dims should actually be explicitly defined. We still would have them, but they are explicitly defined. E.g. a layer like CumConcatLayer would explicitly add those. How would that work? Is this attached to another dim tag? Some attrib like Dim.implicit_dims? Or attached to the Data?

albertz · 2022-10-19T10:28:40Z

@Zettelkasten what do you think about this? Specifically, here (#706 (comment)) you said:

Do we want to force the user to specify implicit dims if they specify out_shape? I tend to think yes (if you think about recurrent self attention e.g., it's super important that the keys tensor has an implicit query-time dim. I think, if I were to write this down on paper, I would also include it always.)

albertz · 2022-10-19T10:40:55Z

We could also allow returnn-common to be a bit more relaxed about this and verify_out_shape on returnn-common side would allow to ignore implicit dims, or maybe have an option check_implicit_dims which is False by default.

#1153

rwth-i6/returnn#1153 Fix #226

albertz · 2022-10-19T11:22:18Z

We could also allow returnn-common to be a bit more relaxed about this and verify_out_shape on returnn-common side would allow to ignore implicit dims, or maybe have an option check_implicit_dims which is False by default.

I did that now, just to fix rwth-i6/returnn_common#226 for now. However, I think this issue here remains, and we should think about it more. Maybe implement my suggestion here on handling it explicitly?

rwth-i6/returnn#1153

albertz mentioned this issue Oct 19, 2022

Error when verifying output shape for relative pos emb rwth-i6/returnn_common#226

Closed

albertz added a commit that referenced this issue Oct 19, 2022

Data verify_out_shape allow_missing_implicit_dims option

8ae5332

#1153

albertz added a commit that referenced this issue Oct 19, 2022

test_Data_verify_out_shape_optional_implicit_dim

c099773

#1153

albertz added a commit to rwth-i6/returnn_common that referenced this issue Oct 19, 2022

Tensor verify_out_shape, ignore missing implicit dims

0727228

rwth-i6/returnn#1153 Fix #226

albertz mentioned this issue Oct 22, 2022

Dim internals and API should be refactored #975

Open

albertz added a commit to rwth-i6/returnn_common that referenced this issue Dec 13, 2022

Tensor.shape without implicit dims

b6555a9

rwth-i6/returnn#1153

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Data verify_out_shape not intuitive when there are implicit dims #1153

Data verify_out_shape not intuitive when there are implicit dims #1153

albertz commented Oct 19, 2022 •

edited

Loading

albertz commented Oct 19, 2022

albertz commented Oct 19, 2022 •

edited

Loading

albertz commented Oct 19, 2022

albertz commented Oct 19, 2022

Data verify_out_shape not intuitive when there are implicit dims #1153

Data verify_out_shape not intuitive when there are implicit dims #1153

Comments

albertz commented Oct 19, 2022 • edited Loading

albertz commented Oct 19, 2022

albertz commented Oct 19, 2022 • edited Loading

albertz commented Oct 19, 2022

albertz commented Oct 19, 2022

albertz commented Oct 19, 2022 •

edited

Loading

albertz commented Oct 19, 2022 •

edited

Loading