Conformer misses relative pos encoding #132

Closed
albertz opened this issue Apr 22, 2022 · 6 comments

albertz commented Apr 22, 2022

There is self-attention without any positional encoding.

Related: Transformer rel pos #74.

There should probably be an nn.RelPosSelfAttention, similar to nn.SelfAttention but with relative positional encoding. It should use the Transformer-XL rel pos encoding. (We can test other rel pos encodings separately later.) See the sketch below.
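As a rough illustration, here is a minimal PyTorch sketch of Transformer-XL style relative positional self-attention, in the spirit of ESPnet's RelPositionMultiHeadedAttention. The class and parameter names are hypothetical and this is not the returnn_common nn API; it only shows the content/position score terms (the u/v biases) and the rel-shift trick.

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F


class RelPosSelfAttentionSketch(nn.Module):
    """Transformer-XL style relative positional multi-head self-attention (sketch)."""

    def __init__(self, model_dim: int, num_heads: int):
        super().__init__()
        assert model_dim % num_heads == 0
        self.num_heads = num_heads
        self.head_dim = model_dim // num_heads
        self.qkv = nn.Linear(model_dim, 3 * model_dim)
        self.pos_proj = nn.Linear(model_dim, model_dim, bias=False)  # W_R in Transformer-XL
        self.out = nn.Linear(model_dim, model_dim)
        # Global content/position biases (u and v in the Transformer-XL paper).
        self.pos_bias_u = nn.Parameter(torch.zeros(num_heads, self.head_dim))
        self.pos_bias_v = nn.Parameter(torch.zeros(num_heads, self.head_dim))

    @staticmethod
    def _rel_shift(x: torch.Tensor) -> torch.Tensor:
        # x: (batch, heads, time, 2*time-1) -> (batch, heads, time, time).
        # Aligns the 2*T-1 relative-distance scores with the T x T attention matrix.
        b, h, t, _ = x.shape
        x = F.pad(x, (1, 0))                           # (b, h, t, 2t)
        x = x.reshape(b, h, 2 * t, t)[:, :, 1:]        # drop the padded slice
        return x.reshape(b, h, t, 2 * t - 1)[..., :t]

    def forward(self, x: torch.Tensor, pos_emb: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, model_dim).
        # pos_emb: (2*time-1, model_dim), sinusoidal encodings for relative
        # distances time-1 .. -(time-1), as produced by the rel pos encoding module.
        b, t, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        q = q.view(b, t, self.num_heads, self.head_dim).transpose(1, 2)
        k = k.view(b, t, self.num_heads, self.head_dim).transpose(1, 2)
        v = v.view(b, t, self.num_heads, self.head_dim).transpose(1, 2)
        p = self.pos_proj(pos_emb).view(-1, self.num_heads, self.head_dim)
        p = p.permute(1, 0, 2)  # (heads, 2*time-1, head_dim)
        # Content-content term (a+c) and content-position term (b+d).
        ac = torch.einsum("bhqd,bhkd->bhqk", q + self.pos_bias_u[None, :, None, :], k)
        bd = torch.einsum("bhqd,hkd->bhqk", q + self.pos_bias_v[None, :, None, :], p)
        bd = self._rel_shift(bd)
        attn = torch.softmax((ac + bd) / math.sqrt(self.head_dim), dim=-1)
        out = torch.einsum("bhqk,bhkd->bhqd", attn, v)
        return self.out(out.transpose(1, 2).reshape(b, t, -1))
```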

albertz commented Oct 12, 2022

See here for an example: https://github.com/espnet/espnet/blob/4138010fb66ad27a43e8bee48a4932829a0847ae/espnet2/asr/encoder/conformer_encoder.py#L132
(Up to date as of 2022-10-17.)

- RelPositionMultiHeadedAttention
- RelPositionalEncoding

The RelPositionalEncoding is then also part of the frontend (e.g. Conv2dSubsampling6) (also see #219).
Specifically, at the end of the frontend, there is another Linear projection to the model dim, followed by the pos enc (here).

RelPositionalEncoding must be used together with RelPositionMultiHeadedAttention.
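For illustration, a minimal sketch of how such a sinusoidal relative positional encoding and the frontend projection could fit together. This is plain PyTorch with hypothetical names, not the actual ESPnet or returnn_common code; it only shows the 2*T-1 relative encodings (distances T-1 .. -(T-1)) and the Linear-to-model-dim step right before them.

```python
import math
import torch
import torch.nn as nn


def rel_positional_encoding(time: int, model_dim: int) -> torch.Tensor:
    # Returns (2*time - 1, model_dim): row 0 corresponds to relative distance
    # time-1, the middle row to distance 0, the last row to -(time-1).
    positions = torch.arange(time - 1, -time, -1, dtype=torch.float32).unsqueeze(1)
    div_term = torch.exp(
        torch.arange(0, model_dim, 2, dtype=torch.float32)
        * -(math.log(10000.0) / model_dim)
    )
    pe = torch.zeros(2 * time - 1, model_dim)
    pe[:, 0::2] = torch.sin(positions * div_term)
    pe[:, 1::2] = torch.cos(positions * div_term)
    return pe


class FrontendWithRelPos(nn.Module):
    # After the conv subsampling, project to the model dim and compute the
    # relative positional encoding that the attention layers will consume.
    def __init__(self, in_dim: int, model_dim: int):
        super().__init__()
        self.proj = nn.Linear(in_dim, model_dim)

    def forward(self, x: torch.Tensor):
        # x: (batch, time, in_dim), e.g. the output of Conv2dSubsampling6.
        x = self.proj(x)
        pos_emb = rel_positional_encoding(x.size(1), x.size(-1)).to(x.device)
        return x, pos_emb  # both are then passed to each attention layer
```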

albertz commented Oct 17, 2022

Note that all our existing Conformer recipes in RETURNN currently use the RETURNN RelativePositionalEncodingLayer (example). I.e. that is not the Transformer-XL variant, which is the standard now. We should probably implement the Transformer-XL variant directly.

But for direct comparisons to old setups, we might also need to implement the old variant. See the sketch below.
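For contrast, a rough sketch of the older scheme, assuming it follows Shaw et al. (2018), i.e. relative position embeddings added to the keys inside the attention score, without the separate Transformer-XL projection and u/v biases. Names are hypothetical and this is not the RETURNN layer's actual API:

```python
import torch


def shaw_rel_attention_scores(q: torch.Tensor, k: torch.Tensor,
                              rel_emb: torch.Tensor) -> torch.Tensor:
    # q, k: (batch, heads, time, head_dim); rel_emb: (time, time, head_dim),
    # where rel_emb[i, j] encodes the (clipped) relative distance j - i.
    content = torch.einsum("bhqd,bhkd->bhqk", q, k)
    position = torch.einsum("bhqd,qkd->bhqk", q, rel_emb)
    return (content + position) / q.size(-1) ** 0.5
```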

albertz commented Oct 17, 2022

Interestingly, the Fairseq RelPositionalEncoding looks almost identical to the ESPnet implementation. It looks like the code was copied, but there is no mention of this at all. Fairseq is MIT-licensed, ESPnet is Apache-licensed. (@sravyapopuri388 @sw005320 @pengchengguo maybe you know the history?)

albertz commented Oct 17, 2022

@pengchengguo

Hi @albertz,

I think the Fairseq RelPositionalEncoding is based on ESPnet's, since they distinguish between the Fairseq and ESPnet RPE/attention types here: https://github.com/facebookresearch/fairseq/blob/b7b7928065ec90bef8f9a489b0128a4dce560d57/fairseq/modules/conformer_layer.py#L187.

@sravyapopuri388

@albertz The fairseq implementation is based on the Espnet one for relative positional encoding. Seems like we are indeed missing some citations. Will make sure to add them soon. Thanks!
