Any explanations about conv_tbc? Thanks ~~ #172

I read the source code but I can't figure out what the operation of conv_tbc is doing. Any explanations?

For example: if the input tensor has shape [10, 20] (batch size 10, max sentence length 20), word embedding produces a tensor of size [10, 20, 256] (embedding size 256). The transposed tensor of size [20, 10, 256] is then fed into the conv as x = conv(x) (the conv has 256 input channels and 512 output channels), which results in a new tensor of shape [20, 10, 512]. What is the conv doing? It seems that the conv treats the embedding axis as the channel axis?
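A minimal sketch of the shape flow described above (the dimension names and dummy vocabulary size are mine, assuming `torch.conv_tbc`):

```python
import torch
import torch.nn as nn

batch, seq_len, embed_dim, out_dim, kernel = 10, 20, 256, 512, 3

tokens = torch.randint(0, 1000, (batch, seq_len))   # [10, 20]
embed = nn.Embedding(1000, embed_dim)
x = embed(tokens)                                   # [10, 20, 256] = (B, T, C)
x = x.transpose(0, 1)                               # [20, 10, 256] = (T, B, C)

# conv_tbc weights are laid out as (kernel_width, in_channels, out_channels);
# the embedding dimension is indeed treated as the channel dimension.
weight = torch.randn(kernel, embed_dim, out_dim)
bias = torch.zeros(out_dim)
y = torch.conv_tbc(x, weight, bias, 1)              # pad=1 keeps T at 20
print(y.shape)                                      # torch.Size([20, 10, 512])
```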
conv_tbc is the same as torch.nn.Conv1d, but accepts a different input shape. The input shape for nn.Conv1d is batch x channels x time (BCT), which would require a transpose, since the rest of the network operates on time x batch x channel (TBC) tensors. conv_tbc takes time x batch x channel (TBC) input directly.
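To make the equivalence concrete, here is a hedged sketch (assuming `torch.conv_tbc` and the (kernel, in, out) weight layout used by fairseq's ConvTBC module) showing that conv_tbc on a TBC tensor matches nn.Conv1d on the transposed BCT tensor:

```python
import torch
import torch.nn as nn

T, B, C_in, C_out, K = 20, 10, 256, 512, 3

x_tbc = torch.randn(T, B, C_in)                 # (T, B, C) layout
weight = torch.randn(K, C_in, C_out)            # conv_tbc weight: (K, C_in, C_out)
bias = torch.zeros(C_out)
y_tbc = torch.conv_tbc(x_tbc, weight, bias, 1)  # stays TBC: (T, B, C_out)

# Same computation via nn.Conv1d, which wants (B, C, T) input
# and (C_out, C_in, K) weights, so both tensors must be permuted.
conv1d = nn.Conv1d(C_in, C_out, kernel_size=K, padding=1)
with torch.no_grad():
    conv1d.weight.copy_(weight.permute(2, 1, 0))
    conv1d.bias.copy_(bias)

y_bct = conv1d(x_tbc.permute(1, 2, 0))          # TBC -> BCT, then convolve
y_back = y_bct.permute(2, 0, 1)                 # BCT -> back to TBC

print(torch.allclose(y_tbc, y_back, atol=1e-5)) # True, up to float error
```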
Thanks for your reply! I will check the documentation for torch.nn.Conv1d.
myleott added a commit that referenced this issue on Jun 26, 2018.
renxida added a commit to llvm/torch-mlir that referenced this issue on Jan 24, 2024:

> Convolution with [time, batch, channel] ordering, as opposed to the default [batch, channel, time]. Currently implemented by transposing the input and output, but it may need its own implementation in the future, because this op is supposed to give a speedup. It is used by fairseq (facebookresearch/fairseq#172). (In case you were wondering, like me: this is different from transposed convolution, which has fractional strides.)
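For reference, a minimal sketch of that transpose-based lowering, expressed in PyTorch terms (the function name is mine; this is an illustration of the approach, not the torch-mlir code):

```python
import torch
import torch.nn.functional as F

def conv_tbc_via_conv1d(x_tbc, weight_tbc, bias, pad=0):
    """Emulate torch.conv_tbc by permuting around a standard conv1d."""
    x_bct = x_tbc.permute(1, 2, 0)        # (T, B, C) -> (B, C, T)
    w = weight_tbc.permute(2, 1, 0)       # (K, C_in, C_out) -> (C_out, C_in, K)
    y_bct = F.conv1d(x_bct, w, bias, padding=pad)
    return y_bct.permute(2, 0, 1)         # (B, C_out, T) -> (T, B, C_out)
```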