
About the attention calculation in FASA #14

Open
csguoh opened this issue Oct 1, 2023 · 4 comments

Comments

@csguoh

csguoh commented Oct 1, 2023

Hi, authors.
This work inspires me a lot! I have a question about the Frequency domain-based self-attention solver.
In this line, it seems that you directly use element-wise multiplication, whereas classic attention uses matrix multiplication (matmul, or @).
I cannot find any explanation of this in the paper, so could you give me some insight into it? Thanks :D

@ccyppl

ccyppl commented Oct 10, 2023

Hi, authors.
Here, element-wise multiplication is used instead of matrix multiplication.
I am also confused about this and hope to get an answer. Thanks.

@HanzhouLiu

I guess the reason the authors use the element-wise product is that multiplication in the frequency domain is equivalent to convolution in the spatial domain.
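
For example, here is a minimal PyTorch check of that convolution theorem (my own sketch, not code from the repository): an element-wise product in the frequency domain gives the same result as a circular convolution computed directly in the spatial domain.

import torch

torch.manual_seed(0)
a = torch.randn(8)
b = torch.randn(8)

# Element-wise product in the frequency domain, then back to the spatial domain.
freq_result = torch.fft.ifft(torch.fft.fft(a) * torch.fft.fft(b)).real

# Circular convolution computed directly in the spatial domain:
# (a conv b)[i] = sum_j a[j] * b[(i - j) mod n]
n = a.numel()
spatial_result = torch.stack(
    [(a * b[(i - torch.arange(n)) % n]).sum() for i in range(n)]
)

print(torch.allclose(freq_result, spatial_result, atol=1e-5))  # True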

@haodongzhang0118

But from the code, when the authors do the element-wise multiplication in this line, they have already come back to the spatial domain because of this line, I think. So the calculation is not done in the frequency domain.

@HanzhouLiu

Just my opinion,
out = self.norm(out) # calculate the score matrix
output = v * out # multiply the v matrix by the score matrix
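
To make the flow concrete, here is a rough self-contained sketch of the computation being discussed (my own reading of the thread, not the authors' code; the module name, tensor layout, and choice of normalization are assumptions): q and k are combined by an element-wise product in the frequency domain, the result is brought back to the spatial domain and normalized into a score map, and that map is applied to v element-wise rather than via a matmul.

import torch
import torch.nn as nn

class FrequencyAttentionSketch(nn.Module):
    def __init__(self, channels):
        super().__init__()
        # Normalization choice is an assumption; the paper/repo may differ.
        self.norm = nn.LayerNorm(channels)

    def forward(self, q, k, v):
        # q, k, v: (B, H, W, C)
        q_fft = torch.fft.rfft2(q, dim=(1, 2))
        k_fft = torch.fft.rfft2(k, dim=(1, 2))

        # Element-wise product in the frequency domain
        # (equivalent to a circular convolution over H and W in the spatial domain).
        out = q_fft * k_fft

        # Back to the spatial domain, then normalize into a score map.
        out = torch.fft.irfft2(out, s=q.shape[1:3], dim=(1, 2))
        out = self.norm(out)

        # Apply the score map to v element-wise (no matmul).
        return v * out

attn = FrequencyAttentionSketch(channels=16)
q = k = v = torch.randn(2, 32, 32, 16)
print(attn(q, k, v).shape)  # torch.Size([2, 32, 32, 16])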
