Closed
Description
Report
Hi, very nice work!
I'm trying to understand the code of TokenAttentionPooling, it seems to me the class token will always attend to every payload token. And since there is only a single attention call, the class token is return as the result, the input provide mask should not have any effect? I used some different masks and the result of this class is identical. Or maybe I missed something here?
CellFlow/src/cellflow/networks/_utils.py
Line 420 in 4970ad4
Besides, I wonder in practice, how do you choose between TokenAttentionPooling and SeedAttentionPooling?
Thanks!
Version information
No response