
Mask in TokenAttentionPooling #226

Closed
@lhqing


Report

Hi, very nice work!

I'm trying to understand the code of TokenAttentionPooling. It seems to me that the class token always attends to every payload token, and since there is only a single attention call and the class token itself is returned as the result, the provided input mask should have no effect. I tried several different masks and the output of this class was identical. Or maybe I missed something here?

class TokenAttentionPooling(BaseModule):
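For context, here is a minimal sketch (not the repo's actual implementation; the class name, signature, and mask convention are assumptions) of class-token attention pooling. It illustrates the point above: the mask only changes the pooled output if it is actually forwarded into the attention call.

```python
import torch
import torch.nn as nn

class TokenAttentionPoolingSketch(nn.Module):
    """Hypothetical sketch: a learnable class token attends over payload tokens."""

    def __init__(self, dim: int, num_heads: int = 4):
        super().__init__()
        self.cls_token = nn.Parameter(torch.zeros(1, 1, dim))
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, x: torch.Tensor, key_padding_mask=None) -> torch.Tensor:
        # x: (batch, n_tokens, dim); key_padding_mask: (batch, n_tokens), True = ignore.
        cls = self.cls_token.expand(x.shape[0], -1, -1)
        # The mask has an effect only because it is passed through here;
        # if the attention call dropped it, every mask would give the same result.
        out, _ = self.attn(cls, x, x, key_padding_mask=key_padding_mask)
        return out.squeeze(1)

torch.manual_seed(0)
pool = TokenAttentionPoolingSketch(dim=8)
x = torch.randn(2, 5, 8)
mask = torch.zeros(2, 5, dtype=torch.bool)
mask[:, 3:] = True  # mask out the last two payload tokens

no_mask = pool(x)
with_mask = pool(x, key_padding_mask=mask)
print(torch.allclose(no_mask, with_mask))  # False: the mask changes the pooled output
```

If the real class produces identical outputs for different masks, that suggests the mask is never reaching the underlying attention call, which would match the behavior described above.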

Besides, I wonder how, in practice, you choose between TokenAttentionPooling and SeedAttentionPooling?

Thanks!

Version information

No response

Labels: bug (Something isn't working)