Skip to content

SA Output, Misalign w/ Fig. 4 Overview? #51

@GloryyrolG

Description

@GloryyrolG

hi @omerbt @MichalGeyer @duongna21 @ all folks et. al.,

thx for ur contribution! may i ask if there is a mismatch between the extracted token is the output of self-attn in the paper & implementation in the code (seems to be latent, the output of the whole attn blk, ff, or the input of self-attn)? thx & best

self.pivot_hidden_states[0][batch_idxs].reshape(-1, dim))

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions