We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
Hi, i need to evaluate a model with SWA so i think i need to change the attention from spda => flash_attention or others since i get
Sliding Window Attention is enabled but not implemented for `sdpa`; unexpected results may be encountered.