Sparse attention: Generalize arch checks for A100 and above #73


Merged: 1 commit into vllm-project:main on Jul 28, 2025

Conversation

ExtReMLapin

No description provided.

ExtReMLapin marked this pull request as ready for review on July 11, 2025 07:45
@ExtReMLapin (Author)

Tested on an RTX 5090 with fp8 quantization AND tensor split at 2, working 👍🏻
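
For reference, a minimal sketch of the tested configuration, assuming the standard vLLM `LLM` entry point; the model name is a placeholder, and "tensor split at 2" is read here as tensor parallelism across 2 GPUs:

```python
# Hypothetical reproduction of the reported setup: fp8 quantization plus a
# 2-way tensor split. quantization and tensor_parallel_size are standard
# arguments of vLLM's LLM class; the model name below is only a placeholder.
from vllm import LLM

llm = LLM(
    model="meta-llama/Llama-3.1-8B-Instruct",  # placeholder model
    quantization="fp8",        # fp8 quantization, as in the test report
    tensor_parallel_size=2,    # "tensor split at 2": shard across 2 GPUs
)
print(llm.generate("Hello")[0].outputs[0].text)
```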

@LucasWilkinson (Collaborator) left a comment


Makes sense to me since we build for 8.0+PTX; can you please just sign off on the commits so the DCO check passes?
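
For context, the change can be read as relaxing an exact-architecture match into a lower bound. The sketch below is hypothetical (the helper name is made up; this is not the actual diff), but it illustrates the reviewer's point: kernels built for 8.0+PTX run on any GPU with compute capability 8.0 or higher, so the gate only needs a >= check:

```python
import torch

def sparse_attention_supported(device: int = 0) -> bool:
    # Hypothetical helper, not the actual diff. Instead of matching A100
    # (sm_80) exactly, accept any compute capability >= 8.0: newer GPUs
    # (e.g. Hopper, or the RTX 5090's Blackwell) can JIT the 8.0+PTX build.
    major, minor = torch.cuda.get_device_capability(device)
    return (major, minor) >= (8, 0)
```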

Signed-off-by: CNE FICHEPOIL Pierre <pierre-1.fichepoil@gendarmerie.interieur.gouv.fr>
@ExtReMLapin (Author) commented Jul 17, 2025

Sorry about the DCO thing, most of my PRs are rushed. Should be good now.

@ExtReMLapin (Author)

@LucasWilkinson anything else missing?

LucasWilkinson merged commit 2c87db5 into vllm-project:main on Jul 28, 2025
1 check passed
@LucasWilkinson (Collaborator)

Looks good, thanks for the contribution!
