forked from vllm-project/vllm
Open
Labels
documentation (improvements or additions to documentation), stale
Description
- attention backend docs: docs
- backend selection logic: API of the `vllm.attention.selector` module (`selector.py`)
- adding a custom attention backend
  - a custom backend should be implemented under the `vllm.attention.backends` module and registered in the selector logic
- #7315 (FlexAttention)
- #5083 (Triton-only backend)
- PR #21366 (Auto-Select Attention Backend)