Skip to content

[webgpu] Apply Flash Attention if sliding window exceeds KV cache len…

de8c7fe
Select commit
Loading
Failed to load commit list.
Merged

Cherry-picks for ORT 1.23.0 #25620

[webgpu] Apply Flash Attention if sliding window exceeds KV cache len…
de8c7fe
Select commit
Loading
Failed to load commit list.
Azure Pipelines / ONNX Runtime React Native CI Pipeline (Build_Android_Packages Android_Java_API_AAR_Packaging_For_React_Native) succeeded Aug 1, 2025 in 11m 31s

Build_Android_Packages Android_Java_API_AAR_Packaging_For_React_Native succeeded