Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Kernel][Backend][Model] Blocksparse flash attention kernel and Phi-3-Small model #4799
[Kernel][Backend][Model] Blocksparse flash attention kernel and Phi-3-Small model #4799
Changes from 82 commits
0e4c28d
b2e7c0a
d5308c5
d73cdb3
176275e
1116e01
24ab443
b6c2ebe
4e28773
bccec2f
ab0df74
f11d590
1670a3d
8531eaa
b20312c
26c6222
0a52b2b
f85da14
0ee826b
3891f22
1440eba
6571c58
7143bac
a1f37a9
8ff8be7
439c7c7
bfba8d5
9473082
f4c53d3
809f3f5
7868f0a
7d92de3
ca27e7a
7c0cfd7
5389e35
e5747f8
c68ecb5
0718c86
85b0ed5
66b04d6
c39be85
b2df3f7
2ff8778
bb0ff75
e5c7212
dbd6b47
8eac29c
71663e8
0be4ce2
aa65d2e
fd5486a
63b9bb8
8d3ec74
e1dd365
561d5a8
bfd3c80
7646e00
201c2c1
c1f7c26
3db3010
55f0d4b
36009b4
0c6b10c
a989bc4
a3efa6a
a26c269
d3f2943
525c48d
9b7d192
2eefeda
90a0a87
300797c
d5cd48c
04b3cdc
83b23e5
87bd2ac
d6ea404
8e86707
69d412e
33a1930
e9dc082
dfc07c7
1197728
2955cec
eb16d9a
1600156
e7f9918
2afd8b1
def0c4c
359cc7f
6d0441b
52bf2b5
97f3662
754e306
c834882
644fc14
435dd38
8a22c26
8554331
547692e
daf94f3
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
Large diffs are not rendered by default.