-
Notifications
You must be signed in to change notification settings - Fork 9.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ggml : add Flash Attention #5021
Merged
Merged
Commits on Jan 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for a1c004e - Browse repository at this point
Copy the full SHA a1c004eView commit details
Commits on Jan 19, 2024
-
Configuration menu - View commit details
-
Copy full SHA for fa7ebcc - Browse repository at this point
Copy the full SHA fa7ebccView commit details
Commits on Jan 20, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c3cdfff - Browse repository at this point
Copy the full SHA c3cdfffView commit details -
Configuration menu - View commit details
-
Copy full SHA for a9681fe - Browse repository at this point
Copy the full SHA a9681feView commit details
Commits on Jan 21, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1173f49 - Browse repository at this point
Copy the full SHA 1173f49View commit details -
Configuration menu - View commit details
-
Copy full SHA for 528da75 - Browse repository at this point
Copy the full SHA 528da75View commit details -
Configuration menu - View commit details
-
Copy full SHA for 52ae085 - Browse repository at this point
Copy the full SHA 52ae085View commit details -
Configuration menu - View commit details
-
Copy full SHA for b973258 - Browse repository at this point
Copy the full SHA b973258View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8cde449 - Browse repository at this point
Copy the full SHA 8cde449View commit details -
Configuration menu - View commit details
-
Copy full SHA for f31955f - Browse repository at this point
Copy the full SHA f31955fView commit details -
Configuration menu - View commit details
-
Copy full SHA for a4b6341 - Browse repository at this point
Copy the full SHA a4b6341View commit details -
Configuration menu - View commit details
-
Copy full SHA for 77d08f3 - Browse repository at this point
Copy the full SHA 77d08f3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 17720fa - Browse repository at this point
Copy the full SHA 17720faView commit details
Commits on Jan 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1446a12 - Browse repository at this point
Copy the full SHA 1446a12View commit details -
Configuration menu - View commit details
-
Copy full SHA for d917746 - Browse repository at this point
Copy the full SHA d917746View commit details -
Configuration menu - View commit details
-
Copy full SHA for 432ad04 - Browse repository at this point
Copy the full SHA 432ad04View commit details -
Configuration menu - View commit details
-
Copy full SHA for 40ea8cd - Browse repository at this point
Copy the full SHA 40ea8cdView commit details -
Configuration menu - View commit details
-
Copy full SHA for f9ca5dc - Browse repository at this point
Copy the full SHA f9ca5dcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6fea843 - Browse repository at this point
Copy the full SHA 6fea843View commit details
Commits on Jan 28, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b3dd7d9 - Browse repository at this point
Copy the full SHA b3dd7d9View commit details -
metal : move output into local memory + optimize
- the result from each simdgroup now stays in the registers - significantly reduced SRAM usage - more efficient skipping of -INF blocks - avoid simdgroup barrier in hot loop - add comments
Configuration menu - View commit details
-
Copy full SHA for 77f6976 - Browse repository at this point
Copy the full SHA 77f6976View commit details -
Configuration menu - View commit details
-
Copy full SHA for ecc466a - Browse repository at this point
Copy the full SHA ecc466aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3a428a1 - Browse repository at this point
Copy the full SHA 3a428a1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8612864 - Browse repository at this point
Copy the full SHA 8612864View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0ad44ba - Browse repository at this point
Copy the full SHA 0ad44baView commit details -
Configuration menu - View commit details
-
Copy full SHA for 134c81c - Browse repository at this point
Copy the full SHA 134c81cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1db22d7 - Browse repository at this point
Copy the full SHA 1db22d7View commit details
Commits on Jan 29, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 4794821 - Browse repository at this point
Copy the full SHA 4794821View commit details -
Configuration menu - View commit details
-
Copy full SHA for abeaf0d - Browse repository at this point
Copy the full SHA abeaf0dView commit details -
Configuration menu - View commit details
-
Copy full SHA for c6c1132 - Browse repository at this point
Copy the full SHA c6c1132View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5fcb9c1 - Browse repository at this point
Copy the full SHA 5fcb9c1View commit details
Commits on Jan 30, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d073e4f - Browse repository at this point
Copy the full SHA d073e4fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 78df552 - Browse repository at this point
Copy the full SHA 78df552View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3d03bcb - Browse repository at this point
Copy the full SHA 3d03bcbView commit details
Commits on Jan 31, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2ddc9bb - Browse repository at this point
Copy the full SHA 2ddc9bbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8ad92dc - Browse repository at this point
Copy the full SHA 8ad92dcView commit details
Commits on Feb 1, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 910b15b - Browse repository at this point
Copy the full SHA 910b15bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2e46013 - Browse repository at this point
Copy the full SHA 2e46013View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5a19a9f - Browse repository at this point
Copy the full SHA 5a19a9fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 41d136b - Browse repository at this point
Copy the full SHA 41d136bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 56e45a2 - Browse repository at this point
Copy the full SHA 56e45a2View commit details -
Configuration menu - View commit details
-
Copy full SHA for cda5a60 - Browse repository at this point
Copy the full SHA cda5a60View commit details -
Configuration menu - View commit details
-
Copy full SHA for c6769b9 - Browse repository at this point
Copy the full SHA c6769b9View commit details -
Configuration menu - View commit details
-
Copy full SHA for db1f3c4 - Browse repository at this point
Copy the full SHA db1f3c4View commit details
Commits on Feb 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 12eaa22 - Browse repository at this point
Copy the full SHA 12eaa22View commit details -
Configuration menu - View commit details
-
Copy full SHA for b68a112 - Browse repository at this point
Copy the full SHA b68a112View commit details
Commits on Feb 3, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b150abe - Browse repository at this point
Copy the full SHA b150abeView commit details -
cuda : use int instead of int64_t
Noticeably improves performance (thanks to Johannes)
Configuration menu - View commit details
-
Copy full SHA for 7c34655 - Browse repository at this point
Copy the full SHA 7c34655View commit details -
cuda : make loops use the same loop values
Thanks Johannes again for the tip
Configuration menu - View commit details
-
Copy full SHA for 1f8a592 - Browse repository at this point
Copy the full SHA 1f8a592View commit details -
Configuration menu - View commit details
-
Copy full SHA for 92472ea - Browse repository at this point
Copy the full SHA 92472eaView commit details -
Configuration menu - View commit details
-
Copy full SHA for c51f27c - Browse repository at this point
Copy the full SHA c51f27cView commit details -
Configuration menu - View commit details
-
Copy full SHA for b958151 - Browse repository at this point
Copy the full SHA b958151View commit details -
Configuration menu - View commit details
-
Copy full SHA for a7b4715 - Browse repository at this point
Copy the full SHA a7b4715View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3b1c4e7 - Browse repository at this point
Copy the full SHA 3b1c4e7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5b263dd - Browse repository at this point
Copy the full SHA 5b263ddView commit details -
Configuration menu - View commit details
-
Copy full SHA for e04ff39 - Browse repository at this point
Copy the full SHA e04ff39View commit details -
Configuration menu - View commit details
-
Copy full SHA for cfd9732 - Browse repository at this point
Copy the full SHA cfd9732View commit details -
Configuration menu - View commit details
-
Copy full SHA for ef68fac - Browse repository at this point
Copy the full SHA ef68facView commit details
Commits on Feb 4, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1846e92 - Browse repository at this point
Copy the full SHA 1846e92View commit details
Commits on Feb 12, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 6875997 - Browse repository at this point
Copy the full SHA 6875997View commit details
Commits on Feb 19, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 31109ca - Browse repository at this point
Copy the full SHA 31109caView commit details -
Configuration menu - View commit details
-
Copy full SHA for f249c99 - Browse repository at this point
Copy the full SHA f249c99View commit details
Commits on Mar 3, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 02a645e - Browse repository at this point
Copy the full SHA 02a645eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6aefd11 - Browse repository at this point
Copy the full SHA 6aefd11View commit details
Commits on Mar 4, 2024
-
Configuration menu - View commit details
-
Copy full SHA for e307882 - Browse repository at this point
Copy the full SHA e307882View commit details -
Configuration menu - View commit details
-
Copy full SHA for 58c7f61 - Browse repository at this point
Copy the full SHA 58c7f61View commit details
Commits on Mar 22, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 9495d39 - Browse repository at this point
Copy the full SHA 9495d39View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3a468e6 - Browse repository at this point
Copy the full SHA 3a468e6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0953212 - Browse repository at this point
Copy the full SHA 0953212View commit details
Commits on Mar 24, 2024
-
Configuration menu - View commit details
-
Copy full SHA for e425810 - Browse repository at this point
Copy the full SHA e425810View commit details
Commits on Mar 27, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 013721d - Browse repository at this point
Copy the full SHA 013721dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6be02b5 - Browse repository at this point
Copy the full SHA 6be02b5View commit details
Commits on Mar 28, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 57c03b7 - Browse repository at this point
Copy the full SHA 57c03b7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3e318e7 - Browse repository at this point
Copy the full SHA 3e318e7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 08e69c5 - Browse repository at this point
Copy the full SHA 08e69c5View commit details
Commits on Apr 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 75aa7b4 - Browse repository at this point
Copy the full SHA 75aa7b4View commit details -
Configuration menu - View commit details
-
Copy full SHA for d59ac67 - Browse repository at this point
Copy the full SHA d59ac67View commit details -
Configuration menu - View commit details
-
Copy full SHA for 81da919 - Browse repository at this point
Copy the full SHA 81da919View commit details -
Configuration menu - View commit details
-
Copy full SHA for 269374e - Browse repository at this point
Copy the full SHA 269374eView commit details -
Configuration menu - View commit details
-
Copy full SHA for cca6d02 - Browse repository at this point
Copy the full SHA cca6d02View commit details -
Configuration menu - View commit details
-
Copy full SHA for 68d793b - Browse repository at this point
Copy the full SHA 68d793bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3f777ac - Browse repository at this point
Copy the full SHA 3f777acView commit details -
Configuration menu - View commit details
-
Copy full SHA for e1ecd3b - Browse repository at this point
Copy the full SHA e1ecd3bView commit details -
Configuration menu - View commit details
-
Copy full SHA for bb0d51a - Browse repository at this point
Copy the full SHA bb0d51aView commit details -
Configuration menu - View commit details
-
Copy full SHA for c63dfdf - Browse repository at this point
Copy the full SHA c63dfdfView commit details -
Configuration menu - View commit details
-
Copy full SHA for ee19a4a - Browse repository at this point
Copy the full SHA ee19a4aView commit details
Commits on Apr 5, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 89961de - Browse repository at this point
Copy the full SHA 89961deView commit details
Commits on Apr 17, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2c41180 - Browse repository at this point
Copy the full SHA 2c41180View commit details -
Configuration menu - View commit details
-
Copy full SHA for 599ce84 - Browse repository at this point
Copy the full SHA 599ce84View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4053857 - Browse repository at this point
Copy the full SHA 4053857View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5668c79 - Browse repository at this point
Copy the full SHA 5668c79View commit details
Commits on Apr 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 34f93bb - Browse repository at this point
Copy the full SHA 34f93bbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6a3b842 - Browse repository at this point
Copy the full SHA 6a3b842View commit details -
Configuration menu - View commit details
-
Copy full SHA for ef9e159 - Browse repository at this point
Copy the full SHA ef9e159View commit details -
Configuration menu - View commit details
-
Copy full SHA for a5b0e2d - Browse repository at this point
Copy the full SHA a5b0e2dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0bc67dd - Browse repository at this point
Copy the full SHA 0bc67ddView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2f538b9 - Browse repository at this point
Copy the full SHA 2f538b9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 87968de - Browse repository at this point
Copy the full SHA 87968deView commit details -
Configuration menu - View commit details
-
Copy full SHA for 260cdb2 - Browse repository at this point
Copy the full SHA 260cdb2View commit details -
metal : add BS=1 kernel for flash attention (#6508)
* metal : add BS=1 kernel for flash attention (wip) * metal : support more than 1 warps * metal : opts * metal : opt * metal : switch to parallel reduce * metal : reduce registers * metal : simplify * metal : initial FA vec kernel
Configuration menu - View commit details
-
Copy full SHA for 105332c - Browse repository at this point
Copy the full SHA 105332cView commit details -
Configuration menu - View commit details
-
Copy full SHA for fa9e8c6 - Browse repository at this point
Copy the full SHA fa9e8c6View commit details -
Configuration menu - View commit details
-
Copy full SHA for c16a7c2 - Browse repository at this point
Copy the full SHA c16a7c2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9ca8698 - Browse repository at this point
Copy the full SHA 9ca8698View commit details
Commits on Apr 19, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 74d57f9 - Browse repository at this point
Copy the full SHA 74d57f9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1db66c1 - Browse repository at this point
Copy the full SHA 1db66c1View commit details -
Configuration menu - View commit details
-
Copy full SHA for e32b281 - Browse repository at this point
Copy the full SHA e32b281View commit details -
Configuration menu - View commit details
-
Copy full SHA for 703c6e6 - Browse repository at this point
Copy the full SHA 703c6e6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 97eaece - Browse repository at this point
Copy the full SHA 97eaeceView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1a88565 - Browse repository at this point
Copy the full SHA 1a88565View commit details -
Configuration menu - View commit details
-
Copy full SHA for bc34616 - Browse repository at this point
Copy the full SHA bc34616View commit details -
Configuration menu - View commit details
-
Copy full SHA for 29f6ad8 - Browse repository at this point
Copy the full SHA 29f6ad8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5294542 - Browse repository at this point
Copy the full SHA 5294542View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3badef1 - Browse repository at this point
Copy the full SHA 3badef1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 871fcb6 - Browse repository at this point
Copy the full SHA 871fcb6View commit details
Commits on Apr 22, 2024
-
Configuration menu - View commit details
-
Copy full SHA for a39217d - Browse repository at this point
Copy the full SHA a39217dView commit details -
Configuration menu - View commit details
-
Copy full SHA for cb76d74 - Browse repository at this point
Copy the full SHA cb76d74View commit details -
Configuration menu - View commit details
-
Copy full SHA for c11d05f - Browse repository at this point
Copy the full SHA c11d05fView commit details -
Configuration menu - View commit details
-
Copy full SHA for f725ca9 - Browse repository at this point
Copy the full SHA f725ca9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5408d55 - Browse repository at this point
Copy the full SHA 5408d55View commit details -
Configuration menu - View commit details
-
Copy full SHA for c70bfd7 - Browse repository at this point
Copy the full SHA c70bfd7View commit details
Commits on Apr 23, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c129369 - Browse repository at this point
Copy the full SHA c129369View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3864eea - Browse repository at this point
Copy the full SHA 3864eeaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 78d363b - Browse repository at this point
Copy the full SHA 78d363bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 19e8982 - Browse repository at this point
Copy the full SHA 19e8982View commit details -
Configuration menu - View commit details
-
Copy full SHA for 56657e5 - Browse repository at this point
Copy the full SHA 56657e5View commit details -
Configuration menu - View commit details
-
Copy full SHA for d228bf8 - Browse repository at this point
Copy the full SHA d228bf8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 751591d - Browse repository at this point
Copy the full SHA 751591dView commit details
Commits on Apr 24, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 8937ec5 - Browse repository at this point
Copy the full SHA 8937ec5View commit details -
Configuration menu - View commit details
-
Copy full SHA for ce281b9 - Browse repository at this point
Copy the full SHA ce281b9View commit details
Commits on Apr 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1f77f49 - Browse repository at this point
Copy the full SHA 1f77f49View commit details -
Configuration menu - View commit details
-
Copy full SHA for ff2c64a - Browse repository at this point
Copy the full SHA ff2c64aView commit details -
Configuration menu - View commit details
-
Copy full SHA for cb3547a - Browse repository at this point
Copy the full SHA cb3547aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1fd5bc3 - Browse repository at this point
Copy the full SHA 1fd5bc3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 09d0381 - Browse repository at this point
Copy the full SHA 09d0381View commit details -
Configuration menu - View commit details
-
Copy full SHA for ac1c6d9 - Browse repository at this point
Copy the full SHA ac1c6d9View commit details -
Configuration menu - View commit details
-
Copy full SHA for c225609 - Browse repository at this point
Copy the full SHA c225609View commit details -
Configuration menu - View commit details
-
Copy full SHA for bab346b - Browse repository at this point
Copy the full SHA bab346bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0fc5c5e - Browse repository at this point
Copy the full SHA 0fc5c5eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1e590ac - Browse repository at this point
Copy the full SHA 1e590acView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4f4c024 - Browse repository at this point
Copy the full SHA 4f4c024View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9e38760 - Browse repository at this point
Copy the full SHA 9e38760View commit details
Commits on Apr 29, 2024
-
Configuration menu - View commit details
-
Copy full SHA for a1616e9 - Browse repository at this point
Copy the full SHA a1616e9View commit details -
Configuration menu - View commit details
-
Copy full SHA for ca0275c - Browse repository at this point
Copy the full SHA ca0275cView commit details
Commits on Apr 30, 2024
-
Configuration menu - View commit details
-
Copy full SHA for e180fcd - Browse repository at this point
Copy the full SHA e180fcdView commit details -
Configuration menu - View commit details
-
Copy full SHA for c240ae2 - Browse repository at this point
Copy the full SHA c240ae2View commit details
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.