-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[LLM] Support fuse attention q, k, v weights #8202
Merged
Merged
Commits on Mar 18, 2024
-
1. add use-interface & fuse action
1.1. modify 1., code order 2. switch to name_mapping 3. solve tp branch 3.2 follow hui, handel qkv separately 3.3 handle pdparams 3.4 from torch 3.5 abandon low_cpu_mem_usage 3.6 solve shard branch
Configuration menu - View commit details
-
Copy full SHA for d859826 - Browse repository at this point
Copy the full SHA d859826View commit details -
Configuration menu - View commit details
-
Copy full SHA for f7b6973 - Browse repository at this point
Copy the full SHA f7b6973View commit details
Commits on Mar 19, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 05507a7 - Browse repository at this point
Copy the full SHA 05507a7View commit details
Commits on Mar 28, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b8b828b - Browse repository at this point
Copy the full SHA b8b828bView commit details
Commits on Mar 29, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 28ed30f - Browse repository at this point
Copy the full SHA 28ed30fView commit details
Commits on Apr 1, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1a1591a - Browse repository at this point
Copy the full SHA 1a1591aView commit details -
Configuration menu - View commit details
-
Copy full SHA for ef1fb18 - Browse repository at this point
Copy the full SHA ef1fb18View commit details
Commits on Apr 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for f56e602 - Browse repository at this point
Copy the full SHA f56e602View commit details -
Configuration menu - View commit details
-
Copy full SHA for 19faa26 - Browse repository at this point
Copy the full SHA 19faa26View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4a92dde - Browse repository at this point
Copy the full SHA 4a92ddeView commit details
Commits on Apr 9, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c0a71d2 - Browse repository at this point
Copy the full SHA c0a71d2View commit details
Commits on Apr 10, 2024
-
Configuration menu - View commit details
-
Copy full SHA for d1e7d17 - Browse repository at this point
Copy the full SHA d1e7d17View commit details -
Configuration menu - View commit details
-
Copy full SHA for 58a2c23 - Browse repository at this point
Copy the full SHA 58a2c23View commit details
Commits on Apr 11, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 7e1ab08 - Browse repository at this point
Copy the full SHA 7e1ab08View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0458f4e - Browse repository at this point
Copy the full SHA 0458f4eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 920155b - Browse repository at this point
Copy the full SHA 920155bView commit details -
Configuration menu - View commit details
-
Copy full SHA for a90fcfc - Browse repository at this point
Copy the full SHA a90fcfcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 774bb6f - Browse repository at this point
Copy the full SHA 774bb6fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 057198c - Browse repository at this point
Copy the full SHA 057198cView commit details
Commits on Apr 12, 2024
-
Configuration menu - View commit details
-
Copy full SHA for db78c6a - Browse repository at this point
Copy the full SHA db78c6aView commit details -
Configuration menu - View commit details
-
Copy full SHA for a1aa078 - Browse repository at this point
Copy the full SHA a1aa078View commit details -
Configuration menu - View commit details
-
Copy full SHA for f0d19f6 - Browse repository at this point
Copy the full SHA f0d19f6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 110983d - Browse repository at this point
Copy the full SHA 110983dView commit details
Commits on Apr 16, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 113b883 - Browse repository at this point
Copy the full SHA 113b883View commit details
Commits on Apr 17, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1fb0a7d - Browse repository at this point
Copy the full SHA 1fb0a7dView commit details -
Configuration menu - View commit details
-
Copy full SHA for a118f1b - Browse repository at this point
Copy the full SHA a118f1bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 11d0793 - Browse repository at this point
Copy the full SHA 11d0793View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0c98a45 - Browse repository at this point
Copy the full SHA 0c98a45View commit details
Commits on Apr 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for f6f3b0e - Browse repository at this point
Copy the full SHA f6f3b0eView commit details
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.