-
Notifications
You must be signed in to change notification settings - Fork 89
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Aggressive elementwise fusion #39
Comments
As of |
As of |
What happens if we do Instead we need to insert indexing and valid expressions in between each and every component of the equation, not just stack them at the beginning. |
As of Only remaining issue is mistral still outputs jibberish and runs slow for some reason when fusion is on. |
Ok mistral is finally fixed. Current fusion is slower than before (~15 tps vs 17 tps) but it is correct, unlike before. The reason for the slowdown is almost definitely due to huge index and valid expressions. This will be solved in #47 |
Currently the elementwise fusion is very conservative in what it fuses. It can be a lot more aggressive by:
It should be possible to fuse this test down to a single kernel:
The text was updated successfully, but these errors were encountered: