-
Notifications
You must be signed in to change notification settings - Fork 26.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Gemma2: fix FA2 generation #32553
Gemma2: fix FA2 generation #32553
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks, I don't remember where this is from but we need a patch!
Yes, I'll add link to this PR in the internal slack thread for patches :) |
Already added 😉 |
What does this PR do?
As reported in #32309 (comment) Gemma2 FA2 is broken in the latest release due to applying 4D mask when preparing inputs. FA2 expects a 2D mask so we can skip that step if FA2 is set
Ran the FA2 generate tests, everything passing, except for the Phi3 which is already fixed in another PR for patch release