Open
Description
We should implement the `enable_gqa` option, following the behavior documented for `torch.nn.functional.scaled_dot_product_attention`: https://pytorch.org/docs/main/generated/torch.nn.functional.scaled_dot_product_attention.html
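For reference, here is a rough pure-Python sketch of the semantics as I understand them from that page: with `enable_gqa` set, each key/value head is repeated along the head dimension (a `repeat_interleave` by `Hq // Hkv`) so that consecutive groups of query heads share one key/value head. The names `sdpa_gqa`, `repeat_interleave`, and `attention_single_head` are hypothetical helpers for illustration, not part of any library, and the sketch assumes unbatched nested lists with no mask, dropout, or causal option.

```python
import math


def repeat_interleave(heads, repeats):
    """Repeat each head `repeats` times in place: [a, b] -> [a, a, b, b]."""
    return [head for head in heads for _ in range(repeats)]


def attention_single_head(q, k, v):
    """Plain softmax(q k^T / sqrt(E)) v for one head.

    q: [L][E], k: [S][E], v: [S][Ev], all nested lists of floats.
    """
    scale = 1.0 / math.sqrt(len(q[0]))
    out = []
    for q_row in q:
        scores = [scale * sum(a * b for a, b in zip(q_row, k_row)) for k_row in k]
        top = max(scores)  # subtract the max for numerical stability
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        out.append([
            sum(w * v_row[j] for w, v_row in zip(weights, v)) / total
            for j in range(len(v[0]))
        ])
    return out


def sdpa_gqa(query, key, value, enable_gqa=False):
    """Hypothetical sketch of attention with grouped-query support.

    query: [Hq][L][E]; key: [Hkv][S][E]; value: [Hkv][S][Ev].
    Assumption: Hq must be a multiple of Hkv when enable_gqa is True.
    """
    hq, hkv = len(query), len(key)
    if len(value) != hkv:
        raise ValueError("key and value must have the same number of heads")
    if enable_gqa:
        if hq % hkv != 0:
            raise ValueError("query heads must be divisible by key/value heads")
        key = repeat_interleave(key, hq // hkv)
        value = repeat_interleave(value, hq // hkv)
    elif hq != hkv:
        raise ValueError("head counts differ; pass enable_gqa=True")
    return [attention_single_head(q, k, v) for q, k, v in zip(query, key, value)]


# 4 query heads sharing 2 key/value heads; one query and one key position each.
query = [[[1.0, 0.0]] for _ in range(4)]
key = [[[1.0, 0.0]], [[0.0, 1.0]]]
value = [[[10.0, 20.0]], [[30.0, 40.0]]]
result = sdpa_gqa(query, key, value, enable_gqa=True)
```

With a single key position the softmax weight is exactly 1, so each query head's output is just the value row of the key/value head it is grouped with. That makes the head mapping (query heads 0 and 1 to key/value head 0, query heads 2 and 3 to key/value head 1) easy to check against whatever implementation we end up with.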