Open
Description
opened on Jun 22, 2024
May I request an option in the UI to enable flash attention?
The model I am currently using produces nonsense output and requires flash attention to run.