https://github.com/turboderp/exllama
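ExLlama
A rewrite of the HF transformers implementation of Llama, with the following goals:
Designed for use with quantized weights
Fast and memory-efficient inference (not just attention)
Mapping across multiple devices
Built-in (multi) LoRA support
Companion library of funky sampling functions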