Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

January 2025 Update #1036

Open
wants to merge 6 commits into
base: master
Choose a base branch
from

Conversation

martindevans
Copy link
Member

Updated llama.cpp binaries to 0827b2c1da299805288abbd556d869318f2b121e.

This introduces new binaries. Previously there were 2 for each platform, e.g. ggml.dll, llama.dll for CUDA. Now there are more: ggml.dll, ggml-base.dll, ggml-cpu.dll, ggml-cuda.dll, llama.dll.

Currently these are handled in the same way as the old system - each platform has it's own set of completely independent binaries. In the future this should be modified to dynamically load backends. See more details here and here.

Testing:

  • Windows CPU
  • Windows CUDA
  • Windows Vulkan
  • Linux CPU
  • Linux CUDA
  • Linux Vulkan
  • MacOS CPU
  • MacOS Metal

@LSXAxeller
Copy link

LSXAxeller commented Jan 10, 2025

Thanks for your hard work. Any chance this can be updated with the latest binaries? Mainstream got QRWKV Hybrid, Phi-3.5-MoE, Deepseek V3, C4AI Command-R 7B models, INT-8 and BF16 implementations.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants