-
Notifications
You must be signed in to change notification settings - Fork 9.7k
Issues: ggerganov/llama.cpp
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weβll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
ggml : add WebGPU backend
help wanted
Extra attention is needed
research π¬
#7773
opened Jun 5, 2024 by
ggerganov
ggml : add DirectML backend
help wanted
Extra attention is needed
research π¬
#7772
opened Jun 5, 2024 by
ggerganov
metal : compile-time kernel args and params
performance
Speed related topics
research π¬
#4085
opened Nov 15, 2023 by
ggerganov
Layer skipping/self-speculation demo
demo
Demonstrate some concept or idea, not intended to be merged
research π¬
#3565
opened Oct 10, 2023 by
KerfuffleV2
•
Draft
llama : combined beam search + grammar sampling strategy
generation quality
Quality of model output
good first issue
Good for newcomers
research π¬
#2923
opened Aug 31, 2023 by
ggerganov
mpi : attempt inference of 65B LLaMA on a cluster of Raspberry Pis
hardware
Hardware related
help wanted
Extra attention is needed
research π¬
π¦.
llama
#2164
opened Jul 10, 2023 by
ggerganov
[IDEA] Global token enhancement/depression
help wanted
Extra attention is needed
research π¬
#1865
opened Jun 15, 2023 by
elephantpanda
Added Arbitrary mixed quantization
Less than 4 bits
Efforts related to viable quantized models using <4 bits
research π¬
#1834
opened Jun 13, 2023 by
Milkdrop
Loadingβ¦
Q4_0 scale selection using RMSE
enhancement
New feature or request
Less than 4 bits
Efforts related to viable quantized models using <4 bits
research π¬
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
Study how LM Evaluation Harness works and try to implement it
enhancement
New feature or request
generation quality
Quality of model output
help wanted
Extra attention is needed
high priority
Very important issue
research π¬
#231
opened Mar 17, 2023 by
ggerganov
ProTip!
Mix and match filters to narrow down what youβre looking for.