

examples : add configuration presets
#10932 · ggerganov opened on Dec 21, 2024
3
changelog : libllama API
#9289 · ggerganov opened on Sep 3, 2024
5
changelog : llama-server REST API
#9291 · ggerganov opened on Sep 3, 2024
12


Misc. bug: The KV cache is sometimes truncated incorrectly when making v1/chat/completions API calls
#11970 · vnicolici opened on Feb 20, 2025

Misc. bug: Problems with official jinja templates (Gemma 2, Llama 3.2, Qwen 2.5)
#11866 · MoonRide303 opened on Feb 14, 2025

Eval bug: trivial grammar crashes (DeepSeek R1 Distill Llama 8B)
#11591 · ochafik opened on Feb 2, 2025

Eval bug: Error running multiple contexts from multiple threads at the same time with Vulkan
#11371 · charlesrwest opened on Jan 23, 2025

Eval bug: segfault on Alpine linux docker image
#11308 · pepijndevos opened on Jan 20, 2025

Eval bug: Crash with filesystem error when run while in a directory containing files with certain names
#11198 · ScarletEmerald opened on Jan 11, 2025

Compile bug: Emulated Linux ARM64 CPU build fails
#10933 · SamuelTallet opened on Dec 21, 2024

Misc. bug: Version string incomplete on Windows CUDA build
#10727 · CentricStorm opened on Dec 9, 2024

Misc. bug: softmax may get error answer when src0->ne[3]!=1 on cuda
#10683 · A3shTnT opened on Dec 6, 2024

Misc. bug: interface for model quantization is not fully C-compatible
#10614 · JohannesGaessler opened on Dec 1, 2024

Misc. bug: inconsistent locale for printing GGUF kv data across examples
#10613 · JohannesGaessler opened on Dec 1, 2024

Misc. bug: -sm row does not work with --device
#10533 · mostlygeek opened on Nov 26, 2024