# llama-cpp-swift

Swift bindings for llama.cpp that let you run compatible LLM models directly on your device.

## Features

- Lightweight and easy to use
- Works on macOS and Linux
- Supports streaming via structured concurrency
- Swift 6 ready!

## TODO

- Unit tests
- Model downloads from URL and HuggingFace

## How to install

Add the dependency via Swift Package Manager:

```swift
.package(url: "https://github.com/srgtuszy/llama-cpp-swift", branch: "main")
```
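
For context, here is a minimal `Package.swift` sketch showing where that dependency line goes. The product name (`LLama`) and target names are assumptions for illustration; check the package's own manifest for the exact product name.

```swift
// swift-tools-version:5.9
// Minimal sketch of a consuming package manifest (names are illustrative).
import PackageDescription

let package = Package(
    name: "MyApp",
    platforms: [.macOS(.v14)],
    dependencies: [
        // The dependency line from above
        .package(url: "https://github.com/srgtuszy/llama-cpp-swift", branch: "main")
    ],
    targets: [
        .executableTarget(
            name: "MyApp",
            dependencies: [
                // Product name is assumed; adjust to match the package manifest.
                .product(name: "LLama", package: "llama-cpp-swift")
            ]
        )
    ]
)
```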

## How to use

Here's a quick example of how to use it. For more, see the example app in the `example/` folder.

```swift
// Initialize model
let model = try Model(modelPath: "<model path>")
let llama = try LLama(model: model)

// Results are delivered through an `AsyncStream`
let prompt = "what is the meaning of life?"
for try await token in await llama.infer(prompt: prompt, maxTokens: 1024) {
    print(token, terminator: "")
}
```