MLX-LLM.cpp is a C/C++ library for LLM inference, based on mlx-llm. It leverages MLX to run on Apple Silicon.
Supported models:

| Family | Models |
|---|---|
| LLaMA 2 | llama_2_7b_chat_hf |
| LLaMA 3 | llama_3_8b |
| TinyLLaMA | tiny_llama_1.1B_chat_v1.0 |
First, install MLX on your system:

```bash
git clone https://github.com/ml-explore/mlx.git mlx && cd mlx
mkdir -p build && cd build
cmake .. && make -j
make install
```
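To check that MLX installed correctly before building anything on top of it, you can compile a minimal program against it. This is a sketch using MLX's core C++ API (`array`, `add`, `eval`) as shown in the MLX quickstart; the `find_package(MLX ...)` package name comes from the CMake config MLX installs and may vary with your MLX version.

```cpp
// check_mlx.cpp -- minimal MLX sanity check
#include <iostream>
#include "mlx/mlx.h"

using namespace mlx::core;

int main() {
  array a = array({1.0f, 2.0f, 3.0f});
  array b = array({4.0f, 5.0f, 6.0f});
  array c = add(a, b);          // MLX is lazy: nothing is computed yet
  eval(c);                      // force evaluation on the default device
  std::cout << c << std::endl;  // expect array([5, 7, 9], dtype=float32)
  return 0;
}
```

A matching `CMakeLists.txt` only needs to find and link the installed package:

```cmake
cmake_minimum_required(VERSION 3.24)
project(check_mlx CXX)
set(CMAKE_CXX_STANDARD 17)  # MLX requires C++17 or newer
find_package(MLX CONFIG REQUIRED)
add_executable(check_mlx check_mlx.cpp)
target_link_libraries(check_mlx PRIVATE mlx)
```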
Clone the repository and its submodules:

```bash
git clone https://github.com/grorge123/mlx-llm.cpp.git
cd mlx-llm.cpp
git submodule update --init --recursive
```
Build the example:

```bash
mkdir build && cd build
cmake ..
cmake --build .
```
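The default configuration is typically unoptimized; for realistic performance, you may want CMake's standard Release mode (these are generic CMake options, not project-specific flags):

```bash
cmake .. -DCMAKE_BUILD_TYPE=Release
cmake --build . -j
```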
Refer to `example/main.cpp` for a simple demonstration using TinyLLaMA 1.1B.
Download the model weights and tokenizer:

```bash
mkdir tiny && cd tiny
wget https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0/resolve/main/model.safetensors
cd ..
wget https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0/resolve/main/tokenizer.json
```
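Assuming these commands are run from the repository root (adjust the paths if the example expects the files elsewhere), they produce this layout:

```text
mlx-llm.cpp/
├── tiny/
│   └── model.safetensors
└── tokenizer.json
```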
From the build directory, run:

```bash
./main
```
This generates text with the TinyLLaMA 1.1B model.
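If generation fails, you can check the downloaded checkpoint independently of the example by loading it with MLX directly. This is a sketch, assuming MLX's `load_safetensors` (declared in `mlx/io.h`, pulled in by `mlx/mlx.h`) returns a tensor map paired with string metadata, as in recent MLX versions; the path matches the download step above.

```cpp
// inspect_weights.cpp -- list the tensors in the downloaded checkpoint
// Assumes mlx::core::load_safetensors returns {name -> array} plus
// string metadata (check the declaration in your mlx/io.h).
#include <iostream>
#include "mlx/mlx.h"

using namespace mlx::core;

int main() {
  auto [weights, metadata] = load_safetensors("tiny/model.safetensors");
  std::cout << weights.size() << " tensors\n";
  for (const auto& [name, w] : weights) {
    std::cout << name << "  (" << w.size() << " elements)\n";
  }
  return 0;
}
```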