A simple API server on top of your favorite locally runnable foundation models. We support:
- llama.cpp-compatible models, including k-quant models
- ARM NEON CPUs and BLAS (CUDA coming soon)
- Whisper transcription models (coming soon)
- Visual models for object detection and segmentation (coming soon)
- LLM completion
- Transcription (coming soon)
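Once the server is running, an LLM completion could be requested over HTTP. Below is a minimal sketch in Python of what such a request body might look like, assuming a JSON API; the field names (`prompt`, `max_tokens`) and the endpoint URL in the comment are illustrative assumptions, not documented here — check the server's actual API.

```python
import json

# Hypothetical completion request payload. The field names are
# assumptions for illustration only.
payload = {
    "prompt": "The capital of France is",  # text for the model to complete
    "max_tokens": 16,                      # assumed generation limit field
}
body = json.dumps(payload)

# The request itself could then be sent with any HTTP client, e.g.:
#   curl -X POST http://localhost:8080/completions -d "$body"
print(body)
```

The same request shape would apply to a future transcription endpoint, with an audio payload in place of the text prompt.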