Docs - Website - Demo video - Self-hosted deploy
Skald gives you a production-ready RAG in minutes through a plug-and-play API, and then let's you configure your RAG engine exactly to your needs.
In the ingestion phase, Skald takes care of document parsing, chunking strategy, summaries, tagging, embedding generation, and vector storage. In the retrieval phase, it handles query rewriting, vector search, LLM chat, chat history, and source references.
Our solid defaults will work for most use cases, but you can tune every part of your RAG to better suit your needs. That means configurable vector search params, reranking, models, query rewriting, chunking (soon), and more.
Ship, configure, and evaluate performance directly inside Skald.
Node SDK example
import { Skald } from '@skald-labs/skald-node'
const skald = new Skald('your-api-key-here')
await skald.createMemo({
title: 'Meeting Notes',
content: 'Full content of the memo...',
})
const chatRes = await skald.chat({
query: 'What were the main points discussed in the Q1 meeting?',
rag_config: { reranking: { enabled: true } },
})
console.log(chatRes.response)Python - Ruby - Go - PHP - C# - MCP - CLI
git clone https://github.com/skaldlabs/skald
cd skald
echo "OPENAI_API_KEY=<your_key>" > .env
docker-compose upFor a production self-hosted deploy, check out our self-hosting docs.
Running Skald without any third-party services
You can deploy Skald without any third-party dependencies (including OpenAI), but that will require hosting your own LLM inference server and using a local embeddings service (we've provided one for you in the local docker compose profile). This is advanced usage and you should check out our docs for more details. You can also ask us questions on Slack.
- Chat: Chat with your knowledge in Skald with just one API call
- Search: Use semantic search to find relevant context based on user queries
- Turnkey configuration: Get started in minutes with great defaults and then configure your RAG engine exactly how you want it.
- Evaluate: Evaluate the performance of your custom RAG engine with our built-in evaluation tools.
- Powerful filtering: Speed up and improve responses by filtering the accessible knowledge in every query.
- Amazing DX, no bullsh*t: Implement in minutes with SDKs for every major language. Don't see yours? Open an issue and we'll build it!
- Truly open source: Our open source version is fully-featured, easy to deploy, and can even run with no third-party dependencies.
- Cloud: free tier with no credit card required
- Self-hosted: get a fully-featured production deploy with SSL live in less than an hour
MIT 🤸
We'd be glad to have your contributions! See CONTRIBUTING.md for instructions on how to run Skald locally and how to contribute.
Feel free to join our Slack Community
List of people who have contributed to any of our repos.










