yla

Run your own AI chatbot locally with Ollama, ensuring data privacy and avoiding cloud breaches. This lightweight setup requires no frameworks like React - just Ollama, a simple HTTP server, and a browser.

Evaluate and experiment with LLMs freely; own your AI.

Features

  • Local AI Control: Select from multiple configured models
  • Privacy First: All processing happens on your machine
  • No Internet Dependencies: Works completely offline
  • Custom Personalities: Create specialized assistants with Modelfiles
  • Explore Different Parameters: Resend a message with different temperature, top_p, and top_k values
  • Expand with New Models on Ollama: When a new open-source model is published on the Ollama registry, you can download it and use it right away
  • Visualize Reasoning: The reasoning portion of a response is displayed when it is wrapped in <think> tags (DeepSeek's reasoning tokens); this handling can be extended or modified
  • MCP Integration: Execute bash commands with AI assistance and user approval
  • Secure Command Execution: All commands require explicit user approval with safety checks

Installation

1. Install Ollama

Linux:

curl -fsSL https://ollama.com/install.sh | sh

Windows:

Download from Ollama Windows Preview

2. Setup Base Models

# Start Ollama server locally
ollama serve

Ollama may already be running in the background, in which case the previous command will fail. Check whether it is reachable at the default URL, localhost:11434.
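
A quick way to verify is to query the local API; if Ollama is up, this returns the list of installed models:

curl http://localhost:11434/api/tags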

# Choose based on your hardware
ollama pull deepseek-r1:7b   # 7B parameter version
ollama pull deepseek-r1:32b  # 32B parameter version (needs more memory)

Check the list of available models in the Ollama model library.

Models can also be downloaded from the Yla interface, but you must specify the model name in the config file.

3. Launch Application

Standard Mode (Linux):

chmod +x utils/run-yla.sh
./utils/run-yla.sh

The default script uses Chromium. Change it if you want to use a different browser!

MCP Mode (Linux):

chmod +x utils/run-yla.sh
./utils/run-yla.sh --mcp

This starts YLA with MCP (Model Context Protocol) support enabled.

Windows:

ollama serve
python -m http.server 8000
# Open http://localhost:8000/src/yla.html in your browser

Startup Script Options

The utils/run-yla.sh script supports the following options:

# Start YLA without MCP (default)
./utils/run-yla.sh

# Start YLA with MCP support
./utils/run-yla.sh --mcp
./utils/run-yla.sh -m

# Show help
./utils/run-yla.sh --help
./utils/run-yla.sh -h

The script will (see the sketch after this list):

  • Start Ollama server automatically
  • Start Python HTTP server on port 8000
  • Start MCP HTTP Bridge on port 3001 (if MCP is enabled)
  • Open YLA in your browser
  • Clean up all services when you close the browser
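
For reference, here is a simplified sketch of what such a startup script does; the actual utils/run-yla.sh may differ in its details:

#!/usr/bin/env bash
# Simplified sketch of utils/run-yla.sh (illustrative, not the actual script)

ollama serve &                  # start the Ollama server
OLLAMA_PID=$!

python3 -m http.server 8000 &   # serve the frontend on port 8000
HTTP_PID=$!

if [ "$1" = "--mcp" ] || [ "$1" = "-m" ]; then
    npm run mcp-bridge &        # MCP HTTP Bridge on port 3001
    MCP_PID=$!
fi

# open YLA; this blocks until the browser window is closed
chromium "http://localhost:8000/src/yla.html"

# clean up all services once the browser exits
kill $OLLAMA_PID $HTTP_PID ${MCP_PID:-} 2>/dev/null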

MCP (Model Context Protocol) Integration

YLA supports MCP integration, allowing AI models to execute terminal commands with user approval. This enables powerful capabilities like file system exploration, system information gathering, package management, and process monitoring.

MCP Architecture

User → YLA Web Interface → Ollama API → AI Model
                                    ↓
                              MCP HTTP Bridge → MCP Server → Bash Commands

MCP Components

  1. MCP Server (src/mcp-server.js) - Handles command execution and approval workflow
  2. MCP Client (src/mcp-client.js) - Communicates with the MCP server
  3. HTTP Bridge (src/mcp-http-bridge.js) - Provides HTTP API for frontend communication
  4. HTTP Client (src/mcp-http-client.js) - Frontend client that communicates via HTTP
  5. MCP UI (src/mcp-ui.js) - Provides user interface for command approval

MCP Setup

1. Install Dependencies

npm install

2. Start the MCP HTTP Bridge

npm run mcp-bridge

This starts the HTTP bridge server on port 3001.
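
To confirm the bridge is up, you can query its health endpoint (documented in the API section below):

curl http://localhost:3001/health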

3. Start Your Frontend

npm run dev

This starts a simple HTTP server for your frontend; then open http://localhost:8000/src/yla.html in your browser.

How MCP Works

  1. When the AI assistant wants to execute a command, it calls the execute_bash_command tool
  2. The command is stored as "pending" and requires user approval
  3. A notification appears in the UI showing the pending command
  4. User can approve or deny the command
  5. If approved, the command executes and results are returned to the AI

Available MCP Tools

execute_bash_command

Executes a bash command with user approval.

Parameters:

  • command (string, required): The bash command to execute
  • description (string, optional): Description of what the command does

approve_command

Approves and executes a pending command.

Parameters:

  • commandId (string, required): The ID of the command to approve

deny_command

Denies and removes a pending command.

Parameters:

  • commandId (string, required): The ID of the command to deny

list_pending_commands

Lists all pending commands that need approval.

clear_pending_commands

Clears all pending commands.

MCP HTTP API Endpoints

Health Check

GET /health

Returns the status of the MCP bridge and whether MCP is initialized.

Get Available Tools

GET /tools

Returns a list of available MCP tools.

Execute Command

POST /execute-command
Content-Type: application/json

{
  "command": "echo 'Hello World'",
  "description": "Test command"
}

Creates a pending command that requires approval.

Approve Command

POST /approve-command
Content-Type: application/json

{
  "commandId": "1234567890"
}

Approves and executes a pending command.

Deny Command

POST /deny-command
Content-Type: application/json

{
  "commandId": "1234567890"
}

Denies and removes a pending command.

Get Pending Commands

GET /pending-commands

Returns a list of all pending commands.

Clear All Pending Commands

POST /clear-all-pending-commands

Clears all pending commands.
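
Putting these endpoints together, a complete approval cycle from the shell looks like this (the commandId value is illustrative; use the ID returned by /pending-commands):

# create a pending command
curl -X POST http://localhost:3001/execute-command \
  -H "Content-Type: application/json" \
  -d '{"command": "df -h", "description": "Check disk usage"}'

# inspect what is waiting for approval
curl http://localhost:3001/pending-commands

# approve it (substitute the real ID)
curl -X POST http://localhost:3001/approve-command \
  -H "Content-Type: application/json" \
  -d '{"commandId": "1234567890"}'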

MCP Usage

For Users

  1. View Pending Commands: Click the checkmark button in the header to see pending commands
  2. Approve Commands: Click "Approve" to execute a command
  3. Deny Commands: Click "Deny" to reject a command
  4. Monitor Activity: The UI automatically shows when new commands are pending

For AI Assistants

The AI can suggest and execute commands such as:

  • "Let me check your system information" → uname -a
  • "I'll help you list your files" → ls -la
  • "Let me check your disk usage" → df -h

MCP Security Features

  • User Consent: All commands require explicit approval
  • Command Validation: Dangerous commands are blocked
  • Command Timeout: Commands time out after 30 seconds
  • Shell Restriction: Commands run in /bin/bash with limited privileges (a sketch of both follows this list)
  • Error Handling: Failed commands are reported back to the user
  • HTTP Bridge: Frontend communicates via HTTP, not direct SDK access
  • Denied Command Persistence: Denied commands are remembered to prevent reappearance
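
Conceptually, an approved command runs under the constraints above, roughly like this (a sketch only; APPROVED_COMMAND is a placeholder, and the actual logic lives in src/mcp-server.js):

# enforce the 30-second timeout and the /bin/bash shell (illustrative)
timeout 30 /bin/bash -c "$APPROVED_COMMAND"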

Supported Commands

The MCP integration supports most bash commands, including:

  • File Operations: ls, cat, head, tail, find, grep
  • System Info: uname, df, free, ps, top
  • Network: ping, curl, wget, netstat
  • Package Management: apt, dpkg, snap
  • Process Management: kill, pkill, systemctl

Blocked Commands

For security reasons, the following commands are blocked (a sketch of the check follows the list):

  • rm -rf (recursive force delete)
  • dd if= (disk operations)
  • mkfs (filesystem operations)
  • fdisk (partition operations)
  • Fork bombs and other dangerous patterns
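
As a rough illustration of the blocklist idea (the real validation lives in src/mcp-server.js and is more thorough):

# reject commands matching known-dangerous patterns (illustrative, not exhaustive)
case "$cmd" in
  *"rm -rf"*|*"dd if="*|*"mkfs"*|*"fdisk"*)
    echo "Command blocked for safety" >&2
    exit 1
    ;;
esac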

MCP Testing

Test MCP Server Directly

npm run test-mcp

Test HTTP Bridge

npm run mcp-bridge
# Then use the curl commands shown above

Test Frontend Integration

# Start HTTP bridge
npm run mcp-bridge

# In another terminal, start frontend server
npm run dev

# Open tests/test-frontend.html in browser

Test Command Detection

Open tests/test-command-detection.html in your browser to test command detection patterns.

Test MCP Integration

Open tests/test-mcp-integration.html in your browser to test the complete MCP workflow.

MCP Troubleshooting

MCP Server Not Starting

# Check if port 3001 is available
netstat -tuln | grep 3001

# Kill any existing process on port 3001
sudo lsof -ti:3001 | xargs kill -9

# Restart the MCP server
node start-mcp.js

Model Not Responding

  1. Check Ollama: Ensure Ollama is running

    curl http://localhost:11434/api/tags
  2. Check Model: Ensure your MCP-enabled model is available

    ollama list

Commands Not Executing

  1. Check MCP Status: Verify MCP server is running

    curl http://localhost:3001/health
  2. Check Tools: Verify tools are available

    curl http://localhost:3001/tools
  3. Check Permissions: Ensure the MCP server has proper permissions

Custom Model Configuration

Create and use specialized models using Ollama's Modelfiles:

  1. Create a Modelfile in the utils/custom_models directory:
# Example technical-assistant.txt
FROM deepseek-r1:7b
SYSTEM """
You are an expert technical assistant. 
Respond in markdown format with detailed explanations.
"""
PARAMETER num_ctx 32768
  2. Build your custom model:
ollama create my-expert -f ./utils/custom_models/technical-assistant.txt
  3. Add it to config.js:
models: [
    {
        name: "my-expert:latest",
        description: "smart coding assistant",
        num_ctx: 32768,
        temperature: 0.7,
        top_k: 40,
        top_p: 0.9,
        systemMessage: "You are an expert technical assistant. Respond in markdown format with detailed explanations.",
        size: "4.1GB"
    }
]
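
After these steps, the new model should appear in Ollama's list under the same name used in the config ("my-expert:latest"):

ollama list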

Configuration Reference

  • name: Ollama model name (exact match required; include the version tag for custom models). Example: "my-expert:latest"
  • description: Brief description (optional). Example: "smart coding assistant"
  • size: Size of the model, used only to display the information on the model panel. Example: "4.1GB"
  • num_ctx: Context window size in tokens. Example: 32768
  • systemMessage: Hidden behavior instructions. For custom models it does not change the model itself; it is used to display the SYSTEM message defined at creation time. Example: "You are an expert technical assistant. Respond in markdown format with detailed explanations."
  • temperature: Response creativity. Example: 0.3-1.8

For custom models, remember to include the version tag (:latest by default) in the config.

Configuration Example

// config.js
const config = {
    models: [
        {
            name: "Yla:latest",          // Must match Ollama model name
            num_ctx: 65536,         // Context window size
            temperature: 0.7,       // 0-2 (0=precise, 2=creative)
            top_k: 40,              // 1-100
            top_p: 0.9,              // 0-1
            systemMessage: "Friendly general assistant",
            size: "4.1GB"
        }
    ],
    
    api: {
        endpoint: "http://localhost:11434/v1/chat/completions",
        available_models: "http://localhost:11434/v1/models",
    }
};

Model Management

  • Validation: App checks installed models on launch
  • Selection: Choose model from initial carousel
  • Download: If a configured model is not present on the local Ollama server, Yla will try to pull it from the Ollama registry; downloads work only for valid model names
  • Persistence: Selected model remembered until page refresh
  • Visual Feedback:
    • ✅ Available models - full color, clickable
    • ⚠️ Missing models - grayed out with a warning and an option to download

Note: Model names in config.js must exactly match Ollama model names. Use ollama list to verify installed models.

Testing

YLA includes comprehensive test suites located in the tests/ directory:

  • test-command-detection.html: Tests command detection patterns
  • test-mcp-integration.html: Tests MCP integration workflow
  • test-frontend.html: Tests frontend functionality
  • test-thinking-streaming.html: Tests thinking/streaming features
  • test-thinking-brackets.html: Tests thinking bracket parsing
  • test-current-state.html: Tests current application state
  • test-mcp.js: Node.js MCP server tests

To run tests:

  1. Start the MCP bridge: npm run mcp-bridge
  2. Open test files in your browser
  3. Follow the test instructions

Troubleshooting

Model Not Found:

  1. Verify model name matches Ollama's list
  2. Check Ollama is running: ollama serve
  3. Ensure model files are in the utils/custom_models directory

Performance Issues:

  • Reduce num_ctx for smaller context windows
  • Use smaller model variants (e.g., 7B instead of 32B)
  • Close other memory-intensive applications

MCP Issues:

  • Check if MCP server is running on port 3001
  • Verify model supports MCP tools
  • Check browser console for error messages
  • Use test files in tests/ directory to isolate issues
