🕉️ BramhaShiv AI

Smart multi-provider coding router for OpenCode

Classifies each coding task with Gemini Flash and routes it to the best-fit model from a 19-model catalog spanning OpenAI (GPT-5.x via ChatGPT OAuth), Google (Gemini), Anthropic (Claude), HuggingFace, and OpenRouter.

🪔 The name — Brahma (Hindu god of creation) + Shiva (the architect, destroyer of obstacles) — the AI that creates and shapes code with the right tool for each job.

✨ What it does

🎯 You type a coding task.
🧮 Gemini Flash reads it and emits six trait weights (long-context, deep-reasoning, tool-use-accuracy, speed, frontend-taste, cost-efficiency).
📊 Each catalog model is scored: score = Σ weight × model.score.
🚀 The top-ranked model runs your task via OpenCode's normal tool loop.
👀 OpenCode's own model indicator shows the final choice every turn.

📦 Scope

v0.2 (current): 5-layer adaptive routing — trait-based routing + workspace-aware classification + closed-loop learning + task decomposition planner + regeneration intelligence + provider health monitoring. Server plugin for OpenCode's chat.message hook, user-editable YAML catalog, env-var override, local telemetry, auto-filter of unauthed/depleted providers. All tunable via BRAMHASHIV_* env vars.

v1.0+ roadmap: TUI plugin with /model dialog picker + /route debug command, shared catalog registry, confidence prompts.

See docs/opencode-plugin-audit.md for the OpenCode plugin API findings that shaped v1's architecture.

🚀 Install

One-liner (recommended)

opencode plugin bramhashiv

OpenCode pulls the package from npm and auto-registers it in your global config. Done.

Manual via npm

# pick your favourite — bun, npm, pnpm, yarn all work
bun add bramhashiv
npm install bramhashiv
pnpm add bramhashiv

Then add to your OpenCode config at ~/.config/opencode/opencode.jsonc:

{
  "$schema": "https://opencode.ai/config.json",
  "plugin": ["bramhashiv"]
}

From source (development / contributing)

git clone https://github.com/fir3storm/bramhashiv-ai.git
cd bramhashiv-ai
bun install
bun test          # 193 unit + harness + golden-set tests

Point OpenCode at your local clone:

{
  "plugin": [
    "file:///absolute/path/to/bramhashiv-ai/src/plugin-server.ts"
  ]
}

First-run behavior

On first activation, BramhaShiv copies config/default-catalog.yaml to ~/.config/bramhashiv/catalog.yaml — edit freely, hot-reloaded on save.

✅ Verify the plugin loaded:

opencode debug config --print-logs --log-level INFO

You should see service=plugin name=bramhashiv loading plugin (or path=...bramhashiv... for source installs).

📦 Package on npm: npmjs.com/package/bramhashiv

🔑 Classifier API key

The classifier auto-reads your Google API key from OpenCode's ~/.local/share/opencode/auth.json (set when you ran opencode providers login for Google). No extra setup needed if Google is already authed as type: api.

To override (e.g. for CI or a different key):

export GOOGLE_GENERATIVE_AI_API_KEY=<your-key>
# or
export GEMINI_API_KEY=<your-key>

Without any key, BramhaShiv falls back to neutral weights (still routes, but less informed).

🔐 Provider auth (one-time, per provider)

BramhaShiv only routes to providers OpenCode has credentials for. Run each you want:

opencode providers login -p anthropic     # OAuth — uses your Claude Max subscription
opencode providers login -p google        # paste a Gemini API key from https://ai.google.dev
opencode providers login -p huggingface   # paste an HF read token from https://huggingface.co/settings/tokens
opencode providers login -p openrouter    # paste an OpenRouter API key from https://openrouter.ai/keys

Stacking strategy (so you effectively never run out of credits):

Tier	Provider	Why it matters
1	OpenAI ChatGPT (Plus / Team / Pro via OAuth)	Subscription-backed; access GPT-5.x family without an API key. 5h reset cycle.
1	Anthropic (API key)	Pay-per-token but cheap (Sonnet ~$3/M input). Note: Claude Pro/Max subscriptions do NOT grant API access.
2	Google Gemini free	~1500 Flash + ~50 Pro requests/day, daily reset.
3	OpenRouter free	~200 req/day, daily reset. Aggregator: many models behind one key (Llama, DeepSeek-R1, Qwen, etc.).
4	HuggingFace free	Monthly quota; resets 1st of each month.

Each tier has an independent quota on a different reset cycle — three or four pools is enough redundancy that you rarely hit a wall during normal use. The router auto-skips providers without credentials and any model that returns 402/429/503/529 mid-session.

💻 Usage

Just use OpenCode normally. BramhaShiv hooks chat.message and rewrites the model on every user turn.

📌 Pinning a model

Two ways to pin the router to a specific model:

Env var (session-wide):

BRAMHASHIV_PIN=google/gemini-flash-latest opencode run "your task"

State file (persistent): edit ~/.config/bramhashiv/state.json:

{
  "pinned_model_id": "google/gemini-flash-latest",
  "last_label": null,
  "last_classifier": null,
  "last_route_debug": null
}

Clear the pin by setting pinned_model_id to null or unsetting the env var.

🎛️ Tuning the catalog

Open ~/.config/bramhashiv/catalog.yaml and adjust any model's scores (0–10 per trait). Hot-reloaded, no restart.

Verify model IDs against OpenCode's actual list:

opencode models openai
opencode models google
opencode models openrouter
opencode models huggingface
opencode models anthropic

If a model in your catalog isn't in OpenCode's list, dispatch will fail with ProviderModelNotFoundError. BramhaShiv exposes a pure validateCatalogModelIds() helper for tooling that wants to compare a catalog against provider model lists before runtime.

Route debugging

Every route decision is persisted to last_route_debug in ~/.config/bramhashiv/state.json. It includes the chosen model, ranked candidates, classifier weights, fallback state, and unavailable models. The handleRouteCommand() formatter can render this snapshot for a TUI /route command or any future wrapper that can read BramhaShiv state.

🧠 How routing works

BramhaShiv operates in 5 layers, each improving routing accuracy:

Workspace analysis — On activation, scans your project (up to 500 files) to detect languages, frontend/backend presence, tests, and git repo. Injected into all classification prompts for project-aware routing.
Classify — Gemini Flash reads your task + workspace summary and outputs six trait weights in 0..1.
Score — Each catalog model is dot-product scored against the trait weights, then adjusted by three real-time signals:
- Provider health: models with high error rates or slow latency get score penalties (up to 25%).
- Regeneration intelligence: if you've regenerated this same task before, the previous model's score is penalized (escalating 5%→30%).
- Learned adjustments: accumulated deltas from past task outcomes auto-tune the catalog base scores over time.
Dispatch — Top-ranked model runs the task. If rate-limited or unavailable, falls back through the ranking.
Learn — On task completion, the outcome (success, latency, regenerations) feeds back into the learning system and health monitor for future routing decisions.

🛡️ Availability filtering

Two layers keep the router from picking models you can't reach:

🔐 At activation — providers missing from ~/.local/share/opencode/auth.json are dropped. No more ProviderModelNotFoundError or auth-not-configured failures.
🚧 During the session — when a dispatch fails with ProviderAuthError or an APIError with status 402 / 429 / 503 / 529, the offending model is added to the session's unavailable set. Next turns route around it. Useful when HF free quota depletes mid-session or a provider rate-limits you.

Unavailable state is per OpenCode process — it resets when you restart. If you need it persistent, edit your catalog or use BRAMHASHIV_PIN.

🔧 Configuration

All tunable hyperparameters have sensible defaults and can be overridden via environment variables:

# Learning system
BRAMHASHIV_LEARNING_RATE=0.05       # how fast scores adjust per outcome (0-1)
BRAMHASHIV_LEARNING_DECAY=0.99      # decay factor for past adjustments
BRAMHASHIV_LEARNING_CAP=2.0         # max absolute adjustment per model+trait

# Health monitoring
BRAMHASHIV_HEALTH_DECAY=0.7         # EMA alpha for latency averaging
BRAMHASHIV_HEALTH_LATENCY_BASELINE=30000  # ms considered "slow"

# Regeneration tracking
BRAMHASHIV_SIMILARITY_THRESHOLD=0.7 # Jaccard threshold for task matching
BRAMHASHIV_REGENERATION_TTL_MS=3600000  # how long to remember regenerations

# Planner
BRAMHASHIV_PLANNER_TIMEOUT_MS=10000 # max time for task decomposition
BRAMHASHIV_COMPLEXITY_THRESHOLD=3   # heuristic score to trigger planning

# Classifier
BRAMHASHIV_CLASSIFIER_TIMEOUT_MS=8000  # Gemini Flash max wait

# Error tracking
BRAMHASHIV_ERROR_TTL_MODEL_MS=3600000   # per-model rate-limit TTL
BRAMHASHIV_ERROR_TTL_ACCOUNT_MS=43200000 # account-level quota TTL

# Workspace scanner
BRAMHASHIV_MAX_FILES_SCAN=500      # max files to scan for workspace analysis
BRAMHASHIV_MAX_DEPTH_SCAN=4        # max directory depth

🔧 Development

bun install
bun test            # 193 unit + harness + golden-set tests
bun run typecheck
BRAMHASHIV_SMOKE=1 bun test tests/providers.smoke.test.ts   # real Gemini call

🛣️ v2 Roadmap

🤖 Multi-turn orchestration — executing planner sub-tasks sequentially with different models.
❓ Confidence prompts — confirm with user when classifier is uncertain.
🌐 Shared catalog registry — community-tuned configs for different tech stacks.
📊 Learning dashboard — /stats command showing model performance over time.

👤 Author

Made by Abhirup Guha.

🪔 Proudly created in 🇮🇳, by an Indian — open-sourced for the betterment of the AI future.

🔒 Security

Your credentials never touch this repo. All API keys, OAuth tokens, and session data live in your home directory — ~/.local/share/opencode/auth.json and ~/.config/bramhashiv/. Cloning this repo gives you zero credentials; you must auth your own providers via opencode providers login. See SECURITY.md for the full threat model and a credential scan you can run against any clone.

📜 License

_{If this saved you time, ⭐ star it on GitHub.}

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
.github		.github
config		config
docs		docs
src		src
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
bun.lock		bun.lock
bunfig.toml		bunfig.toml
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🕉️ BramhaShiv AI

Smart multi-provider coding router for OpenCode

✨ What it does

📦 Scope

🚀 Install

One-liner (recommended)

Manual via npm

From source (development / contributing)

First-run behavior

🔑 Classifier API key

🔐 Provider auth (one-time, per provider)

💻 Usage

📌 Pinning a model

🎛️ Tuning the catalog

Route debugging

🧠 How routing works

🛡️ Availability filtering

🔧 Configuration

🔧 Development

🛣️ v2 Roadmap

👤 Author

🔒 Security

📜 License

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🕉️ BramhaShiv AI

Smart multi-provider coding router for OpenCode

✨ What it does

📦 Scope

🚀 Install

One-liner (recommended)

Manual via npm

From source (development / contributing)

First-run behavior

🔑 Classifier API key

🔐 Provider auth (one-time, per provider)

💻 Usage

📌 Pinning a model

🎛️ Tuning the catalog

Route debugging

🧠 How routing works

🛡️ Availability filtering

🔧 Configuration

🔧 Development

🛣️ v2 Roadmap

👤 Author

🔒 Security

📜 License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages