Multi-provider AI proxy with usage dashboard supporting Anthropic, OpenAI, xAI, and OpenRouter models through OpenAI-compatible and Anthropic-compatible APIs.
Designed for self-hosted personal use - run your own instance to securely proxy AI requests using your API keys.
- 🤖 Multi-provider: Anthropic + OpenAI + xAI + OpenRouter models
- 🔄 Dual APIs: OpenAI-compatible + Anthropic-compatible endpoints
- 🔀 Cost-optimized routing: Automatic selection of cheapest provider for each model
- 💰 Usage tracking: Track token usage and costs with visual dashboard
- 🗄️ Database storage: SQLite database for persistent usage tracking
- 📊 Analytics dashboard: Real-time cost analysis and usage breakdowns
Option 1: Using npm
# 1. Install dependencies
npm install
# 2. Setup environment variables
cp .env.example .env
# Edit .env and add at least one API key (see .env.example for details)
# 3. Build and start the application
npm run build
npm start
Option 2: Using Docker
# 1. Setup environment variables
cp .env.example .env
# Edit .env and add at least one API key (see .env.example for details)
# 2. Start with Docker Compose
docker compose up --build -d
Important: The dashboard is initially empty. After setup, send a query using your own client/tool (IDE, app, or API) through the gateway; usage appears once at least one request is processed.
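Before pointing a client at the gateway, it is worth confirming the containers came up cleanly; the standard Compose commands are enough (no project-specific service names assumed):

```bash
# List running services and their published ports
docker compose ps

# Follow logs for all services while sending the first request
docker compose logs -f
```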
Access Points:
- Gateway API: http://localhost:3001
- Dashboard UI: http://localhost:3000
- Detailed setup steps live in docs/getting-started.md; check docs/ for additional guides.
- Point your client/tool to the gateway (http://localhost:3001 or http://localhost:3001/v1); see the integration guides below.
- Send a query using your usual workflow; both OpenAI-compatible and Anthropic-compatible endpoints are tracked (a minimal example follows this list).
- Open http://localhost:3000 to view usage and costs after your first request.
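For that first request, any OpenAI-compatible client works; a minimal curl (using gpt-4o as a stand-in for whatever model your configured provider supports) looks like this:

```bash
# Minimal first request to populate the dashboard
curl -X POST http://localhost:3001/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4o", "messages": [{"role": "user", "content": "Hello"}]}'
```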
Required: At least one API key from Anthropic, OpenAI, xAI, or OpenRouter (see .env.example for configuration details).
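As a rough sketch, a populated .env might look like the following; the variable names here are assumptions based on common conventions, so treat .env.example as authoritative:

```bash
# Hypothetical .env contents - names are assumptions, see .env.example
ANTHROPIC_API_KEY=sk-ant-...
OPENAI_API_KEY=sk-...
XAI_API_KEY=...
OPENROUTER_API_KEY=...
```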
Use the gateway with Claude Code for multi-provider AI assistance:
# Point Claude Code to the gateway
export ANTHROPIC_BASE_URL="http://localhost:3001"
export ANTHROPIC_MODEL="grok-code-fast-1" # or "gpt-4o", "claude-sonnet-4-20250514"
# Start Claude Code as usual
claude
📖 Complete Claude Code Guide →
Use the gateway with Codex for OpenAI-compatible development tools:
# Point Codex to the gateway
export OPENAI_BASE_URL="http://localhost:3001/v1"
# Start Codex as usual
codex
🚧 This is a beta release - please report any issues or feedback!
Known Limitations:
- Some edge cases in model routing may exist
Getting Help:
- Check the logs in gateway/logs/gateway.log for debugging (see the commands below)
- Ensure your API keys have sufficient credits
- Test with simple requests first before complex workflows
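Two quick checks that cover most problems - the log path comes from the list above, and /health is documented under the API endpoints:

```bash
# Confirm the gateway is up and responding
curl http://localhost:3001/health

# Watch the gateway log while reproducing the problem
tail -f gateway/logs/gateway.log
```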
ekai-gateway/
├── gateway/ # Backend API and routing
├── ui/ # Dashboard frontend
├── shared/ # Shared types and utilities
└── package.json # Root package configuration
POST /v1/chat/completions # OpenAI-compatible chat endpoint
POST /v1/messages # Anthropic-compatible messages endpoint
POST /v1/responses # OpenAI Responses endpoint
GET /usage # View token usage and costs
GET /health # Health check endpoint
# OpenAI-compatible endpoint (works with all providers)
curl -X POST http://localhost:3001/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{"model": "gpt-4o", "messages": [{"role": "user", "content": "Hello"}]}'
# Use Claude models via OpenAI-compatible endpoint
curl -X POST http://localhost:3001/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{"model": "claude-3-5-sonnet-20241022", "messages": [{"role": "user", "content": "Hello"}]}'
# Use xAI Grok models
curl -X POST http://localhost:3001/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{"model": "grok-code-fast", "messages": [{"role": "user", "content": "Hello"}]}'
# Anthropic-compatible endpoint
curl -X POST http://localhost:3001/v1/messages \
-H "Content-Type: application/json" \
-d '{"model": "claude-3-5-sonnet-20241022", "max_tokens": 100, "messages": [{"role": "user", "content": "Hello"}]}'
# OpenAI Responses endpoint
curl -X POST http://localhost:3001/v1/responses \
-H "Content-Type: application/json" \
-d '{"model": "gpt-4o-mini", "input": "Say hi in one short sentence.", "temperature": 0.7, "max_output_tokens": 128}'
# Both endpoints support all models and share conversation context
# Client A uses OpenAI format, Client B uses Anthropic format - same conversation!
# Check usage and costs
curl http://localhost:3001/usage
The proxy uses cost-based optimization to automatically select the cheapest available provider:
- Special routing: Grok models (grok-code-fast, grok-beta) → xAI (if available)
- Cost optimization: All other models are routed to the cheapest provider that supports them
- Provider fallback: Graceful fallback if preferred provider is unavailable
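Routing is entirely server-side, so clients never change: both requests below hit the same endpoint, and the gateway picks the provider (Grok via the special-routing rule if an xAI key is configured, gpt-4o via whichever configured provider is cheapest):

```bash
# Handled by xAI under the special-routing rule (if an xAI key is set)
curl -X POST http://localhost:3001/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "grok-code-fast", "messages": [{"role": "user", "content": "Hi"}]}'

# Routed to the cheapest configured provider that supports gpt-4o
curl -X POST http://localhost:3001/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4o", "messages": [{"role": "user", "content": "Hi"}]}'
```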
Supported providers:
- Anthropic: Claude models (direct API access)
- OpenAI: GPT models (direct API access)
- xAI: Grok models (direct API access)
- OpenRouter: Multi-provider access with provider/model format (example below)
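With OpenRouter, the model field carries the provider prefix; the ID below is illustrative (check OpenRouter's model list for current names):

```bash
# provider/model format routed through OpenRouter
# (model ID is an example; see openrouter.ai/models for valid IDs)
curl -X POST http://localhost:3001/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "anthropic/claude-3.5-sonnet", "messages": [{"role": "user", "content": "Hello"}]}'
```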
Multi-client proxy: Web apps, mobile apps, and scripts share conversations across providers with automatic cost tracking and optimization.
npm run build # Build TypeScript for production
npm start # Start both gateway and dashboard
Individual services:
npm run start:gateway # Gateway API only (port 3001)
npm run start:ui # Dashboard UI only (port 3000)
npm run dev # Start both gateway and dashboard in development mode
Individual services:
cd gateway && npm run dev # Gateway only (port 3001)
cd ui/dashboard && npm run dev # Dashboard only (port 3000)
Licensed under the Apache License 2.0.