A multi-tenant API proxy for LLM services with integrated authentication, caching, and realtime capabilities.
Vibe Server uses a modular "stacks" architecture powered by vibe-compiler to compile specifications into runnable code:
- stacks/: Feature specifications and documentation
- output/: Generated implementation code
- vibec.json: Build configuration
- Multi-tenant Koa-based web server
- Tenant-specific configuration in Redis
- OpenAI-compatible chat API
- Multiple LLM provider support
- Token usage tracking
- Bidirectional streaming for voice/text
- Session management
- Tool integration
- Tenant-configurable caching
- TTL-based response caching
- Fee adjustments for cached responses
- OAuth with multiple providers (Google, Apple)
- JWT token management
- Stripe subscription integration
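
The caching items in the list above (tenant-configurable caching, TTL-based response caching, and fee adjustments for cached responses) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `TenantCacheConfig` fields, the `TenantResponseCache` class, and the in-memory `Map` standing in for Redis are hypothetical and not taken from the Vibe Server code. The upstream LLM call is modeled as a synchronous function to keep the example short.

```typescript
// Hypothetical per-tenant cache settings; field names are illustrative assumptions.
interface TenantCacheConfig {
  enabled: boolean;
  ttlMs: number;               // how long a cached response stays valid
  cachedFeeMultiplier: number; // e.g. 0.5 bills cache hits at half the base fee
}

interface CacheEntry {
  body: string;
  expiresAt: number; // absolute time in milliseconds
}

interface CachedResult {
  body: string;
  fromCache: boolean;
  fee: number;
}

// In-memory stand-in for a shared store such as Redis (assumption).
class TenantResponseCache {
  // Outer key: tenant ID. Inner key: request key. Keeps tenants isolated.
  private entries = new Map<string, Map<string, CacheEntry>>();
  private now: () => number;

  // The clock is injectable so expiry can be exercised deterministically.
  constructor(now: () => number = () => Date.now()) {
    this.now = now;
  }

  fetch(
    tenantId: string,
    config: TenantCacheConfig,
    requestKey: string,
    baseFee: number,
    produce: () => string, // stands in for the upstream LLM call
  ): CachedResult {
    // Tenants with caching disabled always hit the upstream and pay the full fee.
    if (!config.enabled) {
      return { body: produce(), fromCache: false, fee: baseFee };
    }

    const currentTime = this.now();
    let tenantEntries = this.entries.get(tenantId);
    if (tenantEntries === undefined) {
      tenantEntries = new Map<string, CacheEntry>();
      this.entries.set(tenantId, tenantEntries);
    }

    const hit = tenantEntries.get(requestKey);
    if (hit !== undefined && hit.expiresAt > currentTime) {
      // Cache hit: apply the tenant's fee adjustment.
      return {
        body: hit.body,
        fromCache: true,
        fee: baseFee * config.cachedFeeMultiplier,
      };
    }

    // Miss or expired entry: call upstream and store with a fresh TTL.
    const body = produce();
    tenantEntries.set(requestKey, { body, expiresAt: currentTime + config.ttlMs });
    return { body, fromCache: false, fee: baseFee };
  }
}
```

In a deployment like the one described, the per-tenant settings would presumably be read from the tenant configuration in Redis, and the cache itself would likely live in a shared store rather than process memory; the sketch only shows how the TTL check and fee adjustment could fit together.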
Each stack has corresponding tests in the stacks/tests/ directory.
vibec assembles the final application from the stack specifications.