A multi-provider Ollama proxy server that lets you use JetBrains AI Assistant with third-party commercial LLMs such as OpenAI, Google Gemini, and OpenRouter, taking advantage of their free-tier usage - especially for models like Kimi K2 and DeepSeek R1.
This proxy server translates requests from the Ollama API format to OpenAI, Google Gemini, or OpenRouter API formats, allowing you to use these commercial LLMs with tools that support the Ollama API, such as JetBrains AI Assistant.
The server runs on port 11434 by default (the same port as Ollama) and requires API keys for OpenAI, Gemini, and/or OpenRouter, configured via environment variables.
- Seamless integration with JetBrains AI Assistant
- Support for multiple LLM providers:
  - OpenAI models
  - Google Gemini models
  - OpenRouter models
- Compatible with Ollama API endpoints:
  - `/api/chat` - for chat completions
  - `/api/generate` - for text generation
  - `/api/tags` - for listing available models
  - `/api/version` - for version information
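For example, a chat completion request to the proxy uses the standard Ollama request shape; the model name and prompt below are only illustrative:

```bash
# Send an Ollama-style chat request through the proxy
curl http://localhost:11434/api/chat \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [
      { "role": "user", "content": "Summarize what a proxy server does." }
    ],
    "stream": false
  }'
```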
 
The proxy server supports a variety of models from OpenAI, Google Gemini, and OpenRouter. The default models are:
- gpt-4o-mini
- gpt-4.1-mini
- gpt-4.1-nano
- gemini-2.5-flash
- gemini-2.5-flash-lite-preview-06-17
- deepseek-r1
You can customize the available models by creating a models.json file in the directory where you run the proxy server.
This file should contain a JSON object where keys are the model names you want to expose, and values are objects
specifying the provider (e.g., openai, google, openrouter) and the actual model name as expected by the
respective API.
If a models.json file is found in the current working directory, the proxy will load models from it. Otherwise, it
will use the built-in default models.
Example models.json:

```json
{
    "my-custom-gpt": { "provider": "openai", "model": "gpt-4o-mini" },
    "my-gemini-pro": { "provider": "google", "model": "gemini-pro" },
    "my-openrouter-model": { "provider": "openrouter", "model": "mistralai/mistral-7b-instruct-v0.2" }
}
```

This allows you to rename models, add new ones supported by the providers, or remove models you don't intend to use.
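Whether you use the defaults or a custom models.json, you can check which model names the proxy actually exposes by querying the tags endpoint:

```bash
# List the models the proxy advertises to Ollama clients
curl http://localhost:11434/api/tags
```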
Prerequisites:

- Node.js >= 18.0.0 or Bun >= 1.2.0
- API key for OpenAI, Google Gemini, and/or OpenRouter
Install and run from source:

```bash
# Clone the repository
git clone https://github.com/xrip/ollama-api-proxy.git
cd ollama-api-proxy
# Install dependencies
npm install
# Create .env file with your API keys
echo "OPENAI_API_KEY=your_openai_api_key" > .env
echo "GEMINI_API_KEY=your_gemini_api_key" >> .env
echo "OPENROUTER_API_KEY=your_openrouter_api_key" >> .env
echo "OPENROUTER_API_URL=your_openrouter_api_url" >> .env  # optional, default is https://openrouter.ai/api/v1
# Start the server
npm start
```

Run with Docker:

```bash
# Clone the repository
git clone https://github.com/xrip/ollama-api-proxy.git
cd ollama-api-proxy
# Create .env file with your API keys
echo "OPENAI_API_KEY=your_openai_api_key" > .env
echo "GEMINI_API_KEY=your_gemini_api_key" >> .env
echo "OPENROUTER_API_KEY=your_openrouter_api_key" >> .env
echo "OPENROUTER_API_URL=your_openrouter_api_url" >> .env  # optional, default is https://openrouter.ai/api/v1
# Build and run the Docker container
docker build -t ollama-proxy .
docker run -p 11434:11434 --env-file .env ollama-proxy
```

Run with npx (no local installation):

```bash
# Create a directory for your configuration
mkdir ollama-proxy-config
cd ollama-proxy-config
# Create .env file with your API keys
echo "OPENAI_API_KEY=your_openai_api_key" > .env
echo "GEMINI_API_KEY=your_gemini_api_key" >> .env
echo "OPENROUTER_API_KEY=your_openrouter_api_key" >> .env
echo "OPENROUTER_API_URL=your_openrouter_api_url" >> .env  # optional, default is https://openrouter.ai/api/v1
# Run the proxy server using npx
npx ollama-api-proxy
# Alternatively, you can specify a specific version
# npx ollama-api-proxy@1.0.0
```

Run with bunx:

```bash
# Create a directory for your configuration
mkdir ollama-proxy-config
cd ollama-proxy-config
# Create .env file with your API keys
echo "OPENAI_API_KEY=your_openai_api_key" > .env
echo "GEMINI_API_KEY=your_gemini_api_key" >> .env
echo "OPENROUTER_API_KEY=your_openrouter_api_key" >> .env
echo "OPENROUTER_API_URL=your_openrouter_api_url" >> .env  # optional, default is https://openrouter.ai/api/v1
# Run the proxy server using bunx
bunx ollama-api-proxy
# Alternatively, you can specify a specific version
# bunx ollama-api-proxy@1.0.0
```

The proxy server is configured using environment variables:
- PORT: The port on which the server will run (default: 11434)
- OPENAI_API_KEY: Your OpenAI API key (required for OpenAI models)
- GEMINI_API_KEY: Your Google Gemini API key (required for Gemini models)
- OPENROUTER_API_KEY: Your OpenRouter API key (required for OpenRouter models)
- OPENROUTER_API_URL: Custom OpenRouter API base URL (optional; default: https://openrouter.ai/api/v1)
- NODE_ENV: Set to `production` for production use or `development` for development
You can set these variables in a .env file in the project root.
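For example, assuming the server also reads these variables from the process environment (not only from a .env file), you can override the port at startup:

```bash
# Run the proxy on port 8080 instead of the default 11434
PORT=8080 npm start
```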
To use the proxy with JetBrains AI Assistant:

- Start the Ollama Proxy server
- Configure JetBrains AI Assistant to use Ollama
- Set the Ollama server URL to http://localhost:11434
- Select one of the available models (e.g., gpt-4o-mini, gemini-2.5-flash, deepseek-r1)
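To confirm the proxy is reachable at that URL before (or while) configuring the IDE, you can hit the version endpoint:

```bash
# Sanity-check that the proxy is listening where the IDE expects Ollama
curl http://localhost:11434/api/version
```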
For development:

```bash
# Run in development mode with hot reloading (requires Bun)
bun run dev
```

This project is licensed under the MIT License - see the LICENSE file for details.