This is an enhanced version of the official Gemini CLI with Ollama integration, allowing you to run powerful language models locally on your machine. This extension provides all the capabilities of the original Gemini CLI while adding support for local AI inference through Ollama.
- Privacy-First: Run AI models completely offline on your local machine
- Performance: Direct local inference without network latency
- Model Flexibility: Support for Llama, Mistral, Qwen, Gemma, and many other models
- Full Compatibility: All original Gemini CLI features and tools work seamlessly
- Custom Branding: Distinctive Ollama-themed banners and interface
- Easy Switching: Switch between Ollama and Gemini models as needed
This repository contains the Gemini CLI, a command-line AI workflow tool that connects to your tools, understands your code and accelerates your workflows.
With the Gemini CLI you can:
- Query and edit large codebases in and beyond Gemini's 1M token context window.
- Generate new apps from PDFs or sketches, using Gemini's multimodal capabilities.
- Automate operational tasks, like querying pull requests or handling complex rebases.
- Use tools and MCP servers to connect new capabilities, including media generation with Imagen, Veo or Lyria.
- Ground your queries with the Google Search tool, built in to Gemini.
- Prerequisites: Ensure you have Node.js version 18 or higher installed.
- Run the CLI: Execute the following command in your terminal:
npx https://github.com/google-gemini/gemini-cli
Or install it with:
npm install -g @google/gemini-cli
gemini
- Pick a color theme
- Authenticate: When prompted, sign in with your personal Google account. This will grant you up to 60 model requests per minute and 1,000 model requests per day using Gemini.
You are now ready to use the Gemini CLI!
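If the CLI fails to start, the Node.js prerequisite is worth checking first. The following is a minimal sketch of such a check; the function name `node_version_ok` is made up for illustration and is not part of the Gemini CLI.

```shell
# Hypothetical helper: succeeds if the given Node.js version string is >= 18.
# Accepts the format printed by `node --version`, for example "v20.11.1".
node_version_ok() {
  nv_version="${1#v}"            # strip the leading "v"
  nv_major="${nv_version%%.*}"   # keep everything before the first dot
  case "$nv_major" in
    ''|*[!0-9]*) return 1 ;;     # empty or non-numeric: treat as unsupported
  esac
  [ "$nv_major" -ge 18 ]
}

# Check the locally installed Node.js, if there is one.
if command -v node >/dev/null 2>&1; then
  if node_version_ok "$(node --version)"; then
    echo "Node.js $(node --version) satisfies the version 18+ requirement"
  else
    echo "Node.js $(node --version) is too old; version 18 or higher is required"
  fi
else
  echo "Node.js was not found on PATH"
fi
```

Only the major version is compared, which is enough for an "18 or higher" requirement.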
If you need to use a specific model or require a higher request capacity, you can use an API key:
- Generate a key from Google AI Studio.
- Set it as an environment variable in your terminal. Replace YOUR_API_KEY with your generated key.
export GEMINI_API_KEY="YOUR_API_KEY"
For other authentication methods, including Google Workspace accounts, see the authentication guide.
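Typing the key directly into an `export` command leaves it in your shell history. One possible alternative, sketched below, is to keep the key in a file that only you can read and export it from there. The helper name `load_gemini_api_key` and the file path in the usage comment are assumptions for illustration; only the `GEMINI_API_KEY` variable comes from the steps above.

```shell
# Hypothetical helper: export GEMINI_API_KEY from the first line of a file.
load_gemini_api_key() {
  key_file="$1"
  if [ ! -r "$key_file" ]; then
    echo "cannot read key file: $key_file" >&2
    return 1
  fi
  # Read only the first line; IFS= and -r keep the key exactly as written.
  key_line=""
  IFS= read -r key_line < "$key_file" || true
  if [ -z "$key_line" ]; then
    echo "key file is empty: $key_file" >&2
    return 1
  fi
  GEMINI_API_KEY="$key_line"
  export GEMINI_API_KEY
}

# Example usage (the path is an assumption, not a location the CLI knows about):
#   chmod 600 "$HOME/.gemini_api_key"
#   load_gemini_api_key "$HOME/.gemini_api_key" && gemini
```

The function leaves any existing `GEMINI_API_KEY` value untouched when the file is missing or empty.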
- Install Ollama: Download and install Ollama from ollama.ai
- Pull a Model: Download a language model (e.g., ollama pull qwen2.5:7b or ollama pull llama3.1:8b)
- Start Ollama Server: Run ollama serve to start the local server
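Before moving on, you may want to confirm that the local server is actually answering. The sketch below queries Ollama's `/api/tags` endpoint, which lists the models you have pulled. The helper names `ollama_tags_url` and `ollama_is_up` are made up for illustration, and the check assumes `curl` is installed.

```shell
# Hypothetical helper: build the model-list URL from a base URL.
# Falls back to the default local address when no base URL is given.
ollama_tags_url() {
  tags_base="${1:-http://localhost:11434}"
  printf '%s/api/tags\n' "${tags_base%/}"   # drop one trailing slash, if any
}

# Hypothetical helper: succeeds if the server answers within two seconds.
ollama_is_up() {
  curl --silent --fail --max-time 2 "$(ollama_tags_url "$1")" >/dev/null 2>&1
}

if ollama_is_up "${OLLAMA_BASE_URL:-}"; then
  echo "Ollama server is reachable"
else
  echo "Ollama server is not reachable; is 'ollama serve' running?"
fi
```

The same message is printed when `curl` is missing, so a failure here does not by itself prove the server is down.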
- Install this Extension:
git clone <this-repository>
cd gemini-cli
npm install
npm run build
npm link
- Configure for Ollama:
export GEMINI_AUTH_TYPE=ollama
export OLLAMA_MODEL=qwen2.5:7b  # or your preferred model
export OLLAMA_BASE_URL=http://localhost:11434  # default Ollama URL
- Start the CLI:
gemini
You'll see the distinctive Ollama extension banner and can start using local AI models immediately!
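If the banner does not appear, a quick look at the three environment variables from the configuration step can help. The following is a minimal sketch; `check_ollama_env` is a hypothetical helper, and treating an unset `OLLAMA_MODEL` as a problem is an assumption, since the extension may fall back to a default model.

```shell
# Hypothetical helper: prints one line per problem found in the current shell
# and prints nothing when the settings look usable.
check_ollama_env() {
  if [ "${GEMINI_AUTH_TYPE:-}" != "ollama" ]; then
    echo "GEMINI_AUTH_TYPE is '${GEMINI_AUTH_TYPE:-}', expected 'ollama'"
  fi
  # Assumption: a model must be named explicitly.
  if [ -z "${OLLAMA_MODEL:-}" ]; then
    echo "OLLAMA_MODEL is not set"
  fi
  # An unset base URL is fine; the default local address is used instead.
  case "${OLLAMA_BASE_URL:-http://localhost:11434}" in
    http://*|https://*) ;;
    *) echo "OLLAMA_BASE_URL should start with http:// or https://" ;;
  esac
}

problems="$(check_ollama_env)"
if [ -z "$problems" ]; then
  echo "Ollama settings look complete"
else
  echo "$problems"
fi
```

The check only inspects variables; it does not contact the Ollama server or verify that the named model has been pulled.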
- Custom Banners: The CLI automatically displays Ollama-themed ASCII art when using Ollama
- Banner Customization: Use /banner ollama to switch between different banner styles
- Local Privacy: All processing happens on your machine - no data sent to external servers
- Model Management: Switch models by changing the OLLAMA_MODEL environment variable
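Because the model is selected through an environment variable, you can also choose it per invocation instead of exporting it for the whole session. The sketch below shows one way to do that; `with_ollama_model` is a hypothetical wrapper, not a command provided by this extension.

```shell
# Hypothetical wrapper: run a command with OLLAMA_MODEL (and the Ollama auth
# type) set for that one invocation only. The calling shell is left unchanged.
with_ollama_model() {
  wom_model="$1"
  shift
  env GEMINI_AUTH_TYPE=ollama OLLAMA_MODEL="$wom_model" "$@"
}

# Demonstration with a stand-in command that just reports what it sees.
with_ollama_model llama3.1:8b sh -c 'echo "model=$OLLAMA_MODEL"'
# prints: model=llama3.1:8b

# With the CLI installed, the same pattern would be:
#   with_ollama_model llama3.1:8b gemini
```

Using `env` keeps the assignment scoped to the child process, so a previously exported `OLLAMA_MODEL` is still in effect afterwards.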
You can easily switch between local Ollama models and Google's Gemini:
# Use Ollama (local)
export GEMINI_AUTH_TYPE=ollama
gemini
# Use Gemini (cloud)
unset GEMINI_AUTH_TYPE # or set to 'oauth' or 'api_key'
gemini
Once the CLI is running, you can start interacting with Gemini from your shell.
You can start a project from a new directory:
cd new-project/
gemini
> Write me a Gemini Discord bot that answers questions using a FAQ.md file I will provide
Or work with an existing project:
git clone https://github.com/google-gemini/gemini-cli
cd gemini-cli
gemini
> Give me a summary of all of the changes that went in yesterday
- For Ollama users: See BUILD_AND_INSTALL.md for detailed installation instructions
- Quick setup: Use the automated install.sh script for one-command installation
- Learn how to contribute to or build from the source.
- Explore the available CLI Commands.
- Check out Banner Customization for Ollama-specific UI options
- If you encounter any issues, review the Troubleshooting guide.
- For more comprehensive documentation, see the full documentation.
- Take a look at some popular tasks for more inspiration.
Head over to the troubleshooting guide if you're having issues.
Use Ollama when:
- Privacy is critical: Sensitive code or data that shouldn't leave your machine
- Offline work: No internet connection or unreliable connectivity
- Low latency needed: Real-time applications requiring instant responses
- Cost optimization: Avoid API usage costs for high-volume tasks
- Specific models: Need access to particular open-source models
Use Gemini when:
- Maximum capability: Need the most advanced reasoning and multimodal features
- Large context: Working with very large codebases (1M+ tokens)
- Web grounding: Need real-time web search integration
- Latest features: Want access to cutting-edge Gemini capabilities
- Extension Version: v0.1.5
- Base Gemini CLI: Compatible with latest official release
- Unique Features: Ollama integration, custom banners, local AI support
Start by changing (cd) into an existing or newly cloned repository and running gemini.
> Describe the main pieces of this system's architecture.
> What security mechanisms are in place?
> Implement a first draft for GitHub issue #123.
> Help me migrate this codebase to the latest version of Java. Start with a plan.
Use MCP servers to integrate your local system tools with your enterprise collaboration suite.
> Make me a slide deck showing the git history from the last 7 days, grouped by feature and team member.
> Make a full-screen web app for a wall display to show our most interacted-with GitHub issues.
> Convert all the images in this directory to png, and rename them to use dates from the exif data.
> Organise my PDF invoices by month of expenditure.
For details on the terms of service and privacy notice applicable to your use of Gemini CLI, see the Terms of Service and Privacy Notice.
