Add performance optimizations and LRU caching #44

benzntech · 2025-12-05T07:51:28Z

Summary

This PR implements three key performance optimizations to improve response times and reduce API quota consumption:

1. LRU Response Cache (`src/utils/responseCache.ts`)

Implements LRU caching for Gemini API responses
30-minute TTL ensures fresh data
10MB max cache size with automatic eviction
Near-instant responses for repeated queries
Significant reduction in API quota usage

2. Eliminate Code Duplication (`src/utils/geminiExecutor.ts`)

Introduced buildGeminiArgs() helper function
DRY principle for CLI argument construction
Shared between main execution and Flash model fallback
Reduces maintenance burden and potential bugs
Added GeminiCLIOptions interface for type safety

3. Array Buffer Performance (`src/utils/commandExecutor.ts`)

Replaced O(n²) string concatenation with O(n) array buffers
Significant performance improvement for large command outputs
More efficient memory usage during streaming operations

Additional Improvements

Enhanced CLI flag support in constants.ts
Updated tools (ask-gemini.tool.ts, brainstorm.tool.ts) to use new options interface
Dependency cleanup in package.json and package-lock.json

Testing

All existing functionality remains backward compatible
Tested with MCP inspector and Playwright
Verified caching behavior and eviction policies
Confirmed Flash model fallback still works correctly

Performance Impact

Response time: Near-instant for cached queries (vs. 2-5s for API calls)
API quota: Significantly reduced for repeated queries
Memory: Efficient with 10MB cap and automatic eviction
Large outputs: O(n) performance vs. previous O(n²)

Files Changed

src/utils/responseCache.ts (new file)
src/utils/geminiExecutor.ts (refactored)
src/utils/commandExecutor.ts (optimized)
src/constants.ts (enhanced flags)
src/tools/ask-gemini.tool.ts (updated interface)
src/tools/brainstorm.tool.ts (updated interface)
src/index.ts (minor cleanup)
package.json & package-lock.json (dependencies)

All changes maintain backward compatibility while significantly improving performance.

This PR implements three key performance optimizations: 1. **LRU Response Cache** (responseCache.ts) - Caches Gemini API responses with 30-minute TTL - Reduces API quota consumption for repeated queries - 10MB max cache size with automatic eviction - Near-instant responses for cached queries 2. **Eliminate Code Duplication** (geminiExecutor.ts) - New buildGeminiArgs() helper function - DRY principle for CLI argument construction - Shared between main execution and fallback paths - Reduces maintenance burden and potential bugs 3. **Array Buffer Performance** (commandExecutor.ts) - Replace O(n²) string concatenation with O(n) array buffers - Significant performance improvement for large outputs - More efficient memory usage during streaming Additional improvements: - Added GeminiCLIOptions interface for type safety - Enhanced CLI flag support in constants.ts - Updated tools to use new options interface - Dependency cleanup in package.json These optimizations maintain backward compatibility while significantly improving performance for repeated queries and large command outputs.

gemini-code-assist · 2025-12-05T07:51:51Z

Summary of Changes

Hello @benzntech, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the application's performance and efficiency by implementing a robust LRU caching mechanism for Gemini API responses, streamlining CLI argument construction to reduce code duplication, and optimizing command output processing with array buffers. These changes aim to improve response times, lower API quota consumption, and provide a more flexible and maintainable codebase.

Highlights

LRU Response Caching: Introduced a new LRU cache (lru-cache dependency) for Gemini API responses, providing near-instant replies for repeated queries, reducing API quota usage, and featuring a 30-minute TTL and 10MB max size.
Code Duplication Elimination: Refactored geminiExecutor.ts by introducing a buildGeminiArgs() helper function and GeminiCLIOptions interface, centralizing CLI argument construction and improving maintainability.
Array Buffer Performance: Optimized commandExecutor.ts by replacing inefficient O(n²) string concatenation with O(n) array buffers for handling large command outputs, leading to significant performance and memory efficiency gains.
Enhanced CLI Flags: Expanded constants.ts with new CLI flags and default values for features like YOLO mode, approval modes, output formats, and directory inclusion, enhancing the flexibility of Gemini CLI interactions.
Tool Interface Updates: Updated ask-gemini.tool.ts and brainstorm.tool.ts to utilize the new GeminiCLIOptions interface, ensuring consistency and type safety across tool arguments.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces significant performance optimizations and improves code structure. The addition of an LRU cache for API responses is a great feature to reduce latency and quota usage. The implementation in src/utils/responseCache.ts is solid. The refactoring in src/utils/geminiExecutor.ts to use a helper function and an options object greatly improves maintainability and readability. The optimization in src/utils/commandExecutor.ts to use array buffers instead of string concatenation for command output is a correct and effective performance enhancement. I've found a couple of areas for improvement, one related to correctness in commandExecutor.ts and a minor refinement in responseCache.ts.

src/utils/commandExecutor.ts

src/utils/responseCache.ts

- Corrected slash commands to match actual implementation (/ask-gemini, /brainstorm, /fetch-chunk, /Help, /ping) - Added comprehensive documentation for all ask-gemini flags (Phase 1 & 2) - Documented brainstorm tool with methodology options - Added Performance Features section highlighting LRU cache, array buffers, and chunking - Removed outdated /analyze and /sandbox commands that don't exist in code - Added practical examples for each slash command

- Install vitest and @vitest/coverage-v8 for testing - Create vitest.config.ts with coverage configuration - Add test helper utilities for mocking child processes - Create server.test.ts for MCP server initialization tests - Create tools.test.ts for tool registry and schema validation tests - Create ask-gemini.test.ts for ask-gemini tool argument validation - Update package.json with test scripts (test, test:watch, test:coverage) Test coverage: - 17/17 passing: ask-gemini tool argument validation - 3/18 passing: tool registry tests (schema validation passing) - Server tests need mock adjustment but infrastructure is ready Based on testing patterns from Claude Code MCP server tests.

gemini-code-assist bot reviewed Dec 5, 2025

View reviewed changes

src/utils/commandExecutor.ts Show resolved Hide resolved

src/utils/responseCache.ts Show resolved Hide resolved

benzntech added 2 commits December 5, 2025 13:34

benzntech force-pushed the feature/performance-optimizations branch from bf8a880 to a333ed3 Compare December 5, 2025 10:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add performance optimizations and LRU caching #44

Add performance optimizations and LRU caching #44

Uh oh!

benzntech commented Dec 5, 2025

Uh oh!

gemini-code-assist bot commented Dec 5, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add performance optimizations and LRU caching #44

Are you sure you want to change the base?

Add performance optimizations and LRU caching #44

Uh oh!

Conversation

benzntech commented Dec 5, 2025

Summary

1. LRU Response Cache (src/utils/responseCache.ts)

2. Eliminate Code Duplication (src/utils/geminiExecutor.ts)

3. Array Buffer Performance (src/utils/commandExecutor.ts)

Additional Improvements

Testing

Performance Impact

Files Changed

Uh oh!

gemini-code-assist bot commented Dec 5, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

1. LRU Response Cache (`src/utils/responseCache.ts`)

2. Eliminate Code Duplication (`src/utils/geminiExecutor.ts`)

3. Array Buffer Performance (`src/utils/commandExecutor.ts`)