feat: integrate models.dev as dynamic model source by tommy-ca · Pull Request #1117 · eyaltoledano/claude-task-master

tommy-ca · 2025-08-10T03:44:17Z

Summary

Integrates https://models.dev/api.json as primary source for AI model information
Maintains complete backward compatibility with existing CLI interface
Implements intelligent caching and graceful fallback to static models

Implementation Details

ModelsDevService: Handles API integration with 24-hour caching (memory + disk)
ModelMerger: Intelligently merges static and dynamic models, preferring fresh data
getAllAvailableModels(): New async function in config-manager for dynamic loading
Transparent Integration: Existing getAvailableModels() unchanged for compatibility
Comprehensive Tests: Integration tests covering online/offline scenarios

Key Features

✅ Uses curl https://models.dev/api.json as requested
✅ Specs-driven development with comprehensive documentation
✅ Zero breaking changes to existing CLI interface
✅ Graceful degradation: API → cache → stale cache → static models
✅ Model deduplication with preference for models.dev data
✅ Optional cache management utilities

Files Changed (7 total)

src/services/models-dev-service.js - Core API integration service
src/services/model-merger.js - Static/dynamic model merging logic
scripts/modules/config-manager.js - Added getAllAvailableModels() function
scripts/modules/task-manager/models.js - Integrated dynamic model loading
scripts/cache-models.js - Optional cache management utility
tests/integration/models-dev-integration.test.js - Comprehensive test suite
docs/SPEC-models-dev-integration.md - Technical specification

🤖 Generated with Claude Code

Summary by CodeRabbit

New Features
- Integrated dynamic AI model discovery from models.dev API, providing richer and more up-to-date model information.
- Added CLI script for managing and refreshing the models cache.
- Enhanced the models CLI command with new filtering options (by provider, free/paid status).
- Added support for Claude Code models with streamlined OAuth authentication via the Claude Code CLI.
- Provided detailed documentation for integrating Claude Code and dynamic models.
Bug Fixes
- Improved error handling and graceful fallback to static models if dynamic fetching fails.
Documentation
- Introduced comprehensive guides for models.dev integration and Claude Code setup.
Tests
- Added integration and unit tests for dynamic model fetching, merging, and Claude Code provider scenarios.
Chores
- Updated dependencies and .gitignore to support new features and cache management.

- Update AI SDK from v4.3.16 to v5.0.0-beta.34 - Add ai-sdk-provider-claude-code integration - Replace custom SDK with official ai-sdk-provider-claude-code package - Add OAuth token authentication via Claude Code CLI - Support for 'sonnet' and 'opus' models - Comprehensive integration tests and documentation - Direct integration test script for verification 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

- Updates @anthropic-ai/claude-code to latest version - Ensures compatibility with latest Claude Code CLI features - Maintains optional dependency status for graceful degradation 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

changeset-bot · 2025-08-10T03:44:20Z

⚠️ No Changeset found

Latest commit: 682b114

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

coderabbitai · 2025-08-10T03:44:29Z

Walkthrough

This update introduces dynamic AI model discovery in Task Master by integrating the external models.dev API, replacing the previous static model list. It adds new service and merging layers, implements caching, updates CLI commands with filtering options, and enhances Claude Code provider integration. Documentation and comprehensive tests are included.

Changes

Cohort / File(s)	Change Summary
Documentation: Model Integration & Claude Code `docs/SPEC-models-dev-integration.md`, `docs/claude-code-integration.md`	Adds detailed specifications for models.dev API integration and a comprehensive Claude Code integration guide, covering setup, authentication, usage, troubleshooting, and SDK usage.
Service Layer: Models.dev API Integration `src/services/models-dev-service.js`, `src/services/model-merger.js`	Introduces `ModelsDevService` for fetching, caching, and normalizing models.dev API data, and `ModelMerger` for merging and deduplicating static and dynamic models, exporting singleton instances for global use.
Static & Dynamic Model Management `scripts/modules/config-manager.js`, `scripts/modules/task-manager/models.js`	Adds `getAllAvailableModels` to merge static and dynamic models; updates model retrieval logic to prefer dynamic models, with fallback to static, and logs source of data.
CLI Enhancements: Model Listing & Cache Management `scripts/modules/commands.js`, `scripts/cache-models.js`	Enhances the `models` CLI command with provider and cost filters; adds a new script for cache management (refresh, clear, status) for models.dev data.
Claude Code Provider & SDK Integration `src/ai-providers/claude-code.js`, `package.json`	Switches Claude Code provider to use the new SDK package, clarifies authentication, improves error handling, adds model support introspection, and updates dependencies.
Testing: Integration & Unit `tests/integration/models-dev-integration.test.js`, `tests/integration/claude-code-optional.test.js`, `tests/unit/ai-providers/claude-code.test.js`	Adds integration tests for models.dev API and merging logic, and expands Claude Code provider tests for SDK and CLI scenarios.
Direct Claude Code Test Script `scripts/test-claude-code-direct.js`	Adds a script to directly test Claude Code SDK integration with generate and stream capabilities, including error handling and troubleshooting.
Git Ignore Update `.gitignore`	Ignores the `.taskmaster/models-cache.json` file used for models.dev cache.

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant CLI/TaskMaster
    participant ConfigManager
    participant ModelsDevService
    participant ModelMerger
    participant StaticModels

    User->>CLI/TaskMaster: Request available models (e.g., via CLI)
    CLI/TaskMaster->>ConfigManager: getAllAvailableModels()
    ConfigManager->>ModelsDevService: searchModels()
    ModelsDevService->>ModelsDevService: Check cache (24h TTL)
    alt Cache valid
        ModelsDevService-->>ConfigManager: Return cached dynamic models
    else Cache expired or missing
        ModelsDevService->>ModelsDevService: Fetch from models.dev API
        ModelsDevService->>ModelsDevService: Save to cache
        ModelsDevService-->>ConfigManager: Return dynamic models
        Note right of ModelsDevService: On failure, fallback to stale cache or error
    end
    ConfigManager->>StaticModels: getAvailableModels()
    ConfigManager->>ModelMerger: mergeStaticAndDynamic(static, dynamic)
    ModelMerger-->>ConfigManager: Merged models
    ConfigManager-->>CLI/TaskMaster: All available models
    CLI/TaskMaster-->>User: Display models (with CLI filters if specified)

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

Task 104: Implement 'scope-up' and 'scope-down' CLI Commands for Dynamic Task Complexity Adjustment #1069: Introduces new CLI commands and supporting infrastructure for dynamic task complexity adjustment, which may interact with or build upon the enhanced model discovery and provider integration in this PR.

✨ Finishing Touches

📝 Generate Docstrings

🧪 Generate unit tests

Create PR with unit tests
Post copyable unit tests in a comment

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Explain this complex logic.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai explain this code block.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and explain its main purpose.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai generate docstrings to generate docstrings for this PR.
@coderabbitai generate sequence diagram to generate a sequence diagram of the changes in this PR.
@coderabbitai generate unit tests to generate unit tests for this PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

- Add ModelsDevService for API integration with 24h caching - Add ModelMerger for intelligent static/dynamic model deduplication - Update config-manager with getAllAvailableModels() async function - Maintain backward compatibility with existing CLI interface - Add comprehensive integration tests and documentation - Support graceful fallback to static models when API unavailable 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

- Add missing haiku model definition to static sources - Ensures all three Claude Code models (opus, sonnet, haiku) are properly supported - Resolves validation warning when setting haiku as main model - SWE score: 40.0%, maxTokens: 200000, free pricing

tommy-ca · 2025-08-10T04:33:42Z

🔧 Latest Updates - Bug Fixes & Refinements

Recent commits added important fixes:

✅ Fix: Resolve empty log lines and claude-code model availability (`86b9d4c`)

Issues Resolved:

Empty [INFO] log lines - Fixed multiple log() calls missing log level parameters in:
- src/services/models-dev-service.js (10 calls fixed)
- src/services/model-merger.js (3 calls fixed)
Claude Code haiku model - Removed from supported models since Claude Code only supports opus and sonnet

Before:

Fetching current model configuration...
[INFO] 
[INFO] 
[INFO] 
[INFO] 
...
│ claude-code   │ haiku    │ 40.0% ★★☆ │ Free │

After:

Fetching current model configuration...
[INFO] [MODEL-MERGER] Merged 87 static + 505 dynamic → 559 total models

# claude-code now correctly shows only opus and sonnet

✅ All Tests Passing

Integration tests validate the fixes
No breaking changes to existing functionality
Clean, professional command output

These refinements ensure a polished user experience with the new models.dev integration.

- Remove haiku model from claude-code provider (only supports opus/sonnet) - Fix empty [INFO] log lines in models command output by adding proper log levels - Update models-dev-service.js: add log level parameters to all log calls - Update model-merger.js: add log level parameters to all log calls 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

tommy-ca · 2025-08-10T04:36:44Z

🧹 Cleaned Up Commit

Updated commit 2bdc226: Removed .taskmaster/ directory changes from the fix commit to keep project-specific configurations out of the feature branch.

✅ What's Included:

✅ Fixed empty [INFO] log lines in models command
✅ Corrected claude-code provider models (removed haiku)
✅ Enhanced log output with proper log levels
✅ All important fixes preserved

❌ What's Excluded:

❌ .taskmaster/config.json - Project-specific configuration
❌ .taskmaster/tasks/tasks.json - Project-specific tasks

The PR is now clean and focused on the core feature and bug fixes without including development environment artifacts.

- Add --filter-provider option to show models from specific providers - Add --exclude-provider option to hide models from providers (supports multiple) - Add --only-free option to show only free models - Add --only-paid option to show only paid models - Filters can be combined for refined model lists - Updated help documentation with filtering examples - Maintains backward compatibility with existing functionality Examples: task-master models --filter-provider claude-code task-master models --exclude-provider bedrock --exclude-provider openrouter task-master models --only-free task-master models --filter-provider anthropic --only-paid 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

tommy-ca · 2025-08-10T04:43:34Z

🎛️ New Feature: Provider & Cost Filters

Latest commit 682b114: Added comprehensive filtering capabilities to the models command for clean, focused model lists.

✨ New Filter Options:

🔍 Provider Filters:

--filter-provider <provider> - Show only models from specific provider
--exclude-provider <provider> - Exclude models from provider(s) - supports multiple!

💰 Cost Filters:

--only-free - Show only free models
--only-paid - Show only paid models

🚀 Key Features:

✅ Combinable filters - Mix and match for precise filtering
✅ Multiple excludes - Use --exclude-provider multiple times
✅ Clear messaging - Shows what filters are applied
✅ Backward compatible - All existing functionality preserved

📋 Usage Examples:

# Show only Anthropic models
task-master models --filter-provider anthropic

# Show only Claude Code models
task-master models --filter-provider claude-code  

# Exclude multiple providers
task-master models --exclude-provider bedrock --exclude-provider openrouter

# Show only free models
task-master models --only-free

# Combine filters: free Anthropic models
task-master models --filter-provider anthropic --only-free

Perfect for users who want clean, focused model lists without the overwhelming full catalog! 🎯

coderabbitai

Actionable comments posted: 37

🔭 Outside diff range comments (2)

package.json (1)
13-27: Add the full test script set per repository standards

Per coding_guidelines, include test:unit, test:integration, and test:ci scripts alongside existing ones.
   "scripts": {
     "test": "node --experimental-vm-modules node_modules/.bin/jest",
+    "test:unit": "node --experimental-vm-modules node_modules/.bin/jest tests/unit",
+    "test:integration": "node --experimental-vm-modules node_modules/.bin/jest tests/integration",
     "test:watch": "node --experimental-vm-modules node_modules/.bin/jest --watch",
     "test:coverage": "node --experimental-vm-modules node_modules/.bin/jest --coverage",
     "test:e2e": "./tests/e2e/run_e2e.sh",
+    "test:ci": "node --experimental-vm-modules node_modules/.bin/jest --ci --coverage",
     "test:e2e-report": "./tests/e2e/run_e2e.sh --analyze-log",
tests/unit/ai-providers/claude-code.test.js (1)
76-87: Expand assertions to validate commandName handling (optional)

You can assert that getClient accepts commandName without throwing, which you do. If feasible, also assert createClaudeCode was invoked once (ensuring integration wiring).
 it('should handle commandName parameter', () => {
   const client = provider.getClient({ 
     commandName: 'test-command' 
   });
   expect(client).toBeDefined();
+  const { createClaudeCode } = jest.mocked(await import('ai-sdk-provider-claude-code'));
+  expect(createClaudeCode).toHaveBeenCalledTimes(1);
 });
Note: If adding the above, move the import to a beforeAll and reference its mocked instance here to keep the test body synchronous.

coderabbitai · 2025-08-10T04:50:55Z

docs/claude-code-integration.md

+# Claude Code Integration Guide
+
+This guide covers how to use Task Master with Claude Code AI SDK integration for enhanced AI-powered development workflows.
+
+## Overview
+
+Claude Code integration allows Task Master to leverage the Claude Code CLI for AI operations without requiring direct API keys. The integration uses OAuth tokens managed by the Claude Code CLI itself.
+
+## Authentication Setup
+
+The Claude Code provider uses token authentication managed by the Claude Code CLI.


🧹 Nitpick (assertive)

Fix markdown spacing and code-fence language for top section

Headings and fenced blocks should be separated by blank lines, and fences should have a language.

-# Claude Code Integration Guide - -This guide covers how to use Task Master with Claude Code AI SDK integration for enhanced AI-powered development workflows. - -## Overview +# Claude Code Integration Guide + +This guide covers how to use Task Master with Claude Code AI SDK integration for enhanced AI-powered development workflows. + +## Overview

And for the first fenced block:

- ``` + ```bash # Installation method depends on your system # Follow Claude Code documentation for installation ```

Apply similarly throughout to address MD022/MD031/MD040.

🧰 Tools

🪛 LanguageTool

[grammar] ~1-~1: Use correct spacing
Context: # Claude Code Integration Guide This guide covers how to use Task Master...

(QB_NEW_EN_OTHER_ERROR_IDS_5)

[grammar] ~3-~3: Use correct spacing
Context: ...ion for enhanced AI-powered development workflows. ## Overview Claude Code integration allows...

(QB_NEW_EN_OTHER_ERROR_IDS_5)

[grammar] ~7-~7: Use correct spacing
Context: ...h tokens managed by the Claude Code CLI itself. ## Authentication Setup The Claude Code pr...

(QB_NEW_EN_OTHER_ERROR_IDS_5)

[grammar] ~9-~9: Use correct spacing
Context: ...ude Code CLI itself. ## Authentication Setup The Claude Code provider uses token auth...

(QB_NEW_EN_OTHER_ERROR_IDS_5)

[grammar] ~11-~11: Use correct spacing
Context: ...thentication managed by the Claude Code CLI. ### Prerequisites 1. **Install Claude Code ...

(QB_NEW_EN_OTHER_ERROR_IDS_5)

🪛 markdownlint-cli2 (0.17.2)

3-3: Headings should be surrounded by blank lines
Expected: 1; Actual: 0; Below

(MD022, blanks-around-headings)

6-6: Headings should be surrounded by blank lines
Expected: 1; Actual: 0; Below

(MD022, blanks-around-headings)

7-7: Lists should be surrounded by blank lines

(MD032, blanks-around-lists)

🤖 Prompt for AI Agents

In docs/claude-code-integration.md around lines 1 to 11, fix markdown formatting by ensuring there are blank lines before and after headings and fenced code blocks. Add the appropriate language identifier (e.g., bash) to all fenced code blocks. Review the entire document to apply these spacing and code-fence language fixes consistently to comply with markdown linting rules MD022, MD031, and MD040.

coderabbitai · 2025-08-10T04:50:55Z

docs/claude-code-integration.md

+1. **Environment Variable** (optional): `CLAUDE_CODE_OAUTH_TOKEN`
+   - Useful for CI/CD environments or when you want to override the default token
+   - Not required if you've set up the CLI token
+
+2. **Claude Code CLI Token** (recommended): Token managed by `claude setup-token`
+   - Automatically used when available
+   - Most convenient for local development


🛠️ Refactor suggestion

⚠️ Potential issue

Environment variable naming mismatch with implementation

The doc recommends CLAUDE_CODE_OAUTH_TOKEN, but the provider’s getRequiredApiKeyName() returns CLAUDE_CODE_API_KEY (and isRequiredApiKey() returns false). Align docs and code on a single env var name, or clarify precedence.

Proposed options:

Update provider to look for CLAUDE_CODE_OAUTH_TOKEN (preferred for clarity with CLI OAuth).

Or, update this doc to reference CLAUDE_CODE_API_KEY if that’s what code actually honors.
Would you like a patch to wire getClient() to prefer CLAUDE_CODE_OAUTH_TOKEN?

Also applies to: 67-73

🧰 Tools

🪛 LanguageTool

[grammar] ~35-~35: Use correct spacing
Context: ...his order: 1. Environment Variable (optional): CLAUDE_CODE_OAUTH_TOKEN - Useful for CI/CD environments or when you want...

(QB_NEW_EN_OTHER_ERROR_IDS_5)

[grammar] ~36-~36: There might be a mistake here.
Context: ...r when you want to override the default token - Not required if you've set up the CLI to...

(QB_NEW_EN_OTHER)

[grammar] ~37-~37: There might be a mistake here.
Context: ...- Not required if you've set up the CLI token 2. Claude Code CLI Token (recommended): T...

(QB_NEW_EN_OTHER)

[grammar] ~40-~40: Insert the missing word
Context: ...tup-token` - Automatically used when available - Most convenient for local developm...

(QB_NEW_EN_OTHER_ERROR_IDS_32)

[grammar] ~41-~41: There might be a mistake here.
Context: ...vailable - Most convenient for local development 3. Fallback: Error if neither is availabl...

(QB_NEW_EN_OTHER)

🤖 Prompt for AI Agents

In docs/claude-code-integration.md around lines 35 to 41 and also lines 67 to 73, the environment variable name CLAUDE_CODE_OAUTH_TOKEN mentioned in the documentation does not match the environment variable CLAUDE_CODE_API_KEY used in the provider code. To fix this, either update the provider code to use CLAUDE_CODE_OAUTH_TOKEN for consistency with the CLI OAuth naming or revise the documentation to reference CLAUDE_CODE_API_KEY to align with the current implementation. Choose one naming convention and ensure both the code and docs consistently use it, clarifying any precedence if needed.

coderabbitai · 2025-08-10T04:50:55Z

docs/claude-code-integration.md

+3. **Test Task Master integration**:
+   ```bash
+   task-master models --test claude-code:sonnet
+   # Should successfully connect and test the model
+   ```


💡 Verification agent

🧩 Analysis chain

Verify CLI command existence: task-master models --test

The guide references a models --test subcommand. Confirm this exists; otherwise, replace with an existing health-check command or add the command.

🏁 Script executed:

#!/bin/bash # Search for a --test flag on the models command rg -n "models\s+--test" -- tests src scripts rg -n "command\(['\"]models['\"]" -A 20

Length of output: 1338

Replace the nonexistent --test flag with the interactive --setup command

The models --test option isn’t implemented in the CLI. Update the documentation (docs/claude-code-integration.md:147–151) to use the existing --setup command—or, if you intended to provide a health-check flag, implement --test in the models command.

Locations to update:

docs/claude-code-integration.md (lines 147–151)

Suggested diff:

- 3. **Test Task Master integration**: - ```bash - task-master models --test claude-code:sonnet - # Should successfully connect and test the model - ``` + 3. **Configure and verify Task Master integration**: + ```bash + task-master models --setup + # Follow the interactive prompts to configure your provider/model + # and confirm connectivity + ```

🧰 Tools

🪛 LanguageTool

[grammar] ~147-~147: There might be a mistake here.
Context: ... is valid 3. **Test Task Master integration**: bash task-master models --test claude-code:sonnet # Should successfully connect and test the model ``` 4. Check logs: - Task Master logs...

(QB_NEW_EN_OTHER)

🤖 Prompt for AI Agents

In docs/claude-code-integration.md around lines 147 to 151, the documentation incorrectly uses the nonexistent `--test` flag with the `task-master models` command. Replace the `--test` flag usage with the interactive `--setup` command and update the instructions to indicate following the interactive prompts to configure and verify the provider/model connectivity.

coderabbitai · 2025-08-10T04:50:55Z

docs/SPEC-models-dev-integration.md

+## Goals
+1. **Dynamic Model Discovery**: Replace static model lists with live data from models.dev API
+2. **Enhanced Model Information**: Provide richer model metadata (pricing, capabilities, context limits)
+3. **Zero Breaking Changes**: Maintain existing CLI interface and behavior
+4. **Graceful Degradation**: Fall back to static models if models.dev is unavailable
+5. **Performance**: Cache models.dev data with appropriate TTL to avoid API rate limits
+


🧹 Nitpick (assertive)

Minor wording: avoid tautology “CLI interface”

Use “CLI” instead of “CLI interface” and keep consistent capitalization (“API”, “TTL”), plus tidy spacing around headings.

-3. **Zero Breaking Changes**: Maintain existing CLI interface and behavior +3. **Zero Breaking Changes**: Maintain existing CLI behavior -5. **Performance**: Cache models.dev data with appropriate TTL to avoid API rate limits +5. **Performance**: Cache models.dev data with an appropriate TTL to avoid API rate limits

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

## Goals

1. **Dynamic Model Discovery**: Replace static model lists with live data from models.dev API

2. **Enhanced Model Information**: Provide richer model metadata (pricing, capabilities, context limits)

3. **Zero Breaking Changes**: Maintain existing CLI interface and behavior

4. **Graceful Degradation**: Fall back to static models if models.dev is unavailable

5. **Performance**: Cache models.dev data with appropriate TTL to avoid API rate limits

## Goals

1. **Dynamic Model Discovery**: Replace static model lists with live data from models.dev API

2. **Enhanced Model Information**: Provide richer model metadata (pricing, capabilities, context limits)

3. **Zero Breaking Changes**: Maintain existing CLI behavior

4. **Graceful Degradation**: Fall back to static models if models.dev is unavailable

5. **Performance**: Cache models.dev data with an appropriate TTL to avoid API rate limits

🧰 Tools

🪛 LanguageTool

[grammar] ~7-~7: Use proper capitalization
Context: ...el lists with live data from models.dev API 2. Enhanced Model Information: Provide ri...

(QB_NEW_EN_OTHER_ERROR_IDS_6)

[grammar] ~8-~8: There might be a mistake here.
Context: ...etadata (pricing, capabilities, context limits) 3. Zero Breaking Changes: Maintain existi...

(QB_NEW_EN_OTHER)

[style] ~9-~9: This phrase is redundant (‘I’ stands for ‘interface’). Use simply “CLI”.
Context: ...o Breaking Changes**: Maintain existing CLI interface and behavior 4. *Graceful Degradation...

(ACRONYM_TAUTOLOGY)

[grammar] ~9-~9: There might be a mistake here.
Context: ...**: Maintain existing CLI interface and behavior 4. Graceful Degradation: Fall back to sta...

(QB_NEW_EN_OTHER)

[grammar] ~10-~10: There might be a mistake here.
Context: ... back to static models if models.dev is unavailable 5. Performance: Cache models.dev data wit...

(QB_NEW_EN_OTHER)

[grammar] ~11-~11: There might be a mistake here.
Context: ... with appropriate TTL to avoid API rate limits ## Current State Analysis ### Existing Mod...

(QB_NEW_EN_OTHER)

🤖 Prompt for AI Agents

In docs/SPEC-models-dev-integration.md around lines 6 to 12, replace the phrase "CLI interface" with just "CLI" to avoid tautology, ensure consistent capitalization of terms like "API" and "TTL", and adjust spacing around headings for better formatting and readability.

coderabbitai · 2025-08-10T04:50:55Z

docs/SPEC-models-dev-integration.md

+#### Enhanced getAvailableModels()
+```typescript
+async function getAvailableModels(): Promise<EnhancedModel[]> {
+  try {
+    // Load static models (existing logic)
+    const staticModels = loadStaticModels();
+
+    // Load dynamic models from models.dev
+    const dynamicModels = await modelsDevService.fetchModels();
+
+    // Merge and deduplicate
+    return modelMerger.mergeStaticAndDynamic(staticModels, dynamicModels);
+  } catch (error) {
+    // Fallback to static models
+    console.warn('models.dev unavailable, using static models:', error.message);
+    return loadStaticModels();
+  }
+}
+```


💡 Verification agent

🧩 Analysis chain

Align function naming with implementation: use getAllAvailableModels()

Spec shows enhancing getAvailableModels(), but PR adds a new async getter getAllAvailableModels() and keeps getAvailableModels() unchanged for compatibility. Update the snippet to avoid confusion.

-#### Enhanced getAvailableModels() +#### New getAllAvailableModels() (Non-breaking enhancement) ```typescript -async function getAvailableModels(): Promise<EnhancedModel[]> { +async function getAllAvailableModels(): Promise<EnhancedModel[]> { try { // Load static models (existing logic) const staticModels = loadStaticModels(); // Load dynamic models from models.dev const dynamicModels = await modelsDevService.fetchModels(); // Merge and deduplicate return modelMerger.mergeStaticAndDynamic(staticModels, dynamicModels); } catch (error) { // Fallback to static models console.warn('models.dev unavailable, using static models:', error.message); return loadStaticModels(); } } +// getAvailableModels() remains for backward compatibility and returns static models.

Confirm doc examples elsewhere use getAllAvailableModels(). I can scan and patch remaining references.

🏁 Script executed:

#!/bin/bash set -e echo "Searching for 'getAllAvailableModels' in the codebase:" rg --color=never -n "getAllAvailableModels" || true echo echo "Searching for 'getAvailableModels' references in docs:" rg --color=never -n "getAvailableModels" docs || true echo echo "Searching for 'getAllAvailableModels' references in docs:" rg --color=never -n "getAllAvailableModels" docs || true

Length of output: 1836

Use getAllAvailableModels() in the spec to match the implementation

The docs currently describe enhancing getAvailableModels(), but the code adds a new non-breaking async getter getAllAvailableModels(). To avoid confusion, update the SPEC and all doc references:

• docs/SPEC-models-dev-integration.md (around line 141):

Change the section header and function name in the snippet.

Note that getAvailableModels() remains for backward compatibility.

• docs/SPEC-models-dev-integration.md (around line 217):

Update the checklist to reference getAllAvailableModels().

• All other docs references to getAvailableModels() → review and update where they describe the new dynamic-model flow.

Suggested diff for the snippet in SPEC-models-dev-integration.md:

-#### Enhanced getAvailableModels() +#### New getAllAvailableModels() (non-breaking enhancement) ```typescript -async function getAvailableModels(): Promise<EnhancedModel[]> { +async function getAllAvailableModels(): Promise<EnhancedModel[]> { try { // Load static models (existing logic) const staticModels = loadStaticModels(); // Load dynamic models from models.dev const dynamicModels = await modelsDevService.fetchModels(); // Merge and deduplicate return modelMerger.mergeStaticAndDynamic(staticModels, dynamicModels); } catch (error) { // Fallback to static models console.warn('models.dev unavailable, using static models:', error.message); return loadStaticModels(); } } +// Note: getAvailableModels() remains unchanged for backward compatibility and still returns only static models.

🧰 Tools

🪛 LanguageTool

[grammar] ~141-~141: Use correct spacing
Context: ...## 3. Integration Points #### Enhanced getAvailableModels() typescript async function getAvailableModels(): Promise<EnhancedModel[]> { try { // Load static models (existing logic) const staticModels = loadStaticModels(); // Load dynamic models from models.dev const dynamicModels = await modelsDevService.fetchModels(); // Merge and deduplicate return modelMerger.mergeStaticAndDynamic(staticModels, dynamicModels); } catch (error) { // Fallback to static models console.warn('models.dev unavailable, using static models:', error.message); return loadStaticModels(); } } #### Backward Compatibility Strategy 1. **Int...

(QB_NEW_EN_OTHER_ERROR_IDS_5)

🤖 Prompt for AI Agents

In docs/SPEC-models-dev-integration.md around lines 141 to 159, update the function name in the code snippet from getAvailableModels() to getAllAvailableModels() to match the actual implementation. Change the section header accordingly and add a note that getAvailableModels() remains unchanged for backward compatibility, returning only static models. Also, review and update all other references in this file and elsewhere in the docs where getAvailableModels() is mentioned in the context of the new dynamic model flow, replacing them with getAllAvailableModels() as appropriate.

coderabbitai · 2025-08-10T04:50:58Z

tests/integration/models-dev-integration.test.js

+// Skip tests if no internet connection or API is down
+const isOnline = process.env.NODE_ENV !== 'test' || process.env.TEST_MODELS_DEV === 'true';
+
+describe('Models.dev Integration', () => {
+	// Only run these tests if explicitly enabled or not in CI
+	const runTest = isOnline ? test : test.skip;
+


🧹 Nitpick (assertive)

Avoid network by default; good gating, but prefer full mocks

Guarding with TEST_MODELS_DEV is fine. For integration in CI, favor mocking service modules to keep tests deterministic and fast.

🤖 Prompt for AI Agents

In tests/integration/models-dev-integration.test.js around lines 12 to 18, the current setup conditionally skips tests based on environment variables but still relies on actual network calls. To fix this, replace or supplement the conditional test gating with full mocks of the external service modules used in the integration tests. This will avoid real network calls, making tests deterministic and faster, especially in CI environments.

coderabbitai · 2025-08-10T04:50:58Z

tests/integration/models-dev-integration.test.js

+	runTest('should handle dynamic model loading', async () => {
+		const enhanced = await getAllAvailableModels();
+
+		expect(Array.isArray(enhanced)).toBe(true);
+		expect(enhanced.length).toBeGreaterThan(0);
+
+		// Should have models from both sources
+		const sources = [...new Set(enhanced.map(m => m.source))];
+		expect(sources).toEqual(expect.arrayContaining(['static']));
+		// May also contain 'models.dev' if API is available
+	}, 30000);


🛠️ Refactor suggestion

Respect mock-first-then-import pattern

Per repo rules, declare mocks before importing the module under test. Convert static imports of getAllAvailableModels to dynamic import after mocks to ensure isolation.

🤖 Prompt for AI Agents

In tests/integration/models-dev-integration.test.js around lines 80 to 90, the test imports getAllAvailableModels statically before setting up mocks, violating the mock-first-then-import pattern. To fix this, remove the static import of getAllAvailableModels at the top, set up all necessary mocks first, then dynamically import getAllAvailableModels within the test or before running tests to ensure mocks are applied correctly and isolation is maintained.

coderabbitai · 2025-08-10T04:50:58Z

tests/integration/models-dev-integration.test.js

+	test('should fallback gracefully when API fails', async () => {
+		// Mock API failure
+		const originalFetch = global.fetch;
+		global.fetch = jest.fn().mockRejectedValue(new Error('Network error'));
+
+		try {
+			const enhanced = await getAllAvailableModels();
+
+			// Should still return models (may be cached or static)
+			expect(Array.isArray(enhanced)).toBe(true);
+			expect(enhanced.length).toBeGreaterThan(0);
+
+			// All models should have a source property
+			expect(enhanced.every(m => m.source)).toBe(true);
+		} finally {
+			global.fetch = originalFetch;
+		}
+	});


🧹 Nitpick (assertive)

🛠️ Refactor suggestion

Mock the service module, not global.fetch

modelsDevService may not use global.fetch. Mock the module before imports (ESM) to force an error and validate fallback.

Use ESM pre-mock + dynamic import:

// At top of file (before any imports that pull config-manager) import { jest } from '@jest/globals'; jest.unstable_mockModule('../../src/services/models-dev-service.js', () => ({ modelsDevService: { fetchModels: jest.fn(async () => { throw new Error('Network error'); }), getProviderModels: jest.fn(async () => { throw new Error('Network error'); }), searchModels: jest.fn(async () => { throw new Error('Network error'); }), } })); const { getAllAvailableModels } = await import('../../scripts/modules/config-manager.js'); test('should fallback gracefully when API fails', async () => { const enhanced = await getAllAvailableModels(); expect(Array.isArray(enhanced)).toBe(true); expect(enhanced.length).toBeGreaterThan(0); expect(enhanced.every(m => m.source)).toBe(true); });

I can refactor the file to this pattern if you prefer.

🤖 Prompt for AI Agents

In tests/integration/models-dev-integration.test.js around lines 92 to 109, the test mocks global.fetch which may not affect the actual API calls since modelsDevService likely does not use global.fetch directly. Instead, mock the modelsDevService module itself before importing the module under test using Jest's unstable_mockModule for ESM support. Replace the global.fetch mock with a pre-import mock of modelsDevService's methods to throw errors, then dynamically import getAllAvailableModels and run the test to verify fallback behavior.

coderabbitai · 2025-08-10T04:50:58Z

tests/unit/ai-providers/claude-code.test.js

+  createClaudeCode: jest.fn(() => {
+    const provider = (modelId, settings) => ({
+      // Minimal mock language model surface
+      id: modelId,
+      settings,
+      doGenerate: jest.fn(async () => ({ text: 'ok', usage: {} })),
+      doStream: jest.fn(async () => ({ stream: true }))
+    });
+    provider.languageModel = jest.fn((id, settings) => ({ id, settings }));
+    provider.chat = provider.languageModel;
+    return provider;
+  })
+}));


🛠️ Refactor suggestion

Make mock methods synchronous to comply with test guidelines

Tests should avoid async ops. These mocks can be synchronous without changing expectations.

- doGenerate: jest.fn(async () => ({ text: 'ok', usage: {} })), - doStream: jest.fn(async () => ({ stream: true })) + doGenerate: jest.fn(() => ({ text: 'ok', usage: {} })), + doStream: jest.fn(() => ({ stream: true }))

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

createClaudeCode: jest.fn(() => {

const provider = (modelId, settings) => ({

// Minimal mock language model surface

id: modelId,

settings,

doGenerate: jest.fn(async () => ({ text: 'ok', usage: {} })),

doStream: jest.fn(async () => ({ stream: true }))

});

provider.languageModel = jest.fn((id, settings) => ({ id, settings }));

provider.chat = provider.languageModel;

return provider;

})

}));

createClaudeCode: jest.fn(() => {

const provider = (modelId, settings) => ({

// Minimal mock language model surface

id: modelId,

settings,

doGenerate: jest.fn(() => ({ text: 'ok', usage: {} })),

doStream: jest.fn(() => ({ stream: true }))

});

provider.languageModel = jest.fn((id, settings) => ({ id, settings }));

provider.chat = provider.languageModel;

return provider;

})

}));

🤖 Prompt for AI Agents

In tests/unit/ai-providers/claude-code.test.js between lines 5 and 17, the mock methods doGenerate and doStream are asynchronous but should be synchronous to comply with test guidelines. Change these jest.fn implementations to return their results synchronously instead of using async functions, ensuring the mocks still return the expected objects without using async/await.

coderabbitai · 2025-08-10T04:50:58Z

tests/unit/ai-providers/claude-code.test.js

 		it('should handle client initialization errors', async () => {
 			// Force an error by making createClaudeCode throw
 			const { createClaudeCode } = await import(
-				'../../../src/ai-providers/custom-sdk/claude-code/index.js'
+				'ai-sdk-provider-claude-code'
 			);
 			createClaudeCode.mockImplementationOnce(() => {
 				throw new Error('Mock initialization error');
 			});

 			// Create a new provider instance to use the mocked createClaudeCode
 			const errorProvider = new ClaudeCodeProvider();
-			expect(() => errorProvider.getClient({})).toThrow(
-				'Mock initialization error'
-			);
+			expect(() => errorProvider.getClient({})).toThrow('Mock initialization error');
 		});


🛠️ Refactor suggestion

Avoid async in test body; import the mocked module in setup and keep the test synchronous

Per guidelines, avoid async in tests. Move the dynamic import of the mocked module to beforeAll and perform mockImplementationOnce in the test.

- it('should handle client initialization errors', async () => { - const { createClaudeCode } = await import('ai-sdk-provider-claude-code'); - createClaudeCode.mockImplementationOnce(() => { - throw new Error('Mock initialization error'); - }); - const errorProvider = new ClaudeCodeProvider(); - expect(() => errorProvider.getClient({})).toThrow('Mock initialization error'); - }); + let mockedCreate; + beforeAll(async () => { + ({ createClaudeCode: mockedCreate } = await import('ai-sdk-provider-claude-code')); + }); + it('should handle client initialization errors', () => { + mockedCreate.mockImplementationOnce(() => { throw new Error('Mock initialization error'); }); + const errorProvider = new ClaudeCodeProvider(); + expect(() => errorProvider.getClient({})).toThrow('Mock initialization error'); + });

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

it('should handle client initialization errors', async () => {

// Force an error by making createClaudeCode throw

const { createClaudeCode } = await import(

'../../../src/ai-providers/custom-sdk/claude-code/index.js'

'ai-sdk-provider-claude-code'

);

createClaudeCode.mockImplementationOnce(() => {

throw new Error('Mock initialization error');

});

// Create a new provider instance to use the mocked createClaudeCode

const errorProvider = new ClaudeCodeProvider();

expect(() => errorProvider.getClient({})).toThrow(

'Mock initialization error'

);

expect(() => errorProvider.getClient({})).toThrow('Mock initialization error');

});

let mockedCreate;

beforeAll(async () => {

({ createClaudeCode: mockedCreate } = await import('ai-sdk-provider-claude-code'));

});

it('should handle client initialization errors', () => {

mockedCreate.mockImplementationOnce(() => {

throw new Error('Mock initialization error');

});

const errorProvider = new ClaudeCodeProvider();

expect(() => errorProvider.getClient({})).toThrow('Mock initialization error');

});

🤖 Prompt for AI Agents

In tests/unit/ai-providers/claude-code.test.js around lines 111 to 123, the test uses an async function and dynamically imports the mocked module inside the test body. To fix this, move the import of 'ai-sdk-provider-claude-code' to a beforeAll hook at the top of the test suite, so it runs once before tests. Then, keep the test function synchronous and only perform the mockImplementationOnce inside the test. This avoids async in the test body and follows best practices for mocking.

Crunchyman-ralph

watch out, this PR is integrating changes from this PR which are separate concerns: 1114

Crunchyman-ralph · 2025-09-24T09:09:31Z

@tommy-ca mind removing the changes related to the #1114 PR so that they are separate PR and no confusion between them ? Else I'll have to close this PR and make a fresh one over your changes, have to do a whole different process

Crunchyman-ralph · 2026-01-11T19:38:06Z

Closing as stale, but I like the base of this @claude can you make a separate fresh PR based on next that tries to do this, pelase be clean, try to write in typescript if you can. make sure it goes where it needs to (apps/cli or packages/tm-bridge ?)

Please create a good PR description, title, and make sure to run npm run format before you push and create your PR

claude · 2026-01-11T19:38:17Z

Claude encountered an error —— View job

Command failed: git fetch origin --depth=20 feature/models-dev-minimal

I'll analyze this and get back to you.

tommy-ca and others added 3 commits August 10, 2025 04:01

chore: make test script executable

540354b

tommy-ca changed the base branch from main to next August 10, 2025 03:46

tommy-ca and others added 2 commits August 10, 2025 06:10

tommy-ca force-pushed the feature/models-dev-minimal branch from 956d088 to 86b9d4c Compare August 10, 2025 04:33

tommy-ca force-pushed the feature/models-dev-minimal branch from 86b9d4c to 2bdc226 Compare August 10, 2025 04:36

coderabbitai bot reviewed Aug 10, 2025

View reviewed changes

Crunchyman-ralph requested changes Sep 1, 2025

View reviewed changes

Conversation

tommy-ca commented Aug 10, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Implementation Details

Key Features

Files Changed (7 total)

Summary by CodeRabbit

Uh oh!

changeset-bot bot commented Aug 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

⚠️ No Changeset found

Uh oh!

coderabbitai bot commented Aug 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Chat

Support

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

Documentation and Community

Uh oh!

tommy-ca commented Aug 10, 2025

🔧 Latest Updates - Bug Fixes & Refinements

Recent commits added important fixes:

✅ Fix: Resolve empty log lines and claude-code model availability (86b9d4c)

✅ All Tests Passing

Uh oh!

tommy-ca commented Aug 10, 2025

🧹 Cleaned Up Commit

✅ What's Included:

❌ What's Excluded:

Uh oh!

tommy-ca commented Aug 10, 2025

🎛️ New Feature: Provider & Cost Filters

✨ New Filter Options:

🔍 Provider Filters:

💰 Cost Filters:

🚀 Key Features:

📋 Usage Examples:

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Aug 10, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Aug 10, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Aug 10, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Aug 10, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Aug 10, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Aug 10, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Aug 10, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Aug 10, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Aug 10, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Aug 10, 2025

Choose a reason for hiding this comment

Uh oh!

Crunchyman-ralph left a comment

Choose a reason for hiding this comment

tommy-ca commented Aug 10, 2025 •

edited by coderabbitai bot

Loading

changeset-bot bot commented Aug 10, 2025 •

edited

Loading

coderabbitai bot commented Aug 10, 2025 •

edited

Loading

✅ Fix: Resolve empty log lines and claude-code model availability (`86b9d4c`)

claude bot commented Jan 11, 2026 •

edited

Loading