feat: Add OpenAI Responses API Integration #4248
VedantMadane wants to merge 7 commits into crewAIInc:main
Conversation
Implements native support for OpenAI's Responses API (/v1/responses) as a new LLM provider in CrewAI, addressing feature request crewAIInc#4152. The Responses API offers advantages for agent workflows including:

- Simpler input format (strings or structured input vs message arrays)
- Built-in conversation management via previous_response_id
- Native support for o-series reasoning models (reasoning_effort param)
- Cleaner function calling semantics

Usage:

```python
# Option 1: Using provider parameter
llm = LLM(model='gpt-4o', provider='openai_responses')

# Option 2: Using model prefix
llm = LLM(model='openai_responses/gpt-4o')

# With o-series reasoning models
llm = LLM(model='o3-mini', provider='openai_responses', reasoning_effort='high')
```

Key implementation details:

- New OpenAIResponsesCompletion class extending BaseLLM (~850 lines)
- Message conversion: system messages -> instructions param, others -> input
- Tool calling support with strict: true by default
- Streaming support (sync and async)
- Stateful conversations via previous_response_id

Closes crewAIInc#4152
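For readers skimming the diff, the message-conversion step described above amounts to something like the following sketch. The helper name `split_messages` is illustrative only; the PR's actual logic lives inside OpenAIResponsesCompletion and may differ in shape and details.

```python
def split_messages(messages: list[dict]) -> tuple[str | None, list[dict]]:
    """Route system messages to `instructions`, everything else to `input`.

    Illustrative sketch of the conversion idea, not the PR's real code.
    """
    system_parts = [m["content"] for m in messages if m["role"] == "system"]
    instructions = "\n".join(system_parts) or None
    input_items = [m for m in messages if m["role"] != "system"]
    return instructions, input_items
```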
Ensure async call paths run before/after hooks with the original messages and skip tool execution when argument parsing fails, matching streaming behavior.
Fixed Bugbot findings:
Pushed in 57843a2.
greysonlalonde left a comment:
Hey @VedantMadane, thanks! This is super helpful. For usage, can we refactor away from passing openai_responses as a provider? I think there is some confusion that could arise there.
@greysonlalonde Models like o1, o3, and gpt-4o could default to the Responses API. I'm picking strict validation over auto-correction based on model capabilities, or a graceful fallback with a warning. For contrast, the embeddings system in CrewAI doesn't face this problem: OpenAI currently has only one embeddings API (/v1/embeddings), so there's no equivalent confusion to the LLM case.
That'll be a breaking change; we need to provide a deprecation notice prior to doing that. I'm all for being explicit, but it's better as a flag or as an additional options key - the current provider/model syntax doesn't really align with a provider_api/model scheme.
LLM(model="gpt-4o", provider="openai", api="responses") (with api="chat" as the default when none is specified), or LLM(model="gpt-4o", provider="openai", options={"api": "responses"})?
Removed the openai_responses provider alias - it now raises an error with guidance toward the explicit syntax. All tests have been updated to use the new api='responses' flag.
The previous commit already supports the explicit syntax. Implemented CrewAI's preferred deprecation approach (no hard break), per CrewAI's conventions: warnings.warn(..., DeprecationWarning, stacklevel=2).
@greysonlalonde Agreed on avoiding a breaking change here. I updated the PR so the legacy provider="openai_responses" and the openai_responses/ prefix continue to work for now, but they emit a DeprecationWarning (stacklevel=2) directing users to the explicit syntax provider="openai", api="responses". The default remains unchanged (no implicit model-based switching), and conflicting combinations like provider="openai_responses", api="chat" still error to keep things explicit. This keeps backwards compatibility today while giving a clear migration path before any removal.
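For the record, the shim behaves roughly like the sketch below. The helper name `_resolve_provider_api` is hypothetical; the PR's actual routing code differs, but the deprecation and conflict semantics described above are what it implements.

```python
import warnings

def _resolve_provider_api(provider: str, api: str | None) -> tuple[str, str]:
    # Hypothetical helper illustrating the deprecation path described above.
    if provider == "openai_responses":
        if api == "chat":
            # Conflicting combination stays a hard error to keep intent explicit.
            raise ValueError(
                "provider='openai_responses' conflicts with api='chat'; "
                "use provider='openai', api='responses' instead."
            )
        warnings.warn(
            "provider='openai_responses' is deprecated; "
            "use provider='openai', api='responses' instead.",
            DeprecationWarning,
            stacklevel=2,
        )
        return "openai", "responses"
    return provider, api or "chat"
```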
Cursor Bugbot has reviewed your changes and found 2 potential issues.
Hi @VedantMadane, we ended up implementing the Responses API in a different PR here - we had an urgent need for it. We really appreciate the work you put in on this.
No worries at all, glad I could be of help even if just for brainstorming. |
Summary
Implements native support for OpenAI's Responses API (/v1/responses) as a new LLM provider in CrewAI, addressing feature request #4152.
The Responses API offers significant advantages over the traditional Chat Completions API for agent-based workflows:
Key Benefits
- Simpler input format: plain strings or structured input instead of message arrays
- Built-in conversation management via previous_response_id (illustrated below)
- Native reasoning_effort parameter for o1/o3/o4 models
- Cleaner function calling semantics
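To make the conversation-management benefit concrete, here is what stateful chaining looks like against the raw OpenAI SDK. This illustrates the underlying API, not CrewAI's wrapper.

```python
from openai import OpenAI

client = OpenAI()

first = client.responses.create(model="gpt-4o", input="Summarize the state of agent frameworks.")
follow_up = client.responses.create(
    model="gpt-4o",
    input="Condense that to one sentence.",
    # The API threads conversation state server-side; no message array needed.
    previous_response_id=first.id,
)
print(follow_up.output_text)
```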
Usage
```python
from crewai import Agent, LLM

# Option 1: Using provider parameter
llm = LLM(model='gpt-4o', provider='openai_responses')

# Option 2: Using model prefix
llm = LLM(model='openai_responses/gpt-4o-mini')

# With o-series reasoning models
llm = LLM(model='o3-mini', provider='openai_responses', reasoning_effort='high')

# Works with all CrewAI components
agent = Agent(
    role='Research Analyst',
    goal='Find and summarize information',
    backstory='Expert researcher',
    llm=llm,
)
```
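Since the implementation defaults tool calls to strict: true, here is a sketch of the flattened tool schema the Responses API expects, shown against the raw OpenAI SDK rather than this PR's wrapper:

```python
from openai import OpenAI

client = OpenAI()

# The Responses API uses a flattened function schema (no nested "function" key,
# unlike Chat Completions); strict mode enforces the JSON schema exactly.
tools = [{
    "type": "function",
    "name": "get_weather",
    "description": "Get the current weather for a city.",
    "strict": True,
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
        "additionalProperties": False,
    },
}]

resp = client.responses.create(model="gpt-4o", input="Weather in Paris?", tools=tools)
```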
Implementation Details
Files Changed
Testing
Added comprehensive unit tests covering:
Checklist
Closes #4152
Note
Adds native support for OpenAI's Responses API and updates provider routing to select it via provider='openai' with api='responses' (with deprecation warnings for the openai_responses/ prefix and provider='openai_responses').

- OpenAIResponsesCompletion with sync/async calls, streaming, tool calling (strict mode), structured output via JSON schema, stateful previous_response_id, and o-series reasoning support
- Provider routing (LLM): resolves provider + api, validates models/patterns, supports openai_responses in the native registry, and filters provider/api from kwargs; expanded context-pattern checks and supported lists (including Bedrock/Azure nuances)
- Message conversion: system → instructions, user/assistant → input; manual stop-word handling for Responses API

Written by Cursor Bugbot for commit 3273d0a.
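The "manual stop-word handling" item reflects that the note implies /v1/responses lacks a native stop parameter, so truncation happens client-side. A minimal sketch of that idea, with an illustrative function name rather than the PR's actual helper:

```python
def truncate_at_stop(text: str, stop_words: list[str]) -> str:
    # Illustrative sketch: cut the completion at the earliest stop sequence,
    # emulating Chat Completions' `stop` behavior on the client.
    cut = len(text)
    for word in stop_words:
        idx = text.find(word)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]

# Example: truncate_at_stop("Final Answer: 42\nObservation: ...", ["\nObservation:"])
# returns "Final Answer: 42"
```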