Fix bug where empty messages might be created in the Anthropic model #1027

oscar-broman · 2025-03-02T07:15:13Z

This PR fixes an issue where empty messages are created.

The error happens when the function _map_message runs with this input:

[
    ModelRequest(parts=[SystemPromptPart(content='<my system prompt>', dynamic_ref=None, part_kind='system-prompt')], kind='request'),
    ModelResponse(parts=[TextPart(content='<model response>', part_kind='text')], model_name=None, timestamp=datetime.datetime(2025, 3, 2, 7, 10, 12, 560121, tzinfo=datetime.timezone.utc), kind='response'),
    ModelRequest(parts=[UserPromptPart(content="<user message>", timestamp=datetime.datetime(2025, 3, 2, 7, 10, 12, 566706, tzinfo=datetime.timezone.utc), part_kind='user-prompt')], kind='request'),
]

The following error is eventually returned from the Anthropic API:

{'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'messages.0: all messages must have non-empty content except for the optional final assistant message'}

Kludex · 2025-03-02T20:10:15Z

Can you provide an MRE, please?

oscar-broman · 2025-03-03T07:02:42Z

from anthropic import AsyncAnthropic
from openai import AsyncOpenAI
from pydantic_ai import Agent
from pydantic_ai.messages import ModelRequest, SystemPromptPart, TextPart, ModelResponse, UserPromptPart
from pydantic_ai.models.anthropic import AnthropicModel
from pydantic_ai.models.openai import OpenAIModel

async def test_anthropic_issue() -> None:
    openai_key = ""
    anthropic_key = ""

    models = [
        ("OpenAI", OpenAIModel(model_name="gpt-4o-mini", openai_client=AsyncOpenAI(api_key=openai_key))),
        ("Anthropic", AnthropicModel(model_name="claude-3-7-sonnet-latest", anthropic_client=AsyncAnthropic(api_key=anthropic_key))),
    ]


    for name, model in models:
        print(f"\n\nModel: {name}")

        agent = Agent(
            model=model,
        )

        messages = [
            ModelRequest(parts=[SystemPromptPart("Respond with the user's last message in upper-case")]),
            # ModelRequest(parts=[UserPromptPart(content="Hello")]), <-- Issue does not happen if uncommented
            ModelResponse(parts=[TextPart(content="HELLO")]),
        ]

        await agent.run(
            user_prompt="Test",
            message_history=messages,
            result_type=str,
        )

alexmojaki · 2025-03-03T13:41:56Z

Please convert the MRE into a test

oscar-broman · 2025-03-04T05:30:18Z

I've tried to do this without any other changes in the codebase -- it's quite hard. The function is an internal function not accessible without some indirections.

One way would be to create a live test, but this seems a bit extreme perhaps.

@alexmojaki What would you suggest I do?

alexmojaki · 2025-03-04T08:26:51Z

OK then I think it's fine unless @Kludex has some ideas.

Kludex · 2025-03-04T08:53:13Z

I'll create a test with VCR.

mike-luabase · 2025-05-27T14:29:52Z

@Kludex I'm hitting this error on most of our attempts to use Claude 3.7

Co-authored-by: Marcelo Trylesinski <marcelotryle@gmail.com>

Fix bug where empty messages might be created in the Anthropic model

4c8644b

Kludex added the awaiting author revision label Mar 2, 2025

alexmojaki requested a review from Kludex March 4, 2025 08:26

DouweM assigned Kludex Apr 29, 2025

DouweM marked this pull request as draft April 30, 2025 21:28

DouweM removed the awaiting author revision label Apr 30, 2025

Kludex added 2 commits May 27, 2025 17:35

Merge remote-tracking branch 'origin/main' into patch-1

69a8c69

Add a potato test

f7f3abf

Kludex approved these changes May 27, 2025

View reviewed changes

Kludex marked this pull request as ready for review May 27, 2025 15:48

Kludex enabled auto-merge (squash) May 27, 2025 15:48

Kludex merged commit 3b12530 into pydantic:main May 27, 2025
16 checks passed

DouweM mentioned this pull request May 29, 2025

Anthropic models failing by injecting phantom empty message #1854

Closed

2 tasks

Kludex added a commit that referenced this pull request May 30, 2025

Don't send empty messages to Anthropic (#1027)

cb4e539

Co-authored-by: Marcelo Trylesinski <marcelotryle@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix bug where empty messages might be created in the Anthropic model #1027

Fix bug where empty messages might be created in the Anthropic model #1027

Uh oh!

oscar-broman commented Mar 2, 2025

Uh oh!

Kludex commented Mar 2, 2025

Uh oh!

oscar-broman commented Mar 3, 2025

Uh oh!

alexmojaki commented Mar 3, 2025

Uh oh!

oscar-broman commented Mar 4, 2025

Uh oh!

alexmojaki commented Mar 4, 2025

Uh oh!

Kludex commented Mar 4, 2025

Uh oh!

mike-luabase commented May 27, 2025

Uh oh!

Uh oh!

Uh oh!

Fix bug where empty messages might be created in the Anthropic model #1027

Fix bug where empty messages might be created in the Anthropic model #1027

Uh oh!

Conversation

oscar-broman commented Mar 2, 2025

Uh oh!

Kludex commented Mar 2, 2025

Uh oh!

oscar-broman commented Mar 3, 2025

Uh oh!

alexmojaki commented Mar 3, 2025

Uh oh!

oscar-broman commented Mar 4, 2025

Uh oh!

alexmojaki commented Mar 4, 2025

Uh oh!

Kludex commented Mar 4, 2025

Uh oh!

mike-luabase commented May 27, 2025

Uh oh!

Uh oh!

Uh oh!