Conversation

@nirga (Member) commented Aug 29, 2025

  • I have added tests that cover my changes.
  • If adding a new instrumentation or changing an existing one, I've added screenshots from some observability platform showing the change.
  • PR name follows conventional commits format: feat(instrumentation): ... or fix(instrumentation): ....
  • (If applicable) I have updated the documentation accordingly.

Important

Adds JSON serialization for dictionary content in OpenAI agent spans and tests this functionality.

  • Behavior:
    • In _hooks.py, on_span_end() now serializes dictionary content in message.content to JSON strings before setting attributes (see the sketch after this list).
    • Handles both ResponseSpanData and GenerationSpanData types.
  • Tests:
    • Adds test_dict_content_serialization() in test_openai_agents.py to verify dictionary content is serialized to JSON strings.
    • Includes a new VCR cassette test_dict_content_serialization.yaml for testing serialized content.
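
A minimal sketch of the behavior described above, assuming messages arrive either as objects exposing a `.content` attribute or as plain dicts (the helper name is illustrative; the actual change inlines this logic in `on_span_end()`):

```python
import json


def _serialize_message_content(message):
    # Messages arrive either as objects exposing .content or as plain dicts.
    if hasattr(message, "content"):
        content = message.content
    else:
        content = message.get("content")
    # Span attribute values must be primitives, so dict content
    # is serialized to a JSON string before being set on the span.
    if isinstance(content, dict):
        content = json.dumps(content)
    return content


# Illustrative usage inside the hook:
# span.set_attribute(f"gen_ai.prompt.{i}.content", _serialize_message_content(message))
```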

This description was created by Ellipsis for 4ef8bd7. You can customize this summary. It will automatically update as commits are pushed.

Summary by CodeRabbit

  • Bug Fixes

    • Prompt message content is now consistently serialized to strings in telemetry spans, preventing issues when messages include structured/dict content and improving compatibility with tracing backends.
  • Tests

    • Added a deterministic test and recorded cassette to verify correct serialization of structured prompt content during agent runs without network access.


coderabbitai bot commented Aug 29, 2025

Note

Other AI code review bot(s) detected

CodeRabbit has detected other AI code review bot(s) in this pull request and will avoid duplicating their findings in the review comments. This may lead to a less comprehensive review.

Walkthrough

Serializes non-string prompt message content to JSON strings in on_span_end for both object messages (with .content) and dict messages, across LLM_PROMPTS, gen_ai.prompt.*, and legacy fallback paths. Adds a unit test and a VCR cassette to validate dict content serialization.

Changes

| Cohort / File(s) | Summary |
| --- | --- |
| Instrumentation: on_span_end serialization<br>`packages/opentelemetry-instrumentation-openai-agents/opentelemetry/instrumentation/openai_agents/_hooks.py` | When recording input_data messages, non-string content on message objects (`message.content`) is JSON-serialized; when messages are dicts, dict-type `message['content']` is JSON-serialized. Applies to `SpanAttributes.LLM_PROMPTS`, `gen_ai.prompt.{i}.content`, and the legacy fallback branch. No public API changes. |
| Tests & Fixtures<br>`packages/opentelemetry-instrumentation-openai-agents/tests/test_openai_agents.py`, `packages/opentelemetry-instrumentation-openai-agents/tests/cassettes/test_openai_agents/test_dict_content_serialization.yaml` | Adds `test_dict_content_serialization`, ensuring prompt/content span attributes contain strings (JSON where appropriate); a sketch of the assertion pattern follows the table. Adds a VCR cassette capturing a `/v1/responses` interaction for deterministic replay. |
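
A minimal sketch of the assertion pattern in test_dict_content_serialization, assuming the shared `exporter` fixture from `conftest.py` (the loose attribute filter shown here is the one a reviewer later suggests tightening; the actual test body may differ):

```python
import json


def test_dict_content_serialization(exporter):
    # ...agent run with dict-shaped message content, replayed from the VCR cassette...
    for span in exporter.get_finished_spans():
        for attr_name, attr_value in span.attributes.items():
            if "prompt" in attr_name and "content" in attr_name:
                # Every prompt content attribute must already be a string.
                assert isinstance(attr_value, str)
                # If it looks like JSON, verify it parses.
                if attr_value.startswith("{") and attr_value.endswith("}"):
                    json.loads(attr_value)
```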

Sequence Diagram(s)

```mermaid
sequenceDiagram
  autonumber
  participant Runner
  participant Hook as on_span_end Hook
  participant Span

  Runner->>Hook: on_span_end(span, input_data)
  Note over Hook: iterate over input_data.messages
  alt message is object with .content
    Hook->>Hook: content = message.content\nif not str -> json.dumps(content)
  else message is dict
    Hook->>Hook: content = message['content']\nif dict -> json.dumps(content)
  else
    Hook->>Hook: use content as-is
  end
  Hook->>Span: setAttribute(LLM_PROMPTS / gen_ai.prompt.{i}.content, serialized_content)
  Span-->>Runner: span finished with stringified prompt attributes
```

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs

Poem

I nibbled bytes beneath the moon,
Turned curly dicts to tidy tune.
Spans now carry stringy cheer,
Tests replay the call so clear.
Hop, hop—JSON crumbs disappear. 🐇✨


📜 Recent review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

💡 Knowledge Base configuration:

  • MCP integration is disabled by default for public repositories
  • Jira integration is disabled by default for public repositories
  • Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 4ef8bd7 and 0790af4.

📒 Files selected for processing (1)
  • packages/opentelemetry-instrumentation-openai-agents/opentelemetry/instrumentation/openai_agents/_hooks.py (2 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
  • packages/opentelemetry-instrumentation-openai-agents/opentelemetry/instrumentation/openai_agents/_hooks.py
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (4)
  • GitHub Check: Test Packages (3.12)
  • GitHub Check: Test Packages (3.11)
  • GitHub Check: Build Packages (3.11)
  • GitHub Check: Lint

@nirga force-pushed the fix-agents-json-input branch from 23db276 to 4ef8bd7 on August 29, 2025 14:07
@ellipsis-dev bot (Contributor) left a comment

Caution

Changes requested ❌

Reviewed 4ef8bd7 in 1 minute and 40 seconds.
  • Reviewed 243 lines of code in 3 files
  • Skipped 0 files when reviewing.
  • Skipped posting 0 draft comments. View those below.
  • Modify your settings and rules to customize what types of comments Ellipsis leaves. And don't forget to react with 👍 or 👎 to teach Ellipsis.

Workflow ID: wflow_7xAGeunaRdcztHp2

You can customize Ellipsis by changing your verbosity settings, reacting with 👍 or 👎, replying to comments, or adding code review rules.

@coderabbitai bot left a comment

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
packages/opentelemetry-instrumentation-openai-agents/opentelemetry/instrumentation/openai_agents/_hooks.py (1)

320-331: Normalize tool call arguments to string as well.

output.arguments may be dict/array; set_attribute requires primitives/str. Serialize consistently to avoid type rejection.

```diff
-                                arguments = getattr(output, 'arguments', '{}')
+                                arguments = _to_attr_str(getattr(output, 'arguments', {}))
```
♻️ Duplicate comments (1)
packages/opentelemetry-instrumentation-openai-agents/opentelemetry/instrumentation/openai_agents/_hooks.py (1)

239-242: Serialize all non-string content (lists, tuples, etc.), not just dicts; dedupe with a helper.

Inputs frequently contain lists (e.g., multimodal content arrays). Serializing only dicts leaves lists (or other types) as invalid attribute values. Replace with a generic converter and reuse it in both new and legacy paths to avoid drift.

Apply this diff in each location:

```diff
-                            content = message.content
-                            if isinstance(content, dict):
-                                content = json.dumps(content)
+                            content = _to_attr_str(message.content)
-                                content = message['content']
-                                if isinstance(content, dict):
-                                    content = json.dumps(content)
+                                content = _to_attr_str(message['content'])
```

And in the legacy block:

```diff
-                            content = message.content
-                            if isinstance(content, dict):
-                                content = json.dumps(content)
+                            content = _to_attr_str(message.content)
-                                content = message['content']
-                                if isinstance(content, dict):
-                                    content = json.dumps(content)
+                                content = _to_attr_str(message['content'])
```

Add this helper (top-level or as a @staticmethod on the class):

```python
import json
from typing import Any


def _to_attr_str(value: Any) -> str:
    if isinstance(value, str):
        return value
    try:
        # Stable, compact JSON for any non-string (lists, dicts, numbers, booleans, None)
        return json.dumps(value, ensure_ascii=False, separators=(",", ":"), allow_nan=False)
    except (TypeError, ValueError):
        # Last resort: stringify without raising
        return str(value)
```

Also applies to: 246-249, 374-377, 381-384
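
For illustration, the helper above would behave roughly as follows on common input shapes (expected values shown as comments, not captured output):

```python
_to_attr_str("hello")                           # 'hello' (strings pass through)
_to_attr_str({"type": "text"})                  # '{"type":"text"}' (compact JSON)
_to_attr_str([{"type": "text", "text": "hi"}])  # '[{"type":"text","text":"hi"}]'
_to_attr_str(float("nan"))                      # 'nan' (allow_nan=False raises, so it falls back to str)
```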

🧹 Nitpick comments (3)
packages/opentelemetry-instrumentation-openai-agents/opentelemetry/instrumentation/openai_agents/_hooks.py (1)

251-279: Optional: cap very large attribute payloads.

Long prompts/JSON can exceed exporter limits and be dropped. Consider truncating serialized strings to a configurable max length (e.g., 16–64KB) with an “…truncated” marker.
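
A minimal sketch of that idea; the helper name and the 16KB default are illustrative, not part of the reviewed code:

```python
MAX_ATTR_LEN = 16 * 1024  # illustrative default; would be configurable in practice


def _truncate_attr(value: str, limit: int = MAX_ATTR_LEN) -> str:
    # Cap serialized payloads so exporters don't silently drop oversized attributes.
    if len(value) <= limit:
        return value
    return value[:limit] + "…truncated"
```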

packages/opentelemetry-instrumentation-openai-agents/tests/test_openai_agents.py (2)

82-88: Use contextlib.suppress and handle JSON arrays too.

Covers []-prefixed JSON and removes try/except noise.

```diff
-                # If it looks like JSON, verify it can be parsed
-                if attr_value.startswith('{') and attr_value.endswith('}'):
-                    try:
-                        json.loads(attr_value)
-                    except json.JSONDecodeError:
-                        # If it fails to parse, that's still fine - just not JSON
-                        pass
+                # If it looks like JSON, verify it can be parsed
+                from contextlib import suppress
+                s = attr_value.strip()
+                if s and s[0] in "{[" and s[-1] in "}]":
+                    with suppress(json.JSONDecodeError):
+                        json.loads(s)
```

67-75: Optional: tighten attribute filter.

"prompt" in attr_name may match unrelated keys; prefer explicit prefixes for stability.

```diff
-            prompt_content_check = (
-                ("prompt" in attr_name and "content" in attr_name) or
-                ("gen_ai.prompt" in attr_name and "content" in attr_name)
-            )
+            prompt_content_check = (
+                (attr_name.startswith(f"{SpanAttributes.LLM_PROMPTS}.") and ".content" in attr_name)
+                or (attr_name.startswith("gen_ai.prompt.") and ".content" in attr_name)
+            )
```

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

💡 Knowledge Base configuration:

  • MCP integration is disabled by default for public repositories
  • Jira integration is disabled by default for public repositories
  • Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between bdcd2fa and 4ef8bd7.

📒 Files selected for processing (3)
  • packages/opentelemetry-instrumentation-openai-agents/opentelemetry/instrumentation/openai_agents/_hooks.py (2 hunks)
  • packages/opentelemetry-instrumentation-openai-agents/tests/cassettes/test_openai_agents/test_dict_content_serialization.yaml (1 hunks)
  • packages/opentelemetry-instrumentation-openai-agents/tests/test_openai_agents.py (1 hunks)
🧰 Additional context used
📓 Path-based instructions (2)
**/cassettes/**/*.{yaml,yml,json}

📄 CodeRabbit inference engine (CLAUDE.md)

Never commit secrets or PII in VCR cassettes; scrub sensitive data

Files:

  • packages/opentelemetry-instrumentation-openai-agents/tests/cassettes/test_openai_agents/test_dict_content_serialization.yaml
**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

**/*.py: Store API keys only in environment variables/secure vaults; never hardcode secrets in code
Use Flake8 for code linting and adhere to its rules

Files:

  • packages/opentelemetry-instrumentation-openai-agents/tests/test_openai_agents.py
  • packages/opentelemetry-instrumentation-openai-agents/opentelemetry/instrumentation/openai_agents/_hooks.py
🧬 Code graph analysis (2)
packages/opentelemetry-instrumentation-openai-agents/tests/test_openai_agents.py (2)
packages/opentelemetry-instrumentation-openai-agents/tests/conftest.py (1)
  • exporter (27-37)
packages/traceloop-sdk/traceloop/sdk/utils/in_memory_span_exporter.py (1)
  • get_finished_spans (40-43)
packages/opentelemetry-instrumentation-openai-agents/opentelemetry/instrumentation/openai_agents/_hooks.py (1)
packages/opentelemetry-semantic-conventions-ai/opentelemetry/semconv_ai/__init__.py (1)
  • SpanAttributes (64-261)
🪛 Ruff (0.12.2)
packages/opentelemetry-instrumentation-openai-agents/tests/test_openai_agents.py

84-88: Use contextlib.suppress(json.JSONDecodeError) instead of try-except-pass

(SIM105)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: Test Packages (3.12)
  • GitHub Check: Lint
  • GitHub Check: Build Packages (3.11)

nirga and others added 2 commits August 29, 2025 17:22
…etry/instrumentation/openai_agents/_hooks.py

Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
@nirga merged commit 23844bb into main on Aug 29, 2025
9 checks passed
@nirga deleted the fix-agents-json-input branch on August 29, 2025 17:35