feat(llmobs): [MLOB-2681] instrument openai responses with llm #13310

Open
wants to merge 108 commits into
base: main

Conversation

@XG-xin (Contributor) commented May 1, 2025

This PR adds LLM tracing/instrumentation for the OpenAI Responses endpoint.

For streamed responses, we only capture the response.completed chunk, because it includes all of the metadata. Since only one chunk in the stream carries the full response object, we append that object as the first item in streamed_chunks.

This PR only handles the function tool; we may want to support other tool types (e.g. file search) in the future.
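As a minimal sketch of the chunk ordering described above (this is an illustration, not the actual code in ddtrace/contrib/internal/openai/utils.py; the StreamEvent class and collect_streamed_chunks helper are hypothetical stand-ins modeled on the OpenAI Responses streaming event types):

```python
from dataclasses import dataclass
from typing import Any, List


@dataclass
class StreamEvent:
    # Hypothetical stand-in for an OpenAI Responses stream event; the real
    # SDK yields typed event objects with a ``type`` discriminator.
    type: str
    data: Any = None


def collect_streamed_chunks(stream) -> List[Any]:
    """Gather stream chunks, prepending the full response object carried
    by the single response.completed event (it holds all the metadata)."""
    streamed_chunks: List[Any] = []
    for event in stream:
        if event.type == "response.completed":
            # Only one event in the stream carries the complete response
            # object, so it goes in front of any accumulated chunks.
            streamed_chunks.insert(0, event.data)
        elif event.type == "response.output_text.delta":
            streamed_chunks.append(event.data)
    return streamed_chunks


events = [
    StreamEvent("response.output_text.delta", "Hel"),
    StreamEvent("response.output_text.delta", "lo"),
    StreamEvent("response.completed", {"id": "resp_123", "usage": {"total_tokens": 5}}),
]
chunks = collect_streamed_chunks(events)
# chunks[0] is the full response object; the text deltas follow.
```

With this ordering, downstream consumers can read metadata (model, usage, tool calls) from the first element and reassemble output text from the rest.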

Checklist

  • PR author has checked that all the criteria below are met
  • The PR description includes an overview of the change
  • The PR description articulates the motivation for the change
  • The change includes tests OR the PR description describes a testing strategy
  • The PR description notes risks associated with the change, if any
  • Newly-added code is easy to change
  • The change follows the library release note guidelines
  • The change includes or references documentation updates if necessary
  • Backport labels are set (if applicable)

Reviewer Checklist

  • Reviewer has checked that all the criteria below are met
  • Title is accurate
  • All changes are related to the pull request's stated goal
  • Avoids breaking API changes
  • Testing strategy adequately addresses listed risks
  • Newly-added code is easy to change
  • Release note makes sense to a user of the library
  • If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment
  • Backport labels are set in a manner that is consistent with the release branch maintenance policy


github-actions bot commented May 1, 2025

CODEOWNERS have been resolved as:

.riot/requirements/1458d7e.txt                                          @DataDog/apm-python
.riot/requirements/164ce6e.txt                                          @DataDog/apm-python
.riot/requirements/1a18a5a.txt                                          @DataDog/apm-python
.riot/requirements/1dd6795.txt                                          @DataDog/apm-python
.riot/requirements/41b0f95.txt                                          @DataDog/apm-python
.riot/requirements/77994b3.txt                                          @DataDog/apm-python
.riot/requirements/c050b53.txt                                          @DataDog/apm-python
.riot/requirements/e3b63a1.txt                                          @DataDog/apm-python
releasenotes/notes/openai-responses-llm-2194499974f7324e.yaml           @DataDog/apm-python
tests/contrib/openai/cassettes/v1/response_error.yaml                   @DataDog/ml-observability
tests/contrib/openai/cassettes/v1/response_function_call.yaml           @DataDog/ml-observability
tests/contrib/openai/cassettes/v1/response_function_call_streamed.yaml  @DataDog/ml-observability
ddtrace/contrib/integration_registry/registry.yaml                      @DataDog/apm-core-python @DataDog/apm-idm-python
ddtrace/contrib/internal/openai/_endpoint_hooks.py                      @DataDog/ml-observability
ddtrace/contrib/internal/openai/utils.py                                @DataDog/ml-observability
ddtrace/llmobs/_integrations/openai.py                                  @DataDog/ml-observability
ddtrace/llmobs/_integrations/utils.py                                   @DataDog/ml-observability
riotfile.py                                                             @DataDog/apm-python
supported_versions_output.json                                          @DataDog/apm-core-python
supported_versions_table.csv                                            @DataDog/apm-core-python
tests/contrib/openai/test_openai_llmobs.py                              @DataDog/ml-observability
tests/contrib/openai/utils.py                                           @DataDog/ml-observability
tests/snapshots/tests.contrib.openai.test_openai.test_response_error.json  @DataDog/apm-python
tests/snapshots/tests.contrib.openai.test_openai.test_response_stream.json  @DataDog/apm-python
tests/snapshots/tests.contrib.openai.test_openai.test_response_tools_stream.json  @DataDog/apm-python
.riot/requirements/109d638.txt                                          @DataDog/apm-python
.riot/requirements/13c42e3.txt                                          @DataDog/apm-python
.riot/requirements/1ce4e3f.txt                                          @DataDog/apm-python
.riot/requirements/1e8124b.txt                                          @DataDog/apm-python
.riot/requirements/35f0cba.txt                                          @DataDog/apm-python
.riot/requirements/5301b11.txt                                          @DataDog/apm-python


github-actions bot commented May 1, 2025

Bootstrap import analysis

Comparison of import times between this PR and base.

Summary

The average import time from this PR is: 274 ± 2 ms.

The average import time from base is: 276 ± 2 ms.

The import time difference between this PR and base is: -2.2 ± 0.1 ms.

Import time breakdown

The following import paths have shrunk:

ddtrace.auto 2.067 ms (0.76%)
ddtrace.bootstrap.sitecustomize 1.398 ms (0.51%)
ddtrace.bootstrap.preload 1.398 ms (0.51%)
ddtrace.internal.remoteconfig.client 0.664 ms (0.24%)
ddtrace 0.669 ms (0.24%)
ddtrace.internal._unpatched 0.030 ms (0.01%)
json 0.030 ms (0.01%)
json.decoder 0.030 ms (0.01%)
re 0.030 ms (0.01%)
enum 0.030 ms (0.01%)
types 0.030 ms (0.01%)


pr-commenter bot commented May 1, 2025

Benchmarks

Benchmark execution time: 2025-06-17 01:58:38

Comparing candidate commit 37aa590 in PR branch xinyuan/openai-responses-llm with baseline commit d30bcd7 in branch main.

Found 0 performance improvements and 2 performance regressions. Performance is unchanged for 563 metrics; 7 metrics are unstable.

scenario:iastaspects-replace_aspect

  • 🟥 execution_time [+428.760ns; +550.212ns] or [+9.095%; +11.671%]

scenario:telemetryaddmetric-1-distribution-metric-1-times

  • 🟥 execution_time [+226.052ns; +314.762ns] or [+7.721%; +10.751%]


@Yun-Kim Yun-Kim left a comment

Minor nits but good to merge once addressed!

XG-xin and others added 7 commits June 16, 2025 16:33
Co-authored-by: Yun Kim <35776586+Yun-Kim@users.noreply.github.com>
Co-authored-by: Yun Kim <35776586+Yun-Kim@users.noreply.github.com>
…trace-py into xinyuan/openai-responses-llm

Merge remote changes
XG-xin and others added 5 commits June 16, 2025 17:36
Co-authored-by: Yun Kim <35776586+Yun-Kim@users.noreply.github.com>
Co-authored-by: Yun Kim <35776586+Yun-Kim@users.noreply.github.com>
Co-authored-by: Yun Kim <35776586+Yun-Kim@users.noreply.github.com>
5 participants