feat(llmobs): track prompt caching for openai chat completions #13755
Conversation
Bootstrap import analysis

Comparison of import times between this PR and base.

Summary
- The average import time from this PR is: 275 ± 2 ms.
- The average import time from base is: 277 ± 2 ms.
- The import time difference between this PR and base is: -1.95 ± 0.08 ms.

Import time breakdown

The following import paths have shrunk:
Benchmarks

Benchmark execution time: 2025-07-09 15:29:00

Comparing candidate commit 749f06a in PR branch.

Found 0 performance improvements and 1 performance regression! Performance is the same for 523 metrics, 2 unstable metrics.

scenario:iastaspectsospath-ospathnormcase_aspect
Nicely done! Small nits but otherwise lgtm
…-trace-py into evan.li/openai-prompt-caching
Tracks the number of tokens read from the prompt cache for OpenAI chat completions.

OpenAI does prompt caching by default and returns a `cached_tokens` field in `prompt_tokens_details`: https://platform.openai.com/docs/api-reference/chat/create
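For reference, a minimal sketch of the `usage` block a chat completion response can carry (the token counts here are made up for illustration):

```python
# Illustrative shape of the usage payload returned by the Chat Completions API.
usage = {
    "prompt_tokens": 2006,
    "completion_tokens": 300,
    "total_tokens": 2306,
    "prompt_tokens_details": {
        "cached_tokens": 1920,  # tokens served from the prompt cache
        "audio_tokens": 0,
    },
}

cached_tokens = usage["prompt_tokens_details"]["cached_tokens"]
```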
We rely on two keys in metrics for prompt caching:

- `cache_read_input_tokens`
- `cache_write_input_tokens`

We have both of these fields since Bedrock/Anthropic return info on cache reads and writes. `cached_tokens` maps to `cache_read_input_tokens`, as sketched below.
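A minimal sketch of that mapping, assuming a dict-like `usage` payload; the helper name and everything other than the two metric keys are hypothetical, not the integration's actual code:

```python
CACHE_READ_INPUT_TOKENS = "cache_read_input_tokens"


def cache_metrics_from_usage(usage: dict) -> dict:
    """Map OpenAI's cached_tokens onto the shared cache-read metric key."""
    metrics = {}
    details = usage.get("prompt_tokens_details") or {}
    cached = details.get("cached_tokens")
    if cached is not None:
        # OpenAI only reports cache reads, so cache_write_input_tokens is not set here.
        metrics[CACHE_READ_INPUT_TOKENS] = cached
    return metrics


# e.g. cache_metrics_from_usage(usage) -> {"cache_read_input_tokens": 1920}
```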
Checklist
Reviewer Checklist