fix(llmobs): fix input token counting for bedrock prompt caching #13919

lievan · 2025-07-08T21:00:09Z

When prompt caching is used in bedrock, the number of input tokens returned in the usage field is the number of non cached tokens, not the total number of tokens sent to the model (what we expect in datadog)

This pr fixes this by setting input tokens to the total number of input tokens (cache read + cache write + input tokens)

Checklist

PR author has checked that all the criteria below are met
The PR description includes an overview of the change
The PR description articulates the motivation for the change
The change includes tests OR the PR description describes a testing strategy
The PR description notes risks associated with the change, if any
Newly-added code is easy to change
The change follows the library release note guidelines
The change includes or references documentation updates if necessary
Backport labels are set (if applicable)

Reviewer Checklist

Reviewer has checked that all the criteria below are met
Title is accurate
All changes are related to the pull request's stated goal
Avoids breaking API changes
Testing strategy adequately addresses listed risks
Newly-added code is easy to change
Release note makes sense to a user of the library
If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment
Backport labels are set in a manner that is consistent with the release branch maintenance policy

github-actions · 2025-07-08T21:00:40Z

CODEOWNERS have been resolved as:

releasenotes/notes/fix-bedrock-input-tokens-unified-calculation-b2c3d4e5f6g7h8i9.yaml  @DataDog/apm-python
ddtrace/llmobs/_integrations/bedrock.py                                 @DataDog/ml-observability
ddtrace/llmobs/_integrations/bedrock_utils.py                           @DataDog/ml-observability
tests/contrib/botocore/test_bedrock_llmobs.py                           @DataDog/ml-observability

github-actions · 2025-07-08T21:21:14Z

Bootstrap import analysis

Comparison of import times between this PR and base.

Summary

The average import time from this PR is: 279 ± 3 ms.

The average import time from base is: 281 ± 3 ms.

The import time difference between this PR and base is: -1.7 ± 0.1 ms.

Import time breakdown

The following import paths have shrunk:

ddtrace.auto 1.960 ms (0.70%)

ddtrace.bootstrap.sitecustomize 1.281 ms (0.46%)

ddtrace.bootstrap.preload 1.281 ms (0.46%)

ddtrace.internal.remoteconfig.client 0.649 ms (0.23%)

ddtrace 0.678 ms (0.24%)

ddtrace.internal._unpatched 0.031 ms (0.01%)

json 0.031 ms (0.01%)

json.decoder 0.031 ms (0.01%)

re 0.031 ms (0.01%)

enum 0.031 ms (0.01%)

types 0.031 ms (0.01%)

pr-commenter · 2025-07-08T21:46:56Z

Benchmarks

Benchmark execution time: 2025-07-15 03:56:37

Comparing candidate commit f2db3db in PR branch evan.li/fix-bedrock-tokens with baseline commit 573a530 in branch main.

Found 0 performance improvements and 2 performance regressions! Performance is the same for 480 metrics, 2 unstable metrics.

scenario:iastaspects-lstrip_aspect

🟥 execution_time [+736.422ns; +823.072ns] or [+7.031%; +7.859%]

scenario:iastaspects-replace_aspect

🟥 execution_time [+746.774ns; +807.135ns] or [+15.835%; +17.115%]

releasenotes/notes/fix-bedrock-input-tokens-unified-calculation-b2c3d4e5f6g7h8i9.yaml

Yun-Kim

Cosmetic nits but LGTM

ddtrace/llmobs/_integrations/bedrock_utils.py

releasenotes/notes/fix-bedrock-input-tokens-unified-calculation-b2c3d4e5f6g7h8i9.yaml

Co-authored-by: Yun Kim <35776586+Yun-Kim@users.noreply.github.com>

…n-b2c3d4e5f6g7h8i9.yaml Co-authored-by: Yun Kim <35776586+Yun-Kim@users.noreply.github.com>

lievan added 2 commits July 8, 2025 16:22

ant

d6e7c83

cleanup

1a1bcda

lievan requested review from a team as code owners July 8, 2025 21:00

lievan requested review from duncanista and rachelyangdog July 8, 2025 21:00

github-actions bot added the backport 2.21 label Jul 8, 2025

Merge branch 'main' into evan.li/fix-bedrock-tokens

7665b92

emmettbutler approved these changes Jul 9, 2025

View reviewed changes

releasenotes/notes/fix-bedrock-input-tokens-unified-calculation-b2c3d4e5f6g7h8i9.yaml Outdated Show resolved Hide resolved

rachelyangdog approved these changes Jul 9, 2025

View reviewed changes

Yun-Kim approved these changes Jul 11, 2025

View reviewed changes

ddtrace/llmobs/_integrations/bedrock_utils.py Outdated Show resolved Hide resolved

releasenotes/notes/fix-bedrock-input-tokens-unified-calculation-b2c3d4e5f6g7h8i9.yaml Outdated Show resolved Hide resolved

lievan removed the backport 2.21 label Jul 14, 2025

lievan and others added 2 commits July 14, 2025 14:41

Update ddtrace/llmobs/_integrations/bedrock_utils.py

7ecd087

Co-authored-by: Yun Kim <35776586+Yun-Kim@users.noreply.github.com>

Update releasenotes/notes/fix-bedrock-input-tokens-unified-calculatio…

31103e9

…n-b2c3d4e5f6g7h8i9.yaml Co-authored-by: Yun Kim <35776586+Yun-Kim@users.noreply.github.com>

lievan enabled auto-merge (squash) July 14, 2025 18:42

lievan added 2 commits July 14, 2025 17:45

clarify read/write

2bfcf0a

ruff

f2db3db

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(llmobs): fix input token counting for bedrock prompt caching #13919

fix(llmobs): fix input token counting for bedrock prompt caching #13919

lievan commented Jul 8, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Jul 8, 2025

Uh oh!

github-actions bot commented Jul 8, 2025 •

edited

Loading

Uh oh!

pr-commenter bot commented Jul 8, 2025 •

edited

Loading

Uh oh!

Uh oh!

Yun-Kim left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fix(llmobs): fix input token counting for bedrock prompt caching #13919

Are you sure you want to change the base?

fix(llmobs): fix input token counting for bedrock prompt caching #13919

Conversation

lievan commented Jul 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist

Reviewer Checklist

Uh oh!

github-actions bot commented Jul 8, 2025

Uh oh!

github-actions bot commented Jul 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Bootstrap import analysis

Summary

Import time breakdown

Uh oh!

pr-commenter bot commented Jul 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmarks

scenario:iastaspects-lstrip_aspect

scenario:iastaspects-replace_aspect

Uh oh!

Uh oh!

Yun-Kim left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lievan commented Jul 8, 2025 •

edited

Loading

github-actions bot commented Jul 8, 2025 •

edited

Loading

pr-commenter bot commented Jul 8, 2025 •

edited

Loading