High input token usage #123

edvanr · 2025-05-27T07:10:13Z

edvanr
May 27, 2025

I've been playing around with Strands Agents and loving it. My use case is around a meeting assistant, and it is helping me with my meetings and calendar data.

I've been testing it out for a few days and just realised my AWS bill for the model has jumped up over $50.

I used some of the metrics to find out token usage and it looks like the majority of it is coming from input tokens.

For a query like "What meeting do I have today?" I'm seeing about 45k tokens being used, which costs around 14c. Does that sound right?

Are there ways to optimise the amount of input that Strands is generating?

awsarron · 2025-06-16T04:21:20Z

awsarron
Jun 16, 2025
Maintainer

Input tokens are made up of:

System prompt
Tool specifications (name, description, input schema)
Agent state - user messages, assistant messages, tool calls, tool results

When running agents, tokens for each cycle can be seen in the AgentResult object (https://strandsagents.com/latest/user-guide/observability-evaluation/metrics/). For example:

from strands import Agent
from strands_tools import current_time

agent = Agent(tools=[current_time])
agent_result = agent("What time is it?")

print("\n\n")
print(agent_result.metrics.get_summary())

Additionally, Strands integrates with OpenTelemetry traces: https://strandsagents.com/latest/user-guide/observability-evaluation/traces/.

Can you share a full code sample and your input messages to the agent that reproduces this issue?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

High input token usage #123

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

High input token usage #123

Uh oh!

edvanr May 27, 2025

Replies: 1 comment

Uh oh!

awsarron Jun 16, 2025 Maintainer

edvanr
May 27, 2025

awsarron
Jun 16, 2025
Maintainer