-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Open
Labels
observabilityIssues related to observability or telemetryIssues related to observability or telemetrypython
Description
otel has established semantic conventions for tracing server latency.
Propose setting up latency metrics for the following:
gen_ai.server.request.durationgen_ai.server.time_per_output_tokengen_ai.server.time_to_first_token
These metrics allow users to diagnose latency and identify opportunities for pipeline enhancements. This feature will also elevate MS Agent Framework for production use-cases instead of a fun prototyping tool.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
observabilityIssues related to observability or telemetryIssues related to observability or telemetrypython
Type
Projects
Status
No status