
Conversation


@codefromthecrypt codefromthecrypt commented Sep 28, 2025

Aligns embedding spans with semantic conventions by removing LLM-specific attributes that don't apply to embedding operations and standardizing embedding data structure:

  • Removes llm.system and llm.provider from embedding spans
  • Adds EMBEDDING_INVOCATION_PARAMETERS constant to semantic conventions
  • Adds OPENINFERENCE_HIDE_EMBEDDINGS_VECTORS environment variable
    • Deprecates OPENINFERENCE_HIDE_EMBEDDING_VECTORS
  • Adds OPENINFERENCE_HIDE_EMBEDDINGS_TEXT environment variable (new functionality; no environment variable existed for this before)
  • Updates spec documentation with rationale for attribute exclusions

Benefits:

  • Clearer semantic conventions: embedding.model_name is sufficient for model identification
  • Avoids ambiguity: llm.system definition unclear when applied to embeddings
  • Consistent with established patterns: embedding spans use embedding.* prefix exclusively

Breaking changes:

openinference-instrumentation-openai:

  • llm.system attribute removed from embedding spans
  • llm.provider attribute removed from embedding spans

openinference-instrumentation-litellm:

  • Attribute names change from embedding.text → embedding.embeddings.{i}.embedding.text
  • Attribute names change from embedding.vector → embedding.embeddings.{i}.embedding.vector
  • embedding.vector now captures all embeddings (not just the first) as tuples (not JSON strings); see the sketch below
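
For illustration, a single-input call maps the old keys to the new ones roughly like this (the values are made up; only the key shapes come from this PR):

```python
# Before: one flat pair; only the first embedding, vector as a JSON string.
old_attributes = {
    "embedding.text": "hello world",
    "embedding.vector": "[0.1, 0.2, 0.3]",
}

# After: per-item keys under embedding.embeddings.{i}; vectors as tuples.
new_attributes = {
    "embedding.embeddings.0.embedding.text": "hello world",
    "embedding.embeddings.0.embedding.vector": (0.1, 0.2, 0.3),
}
```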

openinference-instrumentation (TraceConfig):

  • hide_embedding_vectors behavior changed: now returns "__REDACTED__" instead of removing attribute (None)
    • This aligns with documented spec behavior for redacted content
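
A minimal sketch of the behavioral difference (the function names are illustrative, not the actual TraceConfig internals):

```python
REDACTED_VALUE = "__REDACTED__"  # the value documented in the spec

def mask_embedding_vector_old(value):
    # Old behavior: returning None dropped the attribute entirely.
    return None

def mask_embedding_vector_new(value):
    # New behavior: the attribute is kept, with its value replaced.
    return REDACTED_VALUE
```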

Deprecations:

openinference-instrumentation (TraceConfig):

  • OPENINFERENCE_HIDE_EMBEDDING_VECTORS deprecated → use OPENINFERENCE_HIDE_EMBEDDINGS_VECTORS
  • hide_embedding_vectors deprecated → use hide_embeddings_vectors
  • Both old and new continue to work via OR logic for backwards compatibility
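
Conceptually, the backwards-compatible resolution works like this (a sketch assuming both flags parse as simple booleans; not the actual implementation):

```python
import os

def _env_flag(name: str) -> bool:
    # Illustrative parsing only; the real config may accept other spellings.
    return os.environ.get(name, "").strip().lower() == "true"

# Either variable enables redaction, so existing deployments keep working.
hide_embeddings_vectors = (
    _env_flag("OPENINFERENCE_HIDE_EMBEDDING_VECTORS")      # deprecated spelling
    or _env_flag("OPENINFERENCE_HIDE_EMBEDDINGS_VECTORS")  # new spelling
)
```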

Note

Standardizes embedding spans across SDKs: use the CreateEmbeddings span name, record embedding invocation parameters and per-item text/vector attributes, drop LLM system/provider for embeddings, and add new embedding redaction config; updates specs and tests accordingly.

  • Embedding Semantics & Specs:
    • Add SpanAttributes.EMBEDDING_INVOCATION_PARAMETERS; update docs to use span name CreateEmbeddings, record per-item embedding.embeddings.{i}.embedding.text/vector, and clarify no llm.system/provider for embeddings.
    • Revise spec for text vs token inputs and vector handling; update configuration docs with new redaction env vars and deprecations.
  • Config (TraceConfig):
    • Add OPENINFERENCE_HIDE_EMBEDDINGS_VECTORS (deprecates OPENINFERENCE_HIDE_EMBEDDING_VECTORS) and OPENINFERENCE_HIDE_EMBEDDINGS_TEXT; masking now uses "__REDACTED__".
  • OpenAI Instrumentation:
    • Embedding spans: name CreateEmbeddings, emit embedding.invocation_parameters, exclude llm.system/provider.
  • LiteLLM Instrumentation:
    • Embeddings: name CreateEmbeddings; capture invocation params; record input texts and all vectors under embedding.embeddings.{i}.*; set embedding.model_name from response when present.
  • Tests & Cassettes:
    • Add embedding tests (single/batch, model name override), update assertions to new keys and span names; adjust token count/instrumentation fixtures.

Written by Cursor Bugbot for commit b9dc097. This will update automatically on new commits.

@codefromthecrypt codefromthecrypt requested a review from a team as a code owner September 28, 2025 01:53
@dosubot dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Sep 28, 2025
@codefromthecrypt (Contributor, Author) commented:

So this is the same as #2210, and a FAQ might be: why return an indexed list? The reason is that if you have a mixed batch request, you'll get multiple embedding vectors back, index-correlated with the input text. One example recorded from the OpenAI instrumentation is this one: https://github.com/envoyproxy/ai-gateway/blob/main/tests/internal/testopeninference/spans/embeddings-mixed-batch.json
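
For example, a hypothetical sketch of the index correlation (vector values are made up; see the linked JSON for a real recording):

```python
# A mixed batch: index 0 is raw text, index 1 is a pre-tokenized input.
# inputs = ["hello world", [15339, 1917]]
#
# Each input yields one vector, recorded under the matching index.
# (Token inputs may omit the .text attribute per the spec revision.)
span_attributes = {
    "embedding.embeddings.0.embedding.text": "hello world",
    "embedding.embeddings.0.embedding.vector": (0.01, -0.02, 0.03),
    "embedding.embeddings.1.embedding.vector": (0.04, 0.05, -0.06),
}
```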

@codefromthecrypt codefromthecrypt force-pushed the feat/litellm-align-embedding-instrumentation-with-spec branch from c0c6750 to dc87863 on October 11, 2025 16:52
@dosubot dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. and removed size:L This PR changes 100-499 lines, ignoring generated files. labels Oct 11, 2025
@codefromthecrypt (Contributor, Author) commented:

@axiomofjoy I updated this PR, including the specs, along with your rationale about why embedding spans should have no system/provider attributes. If I got anything wrong, lemme know and I'll bump it!

cursor[bot]: This comment was marked as outdated.

Aligns embedding spans with semantic conventions by removing LLM-specific attributes that don't apply to embedding operations and standardizing embedding data structure:

  - Removes llm.system and llm.provider from embedding spans (LLM spans unchanged)
  - Adds EMBEDDING_INVOCATION_PARAMETERS constant to semantic conventions
  - Adds OPENINFERENCE_HIDE_EMBEDDINGS_VECTORS environment variable
  - Adds OPENINFERENCE_HIDE_EMBEDDINGS_TEXT environment variable (new functionality)
  - Deprecates OPENINFERENCE_HIDE_EMBEDDING_VECTORS (use OPENINFERENCE_HIDE_EMBEDDINGS_VECTORS)
  - Updates spec documentation with rationale for attribute exclusions
  - Fixes security flaw: api_key excluded from embedding.invocation_parameters

  **Key benefits:**
  - Clearer semantic conventions: embedding.model_name is sufficient for model identification
  - Avoids ambiguity: llm.system definition unclear when applied to embeddings
  - Consistent with established patterns: embedding spans use embedding.* prefix exclusively
  - Backwards compatible: deprecated hide_embedding_vectors continues to work

  **Breaking changes:**

  **openinference-instrumentation-openai:**
  - `llm.system` attribute removed from embedding spans
  - `llm.provider` attribute removed from embedding spans

  **openinference-instrumentation-litellm:**
  - Attribute names change from `embedding.text` → `embedding.embeddings.{i}.embedding.text`
  - Attribute names change from `embedding.vector` → `embedding.embeddings.{i}.embedding.vector`
  - `embedding.vector` now captures all embeddings (not just first) as tuples (not JSON strings)

  **openinference-instrumentation (TraceConfig):**
  - `hide_embedding_vectors` behavior changed: now returns `"__REDACTED__"` instead of removing attribute (None)
    - This aligns with documented spec behavior for redacted content

  **Deprecations:**

  **openinference-instrumentation (TraceConfig):**
  - `OPENINFERENCE_HIDE_EMBEDDING_VECTORS` deprecated → use `OPENINFERENCE_HIDE_EMBEDDINGS_VECTORS`
  - `hide_embedding_vectors` deprecated → use `hide_embeddings_vectors`
  - Both old and new continue to work via OR logic for backwards compatibility

Signed-off-by: Adrian Cole <adrian@tetrate.io>
Signed-off-by: Adrian Cole <adrian@tetrate.io>
@codefromthecrypt codefromthecrypt force-pushed the feat/litellm-align-embedding-instrumentation-with-spec branch from dc87863 to b9dc097 on October 11, 2025 18:31
@codefromthecrypt (Contributor, Author) commented:

#2295 for the unrelated inspector test failures

Comment on lines +14 to -18
) -> Iterator[None]:
    LiteLLMInstrumentor().instrument(tracer_provider=tracer_provider)
    yield
    LiteLLMInstrumentor().uninstrument()

Was this pattern causing an issue?

Comment on lines +44 to +55
## Attributes Not Used in Embedding Spans

The following attributes, which are used in LLM spans, are **not applicable** to embedding spans:

- `llm.system`: Not used for embedding spans
- `llm.provider`: Not used for embedding spans

### Rationale

The `llm.system` attribute is defined as "the AI product as identified by the client or server instrumentation." While this definition has been reserved for API providers in LLM spans (e.g., "openai", "anthropic"), it is ambiguous when applied to embedding operations.

Conceptually, `llm.system` describes the shape of the API, while `llm.provider` describes the owner of the physical hardware that runs those APIs. For observability products like Arize and Phoenix, these conventions are primarily consumed in playground features, allowing re-invocation of LLM calls.

Thanks for taking a stab at this. I'm not sure we need to proscribe using llm.system and llm.provider for embedding spans. Only that we've never used these attributes to describe a client interface like litellm before. In the case of the OpenAI SDK, I think it could make sense to set them. It might make less sense to set them if we don't know the specific API being hit, as is the case for LiteLLM.

Let me chat with @RogerHYang to see if he has the same understanding.
