[exporter/googlecloud] Correctly translate exception-events from OpenTelemetry to Google Trace format #12885

Jaskaranbir · 2022-08-01T22:37:21Z

The OpenTelemtry spec provides info on exception-handling that is based adding exception-info as Event: https://github.com/open-telemetry/opentelemetry-specification/blob/main/specification/trace/semantic_conventions/exceptions.md

Currently, the exception-events passthrough as-is without any extra processing, which leads to following representation in Trace UI:

Notice that the stacktrace above is incomplete and has been character-limited by GCP, which significantly reduces its usefulness.

Meanwhile, referencing the following screenshot from: https://cloud.google.com/trace/docs/trace-labels

The above screenshot includes a specially formatted section for displaying call-stack, which is extremely helpful.

So it would be great if the exceptions exported by OpenTelemetry could be formatted to Trace's conventions to allow displaying them in correct format as achieved by using native Google.Diagnostic libraries. This seems to require a rather tricky stack-trace parsing (especially considering different languages will have their own stack-trace representation), but is there a possibility to develop some conventions to improve the feasibility of this (such as expecting special tags within spans/traces with segments of pre-parsed stack-trace)?

Just in case, .NET Core is my primary development language here and it seems to have fairly under-developed libraries for exporting Otel data to GCP, which is one of the major reasons for using OpenTelemetry Collector.

The text was updated successfully, but these errors were encountered:

github-actions · 2022-08-05T17:09:55Z

Pinging code owners: @aabmass @dashpole @jsuereth @punya @tbarker25 @damemi. See Adding Labels via Comments if you do not have permissions to add labels yourself.

dashpole · 2022-08-05T20:08:30Z

The description of /stacktrace in the docs you reference looks like it only applies to the v1 API, while we use the v2 API:
JSON-formatted stack trace. Stack traces aren't indexed for search.This label is displayed in the Call Stack table.The /stacktrace label is only used by the V1 API.For the V2 API, the span contains a stackTrace field.

So it looks like we would actually want to support setting Span.StackTrace based on the attribute. Otherwise, this seems like it would be nice to have, but it looks like a hard problem to solve. We will probably keep it on the backlog for now, but we appreciate the feedback.

Jaskaranbir · 2022-08-05T21:35:43Z

Thanks for the response @dashpole. I see that StackTrace constitutes the StackFrame here: https://github.com/googleapis/go-genproto/blob/01dd62135a58/googleapis/devtools/cloudtrace/v2/trace.pb.go#L1226

With my limited context, is it still as complex if the client can be expected to handle transforming raw stacktrace into stackframes object above, and pass the structured response via attribute? Even a rudimentary implementation of this would be quite useful.

dashpole · 2022-08-08T15:54:53Z

I don't know if that would be a great contract for the exporter to have. It would be fairly difficult for clients to implement correctly, so I would prefer having something in the exporter. I'm wondering if starting with something as simple as "split the stacktrace into one attribute for each line" (e.g. exception.stacktrace.0, exception.stacktrace.1) would make this useable while we figure out how we want to handle this...

Jaskaranbir · 2022-08-09T16:46:46Z

So it'll display in the TraceUI as multiple tags for now? That sounds good enough as starting point to me.

dashpole · 2022-08-09T18:44:34Z

A teammate pointed me to https://cloud.google.com/trace/docs/quotas, which says we are limited to 32 attributes per-span. I'm worried that the multiple-attribute approach would hit that limit ~instantly, and just end up causing other attributes to be dropped...

Jaskaranbir · 2022-08-09T19:16:29Z

The limit does make sense.

Referring back to a previous comment about having a defined contract for stacktrace, would it be an option to have provisions, based on some configurable options of the exporter, for allowing users to either format their own stacktrace or allowing them to pass a raw stacktrace and attempting to format it?

Since stacktrace format isnt standardized across languages and tools, I think the latter of the above options would be quite difficult to achieve. Although we can still have a best-attempt basis implementation that tries to format a raw stacktrace. But allowing users to define their own call-stack/stacktrace formats will be easier to implement and more flexible. Then in the future, if the latter option works out, the former option (of expecting a pre-structure stacktrace) can be deprecated.

Note: The auto-formatting raw stacktrace doesnt have to be implemented right now. I am really just referring to structuring the code in a way that allows moving towards that goal in future, but start with easier implementation instead that still allows using Trace to its ideal extent.

dashpole · 2022-08-09T19:30:13Z

I could totally see that working for an SDK exporter (e.g. provide a string -> StackTrace function), but that sounds hard to do in the collector via yaml config.

Jaskaranbir · 2022-08-10T00:24:25Z

I was thinking about an implementation where the formatted stacktrace is structured and added as a tag containing JSON data. The JSON structure could even be what V1 API uses for stacktraces:

    "stackTrace": {
      "stackFrames": {
        "frame": [
          {
            "functionName": {
              "value": "serverMethodTrace [as func]"
            },
            "fileName": {
              "value":
              "/usr/src/app/node_modules/@google-cloud/trace-agent/build/src/plugins/plugin-grpc.js"
            },
            "lineNumber": "249",
            "columnNumber": "28"
          },
          {
            "functionName": {
              "value": "anonymous function"
            },
            "fileName": {
              "value": "/usr/src/app/node_modules/grpc/src/server.js"
            },
            "lineNumber": "592",
            "columnNumber": "13"
          }
        ]
      }
    }

So the exporter receives a defined tag with this JSON structure and uses it to build the V2-API call-stack. On the configuration side for the exporter, there's option to either parse the data from this tag as pre-formatted JSON-stacktrace or the raw-stacktrace (for now, it can only support one option, that of pre-formatted stacktrace).

What are your thoughts on something like this?

dashpole · 2022-08-10T19:30:00Z

It seems really hard for users to implement... I'll have to chew on it a bit.

Just for fun, can you try adding this in your exporter config:

trace:
    attribute_mappings:
    - key: exception.stacktrace
      replacement: /stacktrace

To see if the /stacktrace field still works in the v2 API?

Jaskaranbir · 2022-08-12T18:42:25Z

Sorry, was busy in last few days. Though I think some major languages already seem to include stacktrace parsers to an extent. For example, Go, Ruby, Dotnet, JavaScript (albit with external packages, but there are lots available). It'll just be about mapping an object to another.

And considering this is an alternative implementation that can be deprecated anytime and is easy enough to implement, this seems good enough as an optional drop-in until better implementation.

Will test out the /stacktrace field with V2 API when I get chance (likely sometime this weekend), thanks for the idea!

Jaskaranbir · 2022-08-29T00:18:23Z

Finally had the chance to try this out. Though adding the exception.stacktrace attribute didnt quite work, it just adds a /stacktrace label in UI with clipped stacktrace text (same result as issue description).

@dashpole any further thoughts on the intermediate workaround (#12885 (comment))

dashpole · 2022-08-31T16:53:24Z

Not right now. If we did go that route it is likely something we would have to keep around (we wouldn't risk breaking users by deprecating it), so i'd like to give it more thought before moving forward.

Jaskaranbir · 2022-09-05T18:51:41Z

Sounds good, keeping my fingers crossed for this 🤞

github-actions · 2022-11-10T03:52:37Z

This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping @open-telemetry/collector-contrib-triagers. If this issue is still relevant, please ping the code owners or leave a comment explaining why it is still relevant. Otherwise, please close it.

Pinging code owners:

exporter/googlecloud: @aabmass @dashpole @jsuereth @punya @tbarker25 @damemi

See Adding Labels via Comments if you do not have permissions to add labels yourself.

github-actions · 2023-01-10T03:30:56Z

This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping @open-telemetry/collector-contrib-triagers. If this issue is still relevant, please ping the code owners or leave a comment explaining why it is still relevant. Otherwise, please close it.

Pinging code owners:

exporter/googlecloud: @aabmass @dashpole @jsuereth @punya @tbarker25 @damemi

See Adding Labels via Comments if you do not have permissions to add labels yourself.

github-actions · 2023-05-27T02:24:33Z

This issue has been closed as inactive because it has been stale for 120 days with no activity.

TylerHelmuth added exporter/googlecloud enhancement New feature or request labels Aug 5, 2022

TylerHelmuth added the priority:p3 Lowest label Aug 5, 2022

github-actions bot added the Stale label Nov 10, 2022

dashpole removed the Stale label Nov 10, 2022

github-actions bot added the Stale label Jan 10, 2023

github-actions bot added the closed as inactive label May 27, 2023

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale May 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[exporter/googlecloud] Correctly translate exception-events from OpenTelemetry to Google Trace format #12885

[exporter/googlecloud] Correctly translate exception-events from OpenTelemetry to Google Trace format #12885

Jaskaranbir commented Aug 1, 2022

github-actions bot commented Aug 5, 2022

dashpole commented Aug 5, 2022

Jaskaranbir commented Aug 5, 2022

dashpole commented Aug 8, 2022

Jaskaranbir commented Aug 9, 2022 •

edited

Loading

dashpole commented Aug 9, 2022

Jaskaranbir commented Aug 9, 2022 •

edited

Loading

dashpole commented Aug 9, 2022

Jaskaranbir commented Aug 10, 2022

dashpole commented Aug 10, 2022

Jaskaranbir commented Aug 12, 2022

Jaskaranbir commented Aug 29, 2022 •

edited

Loading

dashpole commented Aug 31, 2022

Jaskaranbir commented Sep 5, 2022

github-actions bot commented Nov 10, 2022

github-actions bot commented Jan 10, 2023

github-actions bot commented May 27, 2023

[exporter/googlecloud] Correctly translate exception-events from OpenTelemetry to Google Trace format #12885

[exporter/googlecloud] Correctly translate exception-events from OpenTelemetry to Google Trace format #12885

Comments

Jaskaranbir commented Aug 1, 2022

github-actions bot commented Aug 5, 2022

dashpole commented Aug 5, 2022

Jaskaranbir commented Aug 5, 2022

dashpole commented Aug 8, 2022

Jaskaranbir commented Aug 9, 2022 • edited Loading

dashpole commented Aug 9, 2022

Jaskaranbir commented Aug 9, 2022 • edited Loading

dashpole commented Aug 9, 2022

Jaskaranbir commented Aug 10, 2022

dashpole commented Aug 10, 2022

Jaskaranbir commented Aug 12, 2022

Jaskaranbir commented Aug 29, 2022 • edited Loading

dashpole commented Aug 31, 2022

Jaskaranbir commented Sep 5, 2022

github-actions bot commented Nov 10, 2022

github-actions bot commented Jan 10, 2023

github-actions bot commented May 27, 2023

Jaskaranbir commented Aug 9, 2022 •

edited

Loading

Jaskaranbir commented Aug 9, 2022 •

edited

Loading

Jaskaranbir commented Aug 29, 2022 •

edited

Loading