
[loadbalancingexporter] Not properly batching service traces #13826

Open
crobertson-conga opened this issue Sep 1, 2022 · 6 comments
Labels: bug (Something isn't working), exporter/loadbalancing, never stale (Issues marked with this label will be never staled and automatically removed), priority:needed (Triagers reviewed the issue but need code owner to set priority)

Comments

@crobertson-conga
Contributor

crobertson-conga commented Sep 1, 2022

Describe the bug
The new loadbalancingexporter option for routing traces by service name sends the entire batch of traces to each endpoint instead of splitting the batch so that each endpoint only receives the traces for the services routed to it.

Steps to reproduce
Use the new routing_key: service option to start splitting up the traces by service. Have at least two receiving collectors. In the receiving collectors, use a resource detection processor to augment the trace payload so you can see which collector receives each trace (see the sketch below).
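A minimal sketch of that tagging step on each receiving collector (the detector choice is illustrative; anything that adds a per-collector resource attribute such as host.name works):

      processors:
        resourcedetection:
          detectors: [env, system]   # adds host-identifying resource attributes to incoming spans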

What did you expect to see?
All traces for a given service name should arrive at the same receiving collector.

What did you see instead?
Traces for a given service name arrived at both receiving collectors.

What version did you use?
0.59.0

What config did you use?

      loadbalancing/spanmetrics:
        routing_key: service
        protocol:
          otlp:
            tls:
              insecure: true
        resolver:
          dns:
            hostname: <some_k8s_service_to_target_collectors>
            port: 4317
            interval: 1m
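
For completeness, this is roughly how that exporter fragment is wired into the sending collector's pipeline (the receiver and batch settings here are illustrative, not my exact config):

      receivers:
        otlp:
          protocols:
            grpc:

      processors:
        batch:

      service:
        pipelines:
          traces:
            receivers: [otlp]
            processors: [batch]
            exporters: [loadbalancing/spanmetrics]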

Environment
Doesn't matter


@crobertson-conga crobertson-conga added the bug Something isn't working label Sep 1, 2022
@crobertson-conga crobertson-conga changed the title [loadbalancingexporter] [loadbalancingexporter] Not properly batching service traces Sep 1, 2022
@crobertson-conga
Contributor Author

@aishyandapalli this is an FYI: I think your new feature for #12421 has a bug in it. I think it's stemming from the exporter consuming all traces instead of just the ones associated with the routing key.

@crobertson-conga
Contributor Author

crobertson-conga commented Sep 1, 2022

Actually, I'm not sure that's the problem. I set up batching with a max size of one, and all my span metrics collectors are still getting signals across all services.
[Screenshot attached: Screen Shot 2022-09-01 at 7.10.15 PM]

      batch/one: # super inefficient data-wise, but it looks like the loadbalancing exporter doesn't split properly
        send_batch_size: 1
        send_batch_max_size: 1

I have a resource processor in front of the span metrics processor that annotates the incoming traces, hence the aggregator dimension.

The collector doing span metrics receives the traces forwarded by the loadbalancing exporter.

[Edge collectors] -> [Main central collector] -> [Spanmetrics collector(s)]
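
The spanmetrics collectors look roughly like this (simplified sketch; the attribute value, exporter choices, and spanmetrics settings are illustrative):

      receivers:
        otlp:
          protocols:
            grpc:

      processors:
        resource:
          attributes:
            - key: aggregator                # so I can tell the spanmetrics collectors apart
              value: spanmetrics-collector-a
              action: insert
        spanmetrics:
          metrics_exporter: prometheus       # spanmetrics config abbreviated

      exporters:
        prometheus:
          endpoint: 0.0.0.0:8889
        logging:

      service:
        pipelines:
          traces:
            receivers: [otlp]
            processors: [resource, spanmetrics]
            exporters: [logging]
          metrics:
            receivers: [otlp]
            exporters: [prometheus]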

@crobertson-conga
Contributor Author

Doing some more testing leads me to believe it may be due to forcibly closed gRPC connections making the load balancer move to the next available instance. I will close this if that turns out to be the case.

@crobertson-conga
Contributor Author

Okay, so this was due to my configuration, which was interrupting the gRPC connection regularly. Sorry.

@crobertson-conga
Contributor Author

crobertson-conga commented Sep 2, 2022

Okay, after removing my batching of size one, the issue reappeared. I had two problems: one is resolved by not allowing connections to terminate artificially; the other is that if the traces are in a batch with multiple service names, they get sent to all target collectors by the loadbalancing exporter.

This leads me to believe the original issue, that all the spans are being sent to every endpoint regardless of their actual service, is correct.

@evan-bradley evan-bradley added the priority:needed Triagers reviewed the issue but need code owner to set priority label Sep 9, 2022
@github-actions
Contributor

This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping @open-telemetry/collector-contrib-triagers. If this issue is still relevant, please ping the code owners or leave a comment explaining why it is still relevant. Otherwise, please close it.

Pinging code owners:

See Adding Labels via Comments if you do not have permissions to add labels yourself.
