Add viz stat-inbound and viz stat-outbound commands #12994

adleong · 2024-08-26T23:10:27Z

We add two new commands to the linkerd viz extension: linkerd viz stat-inbound and linkerd viz stat-outbound. These commands are meant as replacements for the linkerd viz stat. The linkerd viz stat command provides stats when ServiceProfiles are used whereas the new commands provide stats when xRoute resources are used. Either command can be used when no xRoute or ServiceProfile is used but the new commands include several improvements:

Inbound and outbound stats are clearly separated into different commands rather than being contextual based on flag combinations
Route level and backend level stats are displayed together in a tree-view in linkerd viz stat-outbound to easily see the effects of retries, timeouts, and traffic splitting

> linkerd viz stat-outbound -n schlep deploy                  
NAME         SERVICE      ROUTE           TYPE       BACKEND    SUCCESS   RPS  LATENCY_P50  LATENCY_P95  LATENCY_P99  TIMEOUTS  RETRIES  
client-http  schlep:80    schlep-default  HTTPRoute             100.00%  1.00         31ms        387ms        478ms     0.00%    6.25%  
                          └───────────────────────►  schlep:80   93.75%  1.07         16ms         88ms         98ms     1.56%           
client-grpc  schlep:8080  schlep-default  GRPCRoute              98.31%  0.98         36ms        425ms        485ms     0.00%    0.00%  
                          ├───────────────────────►  fail:8080   96.88%  0.53         12ms         24ms         25ms     0.00%           
                          └───────────────────────►  good:8080  100.00%  0.45         25ms         95ms         99ms     0.00%

> linkerd viz stat-inbound -n schlep deploy
NAME         SERVER          ROUTE      TYPE  SUCCESS   RPS  LATENCY_P50  LATENCY_P95  LATENCY_P99  
client-grpc  [default]:4191  [default]        100.00%  0.10          2ms          3ms          3ms  
client-grpc  [default]:4191  probe            100.00%  0.20          0ms          1ms          1ms  
client-http  [default]:4191  [default]        100.00%  0.10          2ms          2ms          2ms  
client-http  [default]:4191  probe            100.00%  0.20          0ms          1ms          1ms  
server-fail  [default]:4191  probe            100.00%  0.20          0ms          1ms          1ms  
server-fail  [default]:4191  [default]        100.00%  0.10          2ms          2ms          2ms  
server-fail  [default]:8080  [default]         94.87%  1.30          0ms          1ms          1ms  
server-good  [default]:4191  [default]        100.00%  0.10          0ms          1ms          1ms  
server-good  [default]:4191  probe            100.00%  0.20          0ms          1ms          1ms  
server-good  [default]:8080  [default]        100.00%  0.73          8ms         92ms         98ms  
server-slow  [default]:4191  [default]        100.00%  0.10          0ms          1ms          1ms  
server-slow  [default]:4191  probe            100.00%  0.20          0ms          1ms          1ms

Unlike the linkerd viz stat command, these commands query prometheus directly rather than going through the intermediary of the metrics-api. If prometheus is enabled in linkerd-viz, these commands will use a port-forward to connect to that prometheus instance. If an external prometheus is configured, these commands will attempt to use that prometheus URL; however note that the prometheus URL must be reachable from where the CLI is executed for this to work. This can be overridden by a --prometheusURL flag.

Json and table output are both supported.

Signed-off-by: Alex Leong <alex@buoyant.io>

alpeb

I'm a big fan of this change. UX vastly improved and the code reads very well too!

I think most of my comments below for stat-inbound also apply to stat-outbound.

cli/cmd/root.go

viz/pkg/api/api.go

alpeb · 2024-08-28T14:37:05Z

viz/cmd/stat-inbound.go

+	}
+	for quantile, resultsChan := range results {
+		go func(quantile string) {
+			defer close(resultsChan)


Don't you need to pin resultsChan?

Apparently not since all the quantile channels do get populated with this code. I think because channels are always passed by reference, the contents of the loop variable are already a reference (instead of being a reference to the loop variable).

hmm I think that being a reference doesn't matter for how closures use those vars, but we likely are just benefiting from go 1.22's fix around this type of issue, so nothing to see here :-)

viz/cmd/stat-inbound.go

alpeb · 2024-08-28T14:55:12Z

viz/cmd/stat-inbound.go

+  * cronjobs
+  * daemonsets
+  * deployments
+  * namespaces
+  * jobs
+  * pods
+  * replicasets
+  * replicationcontrollers
+  * statefulsets`,


Authority is still supported, I think it's worth mentioning it here.

Hmm... technically this works but it kind of breaks the mental model of centering around concrete workloads. I'm tempted to explicitly make this not work. Is there a compelling use case for it?

Not that I know of...

Signed-off-by: Alex Leong <alex@buoyant.io>

adleong added 3 commits August 24, 2024 01:03

Add viz stat-inbound and stat-outbound commands

4680a3a

Signed-off-by: Alex Leong <alex@buoyant.io>

Add json output support

284f275

Signed-off-by: Alex Leong <alex@buoyant.io>

remove stuttering in prometheus package

13e2447

Signed-off-by: Alex Leong <alex@buoyant.io>

adleong requested a review from a team as a code owner August 26, 2024 23:10

Fix table rendering

e6fc667

Signed-off-by: Alex Leong <alex@buoyant.io>

alpeb reviewed Aug 28, 2024

View reviewed changes

feedback

6fd27b0

Signed-off-by: Alex Leong <alex@buoyant.io>

alpeb approved these changes Aug 28, 2024

View reviewed changes

alpeb mentioned this pull request Aug 28, 2024

Update docs to use stat-inbound and stat-outbound linkerd/website#1833

Merged

restore probe routes

85f65ab

Signed-off-by: Alex Leong <alex@buoyant.io>

adleong merged commit 366ab94 into main Aug 29, 2024
39 checks passed

adleong deleted the alex/in-n-out-bound branch August 29, 2024 19:31

wmorgan mentioned this pull request Aug 30, 2024

Health of Linkerd project cncf/toc#1262

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add viz stat-inbound and viz stat-outbound commands #12994

Add viz stat-inbound and viz stat-outbound commands #12994

adleong commented Aug 26, 2024

alpeb left a comment

alpeb Aug 28, 2024

adleong Aug 28, 2024

alpeb Aug 28, 2024

alpeb Aug 28, 2024

adleong Aug 28, 2024

alpeb Aug 28, 2024

Add viz stat-inbound and viz stat-outbound commands #12994

Add viz stat-inbound and viz stat-outbound commands #12994

Conversation

adleong commented Aug 26, 2024

alpeb left a comment

Choose a reason for hiding this comment

alpeb Aug 28, 2024

Choose a reason for hiding this comment

adleong Aug 28, 2024

Choose a reason for hiding this comment

alpeb Aug 28, 2024

Choose a reason for hiding this comment

alpeb Aug 28, 2024

Choose a reason for hiding this comment

adleong Aug 28, 2024

Choose a reason for hiding this comment

alpeb Aug 28, 2024

Choose a reason for hiding this comment