"Monitor" tab for service health metrics

## Proposed sub-tasks

### Jaeger-Query

Owners: @albertteoh 

- [x] #2946: Add metrics query API spec
- [x] #2977: Add Metrics Reader interface
- [x] #2983, #2988, #3004: Add M3 reader implementation
- [x] #3049: Add factory and flags
- [x] #3055: Add TLS support
- [x] #3091: Add GRPC handler
- [x] #3095: Add HTTP handler
- [x] #3060, #3061: Update query service
- [x] #3079: Hookup metrics query to "main"
- [x] #3171: [Explore options to make monitor tab work for all-in-one (or create issue)](https://github.com/jaegertracing/jaeger/issues/3107)

### Jaeger-UI

Owners: @th3M1ke 

- [x] Approve UX & UI design (this issue)
- [x] https://github.com/jaegertracing/jaeger-ui/pull/815: Create Monitoring Tab for Jaeger UI

### Documentation

Owners: @albertteoh 

- [x] https://github.com/jaegertracing/documentation/pull/539: Add usage documentation for metrics query API (noting that this is "experimental")
- [x] https://github.com/jaegertracing/documentation/issues/553: Add usage documentation for Monitor tab UI

## Requirement - what kind of business use case are you trying to solve?
The main proposal is documented in: https://github.com/jaegertracing/jaeger/issues/2736.

The motivation is to help identify interesting traces (high qps, slow or erroneous) without knowing the service or operations up-front. 

Use cases include:
- Post deployment sanity checks across the org, or on known dependent services in the request chain.
- Monitoring and root-causing when alerted of an issue.
- Better onboarding experience for new users of Jaeger UI.
- Long-term trend analysis of QPS, errors and latencies.
- Capacity planning.

## Proposal - what do you suggest to solve the problem or improve the existing situation?

Add a new "Monitoring" tab situated after "Compare" containing service-level request rates, error rates, latency and impact (= `latency * request rate` to avoid "false positives" from low QPS endpoints with high latencies). 

The data will be sourced from [jaeger-query's new metrics endpoints](https://github.com/jaegertracing/jaeger/issues/2736).

As the jaeger-query metrics endpoints require opt-in to be enabled, the Monitor tab will have a sensible empty state, perhaps a link to documentation on how to enable metrics querying capabilities.

### Workflow
The screen will open to a per-service level set of metrics sorted, by default, on Impact. Columns are configurable by the user with other latency percentiles available, among others. A search box will be available to filter on service names. 

The user need only supply the time period to fetch metrics on (similar to Find Traces), defaulting to a 1 hour lookback.

Note the user is not required to define the step size (the period between data points), at least in this iteration, to keep the user experience as simple as possible. Instead we propose to define the step size based on a sensible heuristic based on the query period and/or the width of the chart. For example:
- `< 30m` search period -> 15s step
- `< 1h` search period -> 1m step, etc.

There are two possible actions from here in this tab:
- Click on a service to drill down to per-operation metrics.
- Click on "View all traces" to go to the Search tab with the service pre-populated and Operation filter set to "all".

#### Service metrics page
If drilling down into the service-level metrics, the page will show a summary of the RED metrics at the top along with the per-operation equivalent metrics as with the per-service metrics above. Also similarly, there will be a search box to filter on operations, and the user has the option to "View all traces" for a given operation.

#### Search tab
The search tab will be the final stage in the workflow (except of course if going back to a previous state), which is pre-populated with the service and/or operation as well as the search period. 

The search period will be sticky between each of these screens to maintain consistency in search results.

### Demo

Courtesy of @Danafrid.

![jaeger-monitor-tab-service](https://user-images.githubusercontent.com/26584478/114831799-20fcc080-9e11-11eb-9df6-5eed4ca2d1ec.jpg)

![jaeger-monitor-operation-tab](https://user-images.githubusercontent.com/26584478/114831774-19d5b280-9e11-11eb-9e69-95de1859a91e.jpg)

https://user-images.githubusercontent.com/26584478/114826556-fad42200-9e0a-11eb-92b5-6454a51f8863.mov


## Any open questions to address

- Any suggestions on charting libraries use for the larger detailed charts and the smaller row-level graphs in the table views?
- Any requirement to maintain consistency with the trace statistics table view?
- What is the preferred behaviour when a large number of services/operations are returned?
  - Show the top n results ordered by Impact by default? What if the user sorts on a different metric like errors? Just sort on the current n results or refetch from jaeger-query?
  - Show everything?
  - Paginate? (probably want to avoid this as it would require maintaining state in UI or jaeger-query)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

"Monitor" tab for service health metrics #2954

Proposed sub-tasks

Jaeger-Query

Jaeger-UI

Documentation

Requirement - what kind of business use case are you trying to solve?

Proposal - what do you suggest to solve the problem or improve the existing situation?

Workflow

Service metrics page

Search tab

Demo

Any open questions to address

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development