Skip to content

Commit

Permalink
[memory-infra] Document memory benchmarks
Browse files Browse the repository at this point in the history
Add documentation for memory benchmarks, system health in particular, on what
they are, where are they tracked, how to run them, how to understand results.

BUG=catapult:#3035

Review-Url: https://codereview.chromium.org/2595613002
Cr-Commit-Position: refs/heads/master@{#442278}
  • Loading branch information
perezju authored and Commit bot committed Jan 9, 2017
1 parent 093561e commit 8f6b9c0
Showing 1 changed file with 210 additions and 0 deletions.
210 changes: 210 additions & 0 deletions docs/memory-infra/memory_benchmarks.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,210 @@
# Memory Benchmarks

This document describes benchmarks available to track Chrome's and
WebView's memory usage, where they live, what they measure, how to run them,
and on how to diagnose regressions.

[TOC]

## Glossary

* **User story:** a set of actions to perform on a browser or device (e.g.
open google homepage, type "foo", click search, scroll down, visit first
result, etc.).
* **Metric:** a data aggregation process that takes a Chrome trace as input
(produced by a [Telemetry][] run) and produces a set of summary numbers as
output (e.g. total GPU memory used).
* **Benchmark:** a combination of (one or more) user stories and (one or
more) metrics.

[Telemetry]: https://github.com/catapult-project/catapult/blob/master/telemetry/README.md

## System Health

*System health* is an effort to unify top-level benchmarks (as opposite to
micro-benchmarks and regression tests) that are suitable to capture
representative user stories.

### Benchmarks

System health memory benchmarks are:

* [system_health.memory_mobile][system_health] -
user stories running on Android devices.
* [system_health.memory_desktop][system_health] -
user stories running on desktop platforms.

These benchmarks are run continuously on the [chromium.perf][] waterfall,
collecting and reporting results on the
[Chrome Performance Dashboard][chromeperf].

Other benchmarks maintained by the memory-infra team are discussed in the
[appendix](#Other-benchmarks).

[system_health]: https://chromium.googlesource.com/chromium/src/+/master/tools/perf/page_sets/system_health/
[chromium.perf]: https://build.chromium.org/p/chromium.perf/waterfall
[chromeperf]: https://chromeperf.appspot.com/report

### User stories

System health user stories are classified by the kind of interactions they
perform with the browser:

* `browse` stories navigate to a URL and interact with the page; e.g.
scroll, click on elements, navigate to subpages, navigate back.
* `load` stories just navigate to a URL and wait for the page to
load.
* `background` stories navigate to a URL, possibly interact with the
page, and then bring another app to the foreground (thus pushing the
browser to the background).
* `long_running` stories interact with a page for a longer period
of time (~5 mins).
* `blank` has a single story that just navigates to **about:blank**.

The full name of a story has the form `{interaction}:{category}:{site}` where:

* `interaction` is one the labels given above;
* `category` is used to group together sites with a similar purpose,
e.g. `news`, `social`, `tools`;
* `site` is a short name identifying the website in which the story mostly
takes place, e.g. `cnn`, `facebook`, `gmail`.

For example `browse:news:cnn` and `background:social:facebook` are two system
health user stories.

Today, for most stories a garbage collection is forced at the end of the
story and a memory dump is then triggered. Metrics report the values
obtained from this single measurement.

## Continuous monitoring

![Chrome Performance Dashboard](https://storage.googleapis.com/chromium-docs.appspot.com/79d08f59cf497c761f7099ea427704c14e9afc03.png)

To view data from one of the benchmarks on the
[Chrome Performance Dashboard][chromeperf] you should select:

* **Test suite:** The name of a *[benchmark](#Benchmarks)*.
* **Bot:** The name of a *platform or device configuration*. Sign in to also
see internal bots.
* **Subtest (1):** The name of a *[metric](#Understanding-memory-metrics)*.
* **Subtest (2):** The name of a *story group*; these have the form
`{interaction}_{category}` for system health stories.
* **Subtest (3):** The name of a *[user story](#User-stories)*
(with `:` replaced by `_`).

## How to run the benchmarks

Benchmarks may be run on a local platform/device or remotely on a try job.

### How to run locally

After building, e.g. `ChromePublic.apk`, you can run a specific system health
story with the command:

```
$SRC/tools/perf/run_benchmark run system_health.memory_mobile \
--browser android-chromium --story-filter load:search:google
```

This will run the story with a default of 3 repetitions and produce a
`results.html` file comparing results from this and any previous benchmark
runs.

![Example results.html file](https://storage.googleapis.com/chromium-docs.appspot.com/ea60207d9bb4809178fe75923d6d1a2b241170ef.png)

Other useful options for this command are:

* `--pageset-repeat [n]` - override the default number of repetitions
* `--output-format html --output-format json` - select *both* output formats
to get individual [trace files][memory-infra] in addition to the
`results.html` file.
* `--reset-results` - clear results from any previous benchmark runs in the
`results.html` file.
* `--results-label [label]` - give meaningful names to your benchmark runs,
this way it is easier to compare them.

For WebView make sure to [replace the system WebView][webview_install]
on your device and use `--browser android-webview`.

[webview_install]: https://www.chromium.org/developers/how-tos/build-instructions-android-webview

### How to run a try job

Given a patch on a chromium checkout, try jobs provide a convenient way to
evaluate its memory implications on devices or platforms which
may not be immediately available to developers.

To start a try job [upload a CL][contributing] and run the command, e.g.:

```
$SRC/tools/perf/run_benchmark try android-nexus5 system_health.memory_mobile
```

This will run all of the system health stories for you, and conveniently
provide a `results.html` file comparing measurements with/without your patch.
Options like `--story-filter` and `--pageset-repeat` may also be passed to
this command.

To see the full list of available try bots run the command:

```
$SRC/tools/perf/run_benchmark try list
```

[contributing]: https://www.chromium.org/developers/contributing-code

## Understanding memory metrics

There is a large number of [memory-infra][] metrics, breaking down usage
attributed to different components and processes.

![memory-infra metrics](https://storage.googleapis.com/chromium-docs.appspot.com/a73239c6367ed0f844500e51ce1e04556cb99b4f.png)

Most memory metrics have the form
`memory:{browser}:{processes}:{source}:{component}:{kind}`
where:

* **browser:** One of `chrome` or `webview`.
* **processess:** One of `browser_process`, `renderer_processess`,
`gpu_process`, or `all_processess`.
* **source:** One of `reported_by_chrome` or `reported_by_os`
* **component:** May be a Chrome component, e.g. `skia` or `sqlite`;
details about a specific component, e.g. `v8:heap`; or a class of memory
as seen by the OS, e.g. `system_memory:native_heap` or `gpu_memory`.
* **kind:** The kind of memory being reported. For metrics reported by
Chrome this usually is `effective_size` (others are `locked_size`
and `allocated_objects_size`); for metrics by the OS this usually is
`proportional_resident_size` (others are `peak_resident_size` and
`private_dirty_size`).

[memory-infra]: /memory-infra/README.md

## Appendix

### Other benchmarks

Other benchmarks maintained by the memory-infra team are:

* [memory.dual_browser_test][memory_py] - cycle between doing Google searches
on a WebView-based browser (a stand-in for the Google Search app) and
loading pages on Chrome. Runs on Android devices only.

Story groups are either `on_chrome` or `on_webview`, indicating the browser
in foreground at the moment when the memory measurement was made.

* [memory.long_running_dual_browser_test][memory_py] - same as above, but the
test is run for 60 iterations keeping both browsers alive for the whole
duration of the test and without forcing garbage collection. Intended as a
last-resort net to catch memory leaks not apparent on shorter tests.

* [memory.top_10_mobile][memory_py] - cycle between loading a page on Chrome,
pushing the browser to the background, and then back to the foreground.
*(To be deprecated in favor of system_health.memory_mobile.)*

Story groups are either `foreground` or `background` indicating the state
of the browser at the time of measurement.

* [memory.top_10_mobile_stress][memory_py] - same as above, but keeps a single
instance of the browser open for 5 repetitions. *(To be deprecated.)*

[memory_py]: https://chromium.googlesource.com/chromium/src/+/master/tools/perf/benchmarks/memory.py

0 comments on commit 8f6b9c0

Please sign in to comment.