CA-372059: refactor the type of host in `squeeze.ml` #5387

duobei · 2024-01-24T11:28:02Z

use Set instead of List because set_difference implemented using List is quadratic cost

The flame graph before refactoring:

The flame graph after refactoring:

Special thanks to @psafont and @edwintorok for guides.

ocaml/idl/datamodel_lifecycle.ml

ocaml/squeezed/lib/squeeze.ml

use `Set` instead of `List` because `set_difference` implemented using `List` is quadratic cost Signed-off-by: Luca Zhang <feiya.zhang@cloud.com>

Signed-off-by: Luca Zhang <feiya.zhang@cloud.com>

minglumlu · 2024-01-25T08:09:37Z

Could you please give me a pointer on how to read out the improvements from the flame graph?
Both of them were collected in booting up 1000 VMs?
I think maybe another test case is good to test against set_difference a b. E.g. both a and b are of almost same and big size. Only booting up 1000 VMs may not be enough to stress this function.

duobei · 2024-01-25T08:34:22Z

Could you please give me a pointer on how to read out the improvements from the flame graph? Both of them were collected in booting up 1000 VMs? I think maybe another test case is good to test against set_difference a b. E.g. both a and b are of almost same and big size. Only booting up 1000 VMs may not be enough to stress this function.

I ran the load testing by booting up 2048 VMs:

#! /bin/bash
 
set -e
D=/boot/guest
 
mkdir -p $D
cd $D
wget https://github.com/lindig/xen-test-vm/releases/download/0.1.17/test-vm.xen.gz
UUID=$(xe vm-create name-label=minion)
xe vm-param-set PV-kernel=$D/test-vm.xen.gz uuid=$UUID
xe vm-memory-limits-set uuid=$UUID dynamic-max=16777216 dynamic-min=16777216 static-max=16777216 static-min=16777216
seq 1 2048 | xargs -P16 -n1 -I% xe vm-clone new-name-description=% new-name-label=% vm=minion
xe vm-list power-state=halted --minimal | tr , '\n' | xargs -P16 -n1 -I% sh -c "echo pausing % && xe vm-start paused=true uuid=%"

then generate the flame graph in 10 minutes :
perf record -F 97 -g --call-graph dwarf -p $SQUEEZED_PID -- sleep 600 && perf script > perf.script

I have sent files of raw flame graph svg to you by slack. Opening the flame graph in web browser could get more information.

psafont · 2024-01-25T09:06:03Z

I have sent files of raw flame graph svg to you by slack. Opening the flame graph in web browser could get more information.

The file in github can be downloaded and inspected. I don't know why opening the link directly doesn't allow for inspection.

duobei · 2024-01-25T09:10:40Z

Could you please give me a pointer on how to read out the improvements from the flame graph? Both of them were collected in booting up 1000 VMs? I think maybe another test case is good to test against set_difference a b. E.g. both a and b are of almost same and big size. Only booting up 1000 VMs may not be enough to stress this function.

For more inspection, I generated two differential flame graphs:

the refactored differ from before:
the before differ from refactored:

In a differential flame graph, the red and blue areas represent the differences between two flame graphs. The color scheme is used to indicate the direction and relative magnitude of the change between the two datasets.

Red: The red areas in the differential flame graph represent an increase or growth in the performance metric being visualized. The darker the red, the larger the increase in the metric for the specific frame in the stack. This indicates that the performance metric for those functions or code paths has increased in the second dataset compared to the first.

Blue: Conversely, the blue areas in the differential flame graph represent a decrease or reduction in the performance metric. The darker the blue, the larger the decrease in the metric for the specific frame in the stack. This indicates that the performance metric for those functions or code paths has decreased in the second dataset compared to the first.

The saturation of the colors is relative to the delta, so dark red or dark blue indicates a significant change, while very light frames can be ignored as they represent minimal changes.

See more in https://www.brendangregg.com/blog/2014-10-31/cpi-flame-graphs.html.

ocaml/squeezed/lib/squeeze.ml

psafont · 2024-01-25T09:52:13Z

The high towers in the map are a result of not having tail-recursive maps. The stacks can be squashed by piping the perf data through scripts in FlameGraph like so:

curl sftp://xenserver_HOST/root/perf.script | FlameGraph/stackcollapse-perf.pl --addrs | FlameGraph/stackcollapse-recursive.pl | FlameGraph/flamegraph.pl > perf-new.svg && open perf-new.svg

…ind a domain Signed-off-by: Luca Zhang <feiya.zhang@cloud.com>

psafont

Some of this changes in the interface will need deletion from the module, as it's unused.

ocaml/squeezed/lib/squeeze.mli

psafont · 2024-01-29T13:27:07Z

Could you squash the last 3 commits into one?

duobei · 2024-01-29T13:59:07Z

Could you squash the last 3 commits into one?

Yes, of course. Done.

psafont · 2024-01-29T14:00:56Z

The changes themselves look generally good to me.
The only doubt I have are the 2 folds that revert the order of some remaining list.
Have these changes passed specific ballooning tests?

duobei · 2024-01-29T14:12:06Z

The changes themselves look generally good to me. The only doubt I have are the 2 folds that revert the order of some remaining list. Have these changes passed specific ballooning tests?

All the tests in squeeze_test.ml has passed. There may indeed be an issue with the order. I'll check that tomorrow.

psafont · 2024-01-29T14:13:43Z

I was referring to whole-product tests using the internal framework ;)

Signed-off-by: Luca Zhang <feiya.zhang@cloud.com>

psafont

Code looks good to me, the last commit passed the DMC e2e tests in job 194022

minglumlu · 2024-01-31T06:21:57Z

Thanks. Now I understand the 2nd flamegraph the before differ from refactored shows the key difference in the test: the List.mem invoked in update_cooperative_table contributes to the majority of delta (dark red). This should be a result of the elimination of set_difference in this PR (List.mem -> Set.diff).
It would be nice to separate the tests to show the improvements frommap|iter -> fold and List.mem -> Set.diff separately. It's clear for List.mem -> Set.diff in the flamegraph. But it's not clear to me for the improvement resulted by map|iter -> fold yet. If there is no quadratic cost in iteration, both of them should be linear. Or I missed something again?

minglumlu · 2024-01-31T06:29:58Z

But anyway, the review and test have shown the correctness of the changes. Especially I believe these changes would result in much better performance when the a and b in set_difference a b are both of same order of size .

github-actions · 2024-01-31T06:46:25Z

pytype_reporter extracted 49 problem reports from pytype output.

You can check the results of the job here

duobei · 2024-01-31T07:31:11Z

It would be nice to separate the tests to show the improvements frommap|iter -> fold and List.mem -> Set.diff separately. It's clear for List.mem -> Set.diff in the flamegraph. But it's not clear to me for the improvement resulted by map|iter -> fold yet. If there is no quadratic cost in iteration, both of them should be linear. Or I missed something again?

There may be no iter -> fold. There are indeed many map -> fold.

In fact, we can't use Set.map as List.map. If we want to use Set.map, we only can use DomainSet.map that Set has specifically type.

The type of List.map is: ('a -> 'b) -> 'a list -> 'b list.
The type of DomainSet.map is: (elt -> elt) -> DomainSet.t -> DomainSet.t.

So we can't use DomainSet.map to get other type except DomainSet.t.
For example, we can't use DomainSet.map to get IntSet.

Without redundant transformation, there is no alternative to use DomainSet.fold.
If we really want to use map, we can firstly transform set to list use Set.elements or Set.to_seq |> List.of_seq, Set.to_list occurs only since 5.1.
In addition, the returned list of Set.elements is sorted in increasing order with respect to the ordering Ord.compare, where Ord is the argument given to Set.Make. Set.of_seq iterates on the whole set, in ascending order.

psafont reviewed Jan 24, 2024

View reviewed changes

ocaml/idl/datamodel_lifecycle.ml Outdated Show resolved Hide resolved

duobei force-pushed the private/fezhan/CA-372059 branch from b1c2e2b to 88acde0 Compare January 24, 2024 11:38

psafont reviewed Jan 24, 2024

View reviewed changes

ocaml/squeezed/lib/squeeze.ml Outdated Show resolved Hide resolved

psafont reviewed Jan 24, 2024

View reviewed changes

ocaml/squeezed/lib/squeeze.ml Outdated Show resolved Hide resolved

psafont reviewed Jan 24, 2024

View reviewed changes

ocaml/squeezed/lib/squeeze.ml Outdated Show resolved Hide resolved

psafont reviewed Jan 24, 2024

View reviewed changes

ocaml/squeezed/lib/squeeze.ml Outdated Show resolved Hide resolved

psafont reviewed Jan 24, 2024

View reviewed changes

ocaml/squeezed/lib/squeeze.ml Outdated Show resolved Hide resolved

minglumlu reviewed Jan 25, 2024

View reviewed changes

ocaml/squeezed/lib/squeeze.ml Outdated Show resolved Hide resolved

minglumlu reviewed Jan 25, 2024

View reviewed changes

ocaml/squeezed/lib/squeeze.ml Outdated Show resolved Hide resolved

duobei added 2 commits January 25, 2024 14:44

CA-372059: refactor the type of host in squeeze.ml

bc79114

use `Set` instead of `List` because `set_difference` implemented using `List` is quadratic cost Signed-off-by: Luca Zhang <feiya.zhang@cloud.com>

CA-372059: delete the unused code

d49f33b

Signed-off-by: Luca Zhang <feiya.zhang@cloud.com>

duobei force-pushed the private/fezhan/CA-372059 branch from d344e07 to e7fcfd9 Compare January 25, 2024 06:46

psafont reviewed Jan 25, 2024

View reviewed changes

ocaml/squeezed/lib/squeeze.ml Show resolved Hide resolved

psafont reviewed Jan 25, 2024

View reviewed changes

ocaml/squeezed/lib/squeeze.ml Outdated Show resolved Hide resolved

CA-372059: use Opt.value instead of match and use find_opt to f…

5f6c195

…ind a domain Signed-off-by: Luca Zhang <feiya.zhang@cloud.com>

duobei force-pushed the private/fezhan/CA-372059 branch from e7fcfd9 to 8668f4e Compare January 26, 2024 02:56

psafont requested changes Jan 26, 2024

View reviewed changes

duobei force-pushed the private/fezhan/CA-372059 branch from 4d3cd81 to 129220c Compare January 29, 2024 13:54

CA-372059: add an interface for squeeze.ml

229d486

Signed-off-by: Luca Zhang <feiya.zhang@cloud.com>

duobei force-pushed the private/fezhan/CA-372059 branch from 129220c to 229d486 Compare January 30, 2024 11:34

psafont approved these changes Jan 30, 2024

View reviewed changes

minglumlu approved these changes Jan 31, 2024

View reviewed changes

minglumlu merged commit 9b885bf into xapi-project:master Jan 31, 2024

Uh oh!

CA-372059: refactor the type of host in squeeze.ml #5387

CA-372059: refactor the type of host in squeeze.ml #5387

Uh oh!

Conversation

duobei commented Jan 24, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

minglumlu commented Jan 25, 2024

Uh oh!

duobei commented Jan 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

psafont commented Jan 25, 2024

Uh oh!

duobei commented Jan 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

psafont commented Jan 25, 2024

Uh oh!

psafont left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

psafont commented Jan 29, 2024

Uh oh!

duobei commented Jan 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

psafont commented Jan 29, 2024

Uh oh!

duobei commented Jan 29, 2024

Uh oh!

psafont commented Jan 29, 2024

Uh oh!

psafont left a comment

Choose a reason for hiding this comment

Uh oh!

minglumlu commented Jan 31, 2024

Uh oh!

minglumlu commented Jan 31, 2024

Uh oh!

github-actions bot commented Jan 31, 2024

pytype_reporter extracted 49 problem reports from pytype output.

Uh oh!

duobei commented Jan 31, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

CA-372059: refactor the type of host in `squeeze.ml` #5387

CA-372059: refactor the type of host in `squeeze.ml` #5387

duobei commented Jan 25, 2024 •

edited

Loading

duobei commented Jan 25, 2024 •

edited

Loading

duobei commented Jan 29, 2024 •

edited

Loading