improve the `LocalExecutor` builders #427

HippoBaro · 2021-09-28T01:17:52Z

This PR makes a series of changes to the builders and placement
strategies:

The Placement struct is renamed to PoolPlacement as it maps
placements for a pool of executors.
Embed the number of executors to create directly in the placement
policy. Instead of having two orthogonal configurations for a
LocalExecutorPoolBuilder(the number of executors to create and the CPU
placement policy) that can conflict with each other (for instance if we
provide a custom placement and a different number of shards to create),
merge the two together. A PoolPlacement now also takes (when
appropriate), the number of executors to create in addition to the
CpuSet to use. The LocalExecutorPoolBuilder constructor now takes a
PoolPlacement directly instead of the number of executors to create.
This simplifies the API and makes it harder to misuse.
Align the LocalExecutorBuilder API with the pooled version. The
pooled version is presently strictly superior to the non-pooled one
because it uses the PoolPlacement logic. It is therefore possible to
spawn a single executor that's fenced to a collection of CPUs using the
pooled API, but not with the non-pooled one. With the existing
LocalExecutorBuilder, an executor is either pinned to a specific CPU,
or to none at all. To remedy this, this commit creates a new Placement
enum that contains a subset of placement strategies that make sense for
a single executor. In turn, LocalExecutorBuilder::new now takes a
Placement strategy and the pin_to_cpu API is removed.

glommer · 2021-09-28T10:24:48Z

glommer

I like the fact that Placement is now the way to specify cpus in the single executor case. The only reason why it wasn't done that way of course is that the single executor preceded Placement so the consistency is good.

I am not sure I like the name Exact. I think I would prefer Fixed. Maybe there is even a case to be made that this is really just a special case of Fenced. Happy to hear from others.

davidblewett

This looks good. There are some doc changes that need to be adjusted to the new PoolPlacement variant format.

glommio/src/executor/placement/mod.rs

trtsl · 2021-09-28T16:42:49Z

I think the changes make sense. Though I do have a couple of thoughts related to PoolPlacements / CpuSets that may be worth revisiting but are not directly related to this pull request:

Given that the api now has several set operations, it may we worth having a closer look at PartialEq. It is possible for two placements to compare equal but for the pool builder to use different cpu sets (because it relies on iterating a hashset and the number of cpus requested may be less than the set size); it is also possible for cpus sets to compare not equal but run on the same cpus (when the cpu sets are different but the number of cpus selected is less than set size). Finally, Placement::Custom still relies on a Vec which is dependent on ordering. One option: it would be possible for the implementation of PartialEq to check what CpuSets will in fact be used and then return the result ... perhaps even allowing e.g. Placement::Custom to equal Placement::MaxSpread if all the CpuSets that the pool builder will use match.
The changes from this weekend now allow construction of arbitrary CpuSets by the user (i.e. with invalid CpuLocations). An invalid CpuSet in the pool builder would likely fail silently.

glommio/src/executor/placement/mod.rs

github-actions · 2021-10-31T22:31:08Z

Greetings @HippoBaro!

It looks like your PR added a new or changed an existing dependency, and CI has failed to validate your changes.
Some possible reasons this could happen:

One of the dependencies you added uses a restricted license. See deny.toml for a list of licenses we allow;
One of the dependencies you added has a known security vulnerability;
You added or updated a dependency and didn't update the LICENSE-3rdparty.csv file. To do so, run the following and commit the changes:

$ cargo install cargo-license
$ cargo license --all-features -a -j --no-deps -d | jq -r '(["Component","Origin","License","Copyright"]) as $cols | map(. as $row | ["name", "repository", "license", "authors"] | map($row[.])) as $rows | $cols, $rows[] | @csv' > LICENSE-3rdparty.csv

Thank you!

Instead of having two orthogonal configurations (the number of executors to create and the CPU placement policy) that can conflict with each other (for instance, if we provide a custom placement and a different number of shards to create), merge the two. A `Placement` now takes (when appropriate) the number of executors to create in addition to the `CpuSet` to use. The `LocalExecutorPoolBuilder` constructor now takes a `Placement` directly instead of the number of executors to create.

The next commit in this series will reintroduce the `Placement` that will be suited for when creating a single executor.

The pooled version is presently superior to the non-pooled one because it uses the `PoolPlacement` logic. With the existing `LocalExecutorBuilder,` an executor is either pinned to a specific CPU or none at all. Therefore, it's possible to spawn a single executor that's fenced to a collection of CPUs using the pooled API but not with the non-pooled one. This commit creates a new `Placement` enum that contains placement strategies that make sense for a single executor. In turn, `LocalExecutorBuilder::new` now takes a placement strategy, and the `pin_to_cpu` API is removed.

Now that we allow users to provide hand-crafter `CpuSet,` we open the door to receiving CPUs that don't exist. To prevent this we systematically check that they are part of the `online()` set of CPUs that are available to the current process.

HippoBaro · 2021-11-04T17:40:54Z

The changes from this weekend now allow construction of arbitrary CpuSets by the user (i.e. with invalid CpuLocations). An invalid CpuSet in the pool builder would likely fail silently.

Yeah, this would be pretty bad. I've added a runtime check that asserts any CpuLocation we deal with is part of the CPU set available to the current process.

I haven't acted on the PartialEq feedback here because I feel this would deserve a dedicated PR.

glommer approved these changes Sep 28, 2021

View reviewed changes

davidblewett requested changes Sep 28, 2021

View reviewed changes

trtsl reviewed Sep 28, 2021

View reviewed changes

glommio/src/executor/placement/mod.rs Show resolved Hide resolved

HippoBaro force-pushed the better_placement branch from 310e418 to 4cd1db7 Compare October 31, 2021 22:28

HippoBaro added 4 commits November 4, 2021 18:34

rename Placement to PoolPlacement

9c68685

The next commit in this series will reintroduce the `Placement` that will be suited for when creating a single executor.

HippoBaro force-pushed the better_placement branch from 4cd1db7 to 37c7f69 Compare November 4, 2021 17:37

update to rust 1.56

0e3a34e

HippoBaro merged commit 15e9291 into DataDog:master Nov 4, 2021

HippoBaro deleted the better_placement branch November 4, 2021 18:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve the `LocalExecutor` builders #427

improve the `LocalExecutor` builders #427

HippoBaro commented Sep 28, 2021

glommer commented Sep 28, 2021

glommer left a comment

davidblewett left a comment

trtsl commented Sep 28, 2021

github-actions bot commented Oct 31, 2021

HippoBaro commented Nov 4, 2021

improve the LocalExecutor builders #427

improve the LocalExecutor builders #427

Conversation

HippoBaro commented Sep 28, 2021

glommer commented Sep 28, 2021

glommer left a comment

Choose a reason for hiding this comment

davidblewett left a comment

Choose a reason for hiding this comment

trtsl commented Sep 28, 2021

github-actions bot commented Oct 31, 2021

HippoBaro commented Nov 4, 2021

improve the `LocalExecutor` builders #427

improve the `LocalExecutor` builders #427