Add naive benchmark #321
Conversation
Force-pushed from 9821be5 to dc04a97
Build Succeeded 🥳 Build Id: 26baa30b-8b9d-4017-9907-d680e254ca14 To build this version:
Allocating the 64K buffer on the stack makes the future expensive to move. This allocates the buffer on the heap instead. Noticed some significant perf improvements in load tests via https://github.com/majek/dump/tree/master/how-to-receive-a-million-packets. Tried using the naive benchmark from #321, but that doesn't seem consistent at the moment - I constantly got perf regressions/improvements on reruns even with no code change (I think either running both the proxy and the mock server within the same process/scheduler adds too much noise, or we don't have a large enough unit of work). Relates to #330
* Move packet buffer to heap
* Initialize buffer
Co-authored-by: Mark Mandel <markmandel@google.com>
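For context, here is a minimal sketch of the idea (not the actual Quilkin code; the function name and buffer size are illustrative): keeping the 64 KiB buffer in a `Vec` means the async future only stores a pointer, length, and capacity, rather than embedding the whole buffer in its state.

```rust
// Minimal sketch, assuming a tokio UDP receive loop (names are illustrative).
use tokio::net::UdpSocket;

async fn recv_loop(socket: UdpSocket) -> std::io::Result<()> {
    // Heap-allocated buffer: the future stays small and cheap to move,
    // instead of carrying 64 KiB inline.
    let mut buf = vec![0u8; 65_536];
    loop {
        let (len, addr) = socket.recv_from(&mut buf).await?;
        // ... process buf[..len] received from addr ...
        let _ = (len, addr);
    }
}
```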
LGTM! Really like the html reports for comparison, just had a small addition I'd like to see for those not as familiar with Rust tooling.
Side thought: Do we want to set this up to run nightly and dump the reports into a cloud storage bucket somewhere?
Approving so you can merge when ready 👍🏻
Well, if we do, we shouldn't try to use it for measuring performance regressions. Criterion specifically warns against running in CI environments, because it's very sensitive to noisy neighbours. I don't know how much of an issue that would be with GCP. We could still publish them just to look at; we could add it as part of the deploy-docs job.
Ah interesting. We're currently running on E2-series VMs, which aren't specifically built for isolation. I'd be surprised if it was as noisy as a CI platform, but it's not totally isolated. Something to think about at least, or maybe experiment with, and see what results we get.
Co-authored-by: Mark Mandel <markmandel@google.com>
Build Succeeded 🥳 Build Id: 6dcdd5ac-0a19-4e87-912b-f242cde1ce70 To build this version:
Build Succeeded 🥳 Build Id: 99868598-61d7-4c71-8021-d75ee1743f79 To build this version:
This adds a naive Criterion benchmark suite that runs two separate benchmarks to create a comparison. In both cases we're simply sending packets back and forth, but in the first case the UDP socket talks directly to another socket, and in the second case Quilkin sits as a middle-man between the two. It should be strongly noted that this benchmark heavily favours the direct case: there is zero latency or packet loss, and the workload is mostly blocked on syscalls, so it is not a fair comparison of Quilkin's real-world performance. What it does provide is a baseline for the overhead of processing a packet in Quilkin.
I've provided what it looks like on a 2019 MBP; currently Quilkin is about two-thirds slower than reading directly. (Units are in μs.)
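For readers who haven't looked at the suite, here is a hypothetical sketch of what the two-case Criterion comparison could look like. The group name, ports, packet size, and round-trip helper below are assumptions for illustration, not the code added in this PR, and both cases assume an echo server (and, for the second case, a Quilkin proxy forwarding to it) are already running.

```rust
// Hypothetical sketch of the two-case comparison described above.
use std::net::UdpSocket;

use criterion::{criterion_group, criterion_main, Criterion};

const PACKET: &[u8] = &[0xAB; 512];

// Send one packet to `target` and wait for the echo, measuring a round trip.
fn round_trip(socket: &UdpSocket, target: &str) {
    socket.send_to(PACKET, target).unwrap();
    let mut buf = [0u8; 1 << 16];
    socket.recv_from(&mut buf).unwrap();
}

fn throughput(c: &mut Criterion) {
    let socket = UdpSocket::bind("127.0.0.1:0").unwrap();
    let mut group = c.benchmark_group("throughput");
    // Case 1: talk straight to the echo server (assumed to listen on 8078).
    group.bench_function("direct", |b| {
        b.iter(|| round_trip(&socket, "127.0.0.1:8078"))
    });
    // Case 2: same traffic, routed through a Quilkin proxy (assumed on 8079).
    group.bench_function("via-quilkin", |b| {
        b.iter(|| round_trip(&socket, "127.0.0.1:8079"))
    });
    group.finish();
}

criterion_group!(benches, throughput);
criterion_main!(benches);
```

A suite like this would normally live under `benches/` with `harness = false` set for it in `Cargo.toml`; running `cargo bench` then produces the HTML comparison reports under `target/criterion/`.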