
[Search Pipelines] Base Benchmarks #7782

Closed
macohen opened this issue May 26, 2023 · 5 comments
Labels: Search (Search query, autocomplete, etc.), v2.10.0
macohen (Contributor) commented May 26, 2023

Benchmarking is a good way to measure how changes to our code impact performance over time. This RFC should cover benchmarking for Search Pipelines and processors that will be included in the release.

Use the OpenSearch Benchmark tools to benchmark a very simple search (the goal is to benchmark pipelines, not search itself):

  • using the _search endpoint alone, to establish a baseline
  • using a search pipeline with no processors (to measure the overhead of search pipelines vs. _search alone)
  • for each processor, benchmarking a pipeline with only that processor in use (to measure the overhead of each processor)
  • for all processors available in core, benchmarking a pipeline with all of them in use (this may be a little silly if we have 12 reranking processors, but if we think of it as eXtreme Search Pipelining it will sound cooler)
  • encouraging, but not requiring, owners of processors that are not part of the OpenSearch Project release to benchmark their processors as well
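As a rough sketch of what these configurations could look like (index and pipeline names are placeholders, and `filter_query` is just one example of a single-processor pipeline; exact processor names should be checked against the search pipelines documentation):

```shell
# A pipeline with no processors ("no-op"), for the overhead comparison.
curl -XPUT "localhost:9200/_search/pipeline/noop_pipeline" \
  -H 'Content-Type: application/json' -d'
{
  "request_processors": [],
  "response_processors": []
}'

# A single-processor pipeline, one per processor under test.
curl -XPUT "localhost:9200/_search/pipeline/filter_only" \
  -H 'Content-Type: application/json' -d'
{
  "request_processors": [
    { "filter_query": { "query": { "term": { "visible": true } } } }
  ]
}'

# Baseline vs. pipelined search on the same index:
curl "localhost:9200/my-index/_search?q=test"
curl "localhost:9200/my-index/_search?q=test&search_pipeline=filter_only"
```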

What can we do in the short term?
Do these make sense for benchmarks? What else would we want to measure?
How do we set up the OpenSearch Benchmarking tools for this?

@macohen macohen converted this from a draft issue May 26, 2023
@macohen macohen added Search Search query, autocomplete ...etc and removed untriaged labels May 26, 2023
msfroh (Collaborator) commented May 30, 2023

> using a search pipeline with no processors (how much overhead for search pipelines vs _search alone)

These are identical -- they follow the same code path. A request that specifies no search pipeline uses the "no-op" pipeline (which is a pipeline without processors).

@msfroh msfroh moved this to Next (Next Quarter) in Search Project Board May 30, 2023
@macohen macohen moved this from Next (Next Quarter) to Now(This Quarter) in Search Project Board Jun 12, 2023
noCharger (Contributor) commented:
Search Pipelines and Processors Benchmarking Plan

Goal

The primary goal of this benchmark is to evaluate the performance impact of the various search pipelines and processors available in the OpenSearch Project release. We will measure this by comparing the performance of the bare _search endpoint against that of pipelines and processors of varying complexity.

General Assumptions

  • All tests are performed in a controlled and isolated environment with the same hardware specifications, avoiding external factors that may affect the benchmark results.
  • Tests are conducted using the same set of data to maintain consistency.
  • The performance measurements focus on the time taken to process the queries, but other metrics such as CPU usage and memory consumption can also be considered.
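For the latency comparison itself, one simple way to express per-configuration overhead is relative to the baseline median. A minimal sketch (the function name and sample values are illustrative, not part of the plan):

```python
import statistics

def relative_overhead(baseline_ms, candidate_ms):
    """Return the candidate's median-latency overhead vs. baseline, as a fraction.

    E.g. 0.05 means the candidate configuration is 5% slower at the median.
    Medians are used instead of means to reduce the impact of outliers.
    """
    base = statistics.median(baseline_ms)
    cand = statistics.median(candidate_ms)
    return (cand - base) / base

# Illustrative samples: _search alone vs. a single-processor pipeline.
baseline = [10.1, 9.8, 10.3, 10.0, 9.9]          # median 10.0 ms
with_processor = [10.9, 11.2, 11.0, 10.8, 11.1]  # median 11.0 ms

print(f"overhead: {relative_overhead(baseline, with_processor):.1%}")
```

The same calculation can be repeated per percentile (p90, p99) if tail latency matters more than the median for a given processor.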

Benchmarks

The following table outlines the tests we plan to conduct, their features, and any notes related to them:

| Feature | Test Case | Notes |
| --- | --- | --- |
| Baseline | Use the _search endpoint alone, without any pipelines or processors. | Provides the baseline for comparison. |
| Pipeline without processors | Use a search pipeline without any processors. | Measures the overhead of the no-op pipeline alone. |
| Single processor | For each processor, create a pipeline with only that processor in use. Also run the same processor ad hoc via the _search request. | Measures the overhead of each processor. |
| All core processors | Create a pipeline with all available processors in use. Also run the same processors ad hoc via the _search request. | An extreme test to measure the overall impact. |
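A sketch of how these test cases might be driven with OpenSearch Benchmark (the workload name and host are placeholders; exact flags should be checked against the opensearch-benchmark documentation):

```shell
# Baseline run against the plain _search endpoint.
# Note: --pipeline here is OpenSearch Benchmark's own concept
# (benchmark-only = use an existing cluster), unrelated to search pipelines.
opensearch-benchmark execute-test \
  --pipeline=benchmark-only \
  --workload=geonames \
  --target-hosts=localhost:9200

# Repeat with the same workload, modified so its search operations pass a
# search_pipeline query parameter, and compare the reported latencies.
```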

Future Work

While this benchmarking plan provides a good starting point, we will need to iterate and refine it based on our findings. The results of these tests will help us understand how different search pipelines and processors impact the overall performance of our system and guide us in future optimization efforts.

@noCharger noCharger self-assigned this Jun 22, 2023
@noCharger noCharger added the v2.9.0 'Issues and PRs related to version v2.9.0' label Jul 3, 2023
@noCharger noCharger moved this from Now(This Quarter) to 🏗 In progress in Search Project Board Jul 8, 2023
@noCharger noCharger added v2.10.0 and removed v2.9.0 'Issues and PRs related to version v2.9.0' labels Jul 11, 2023
noCharger (Contributor) commented:
For 2.9 we will have the benchmark data available. For 2.10, we are planning to have the benchmark dashboard integrated.

@noCharger noCharger moved this from 🏗 In progress to Now(This Quarter) in Search Project Board Jul 17, 2023
@mingshl mingshl moved this from Now(This Quarter) to 🏗 In progress in Search Project Board Aug 14, 2023
noCharger (Contributor) commented Aug 15, 2023

Optional:

There are two main efforts:

noCharger (Contributor) commented:
Closing this issue since all tasks are merged.
