
change min/max to overall_min/overall_max + update comparison results publisher #687

Closed
OVI3D0 wants to merge 2 commits

Conversation

@OVI3D0 (Member) commented Oct 29, 2024

Description

This change updates the min/max values in aggregated results to reflect the true min/max across all test executions, rather than an average of the per-execution min/max values. I also renamed min and max to overall_min and overall_max for clarity. The comparison results publisher class was updated as well, so that if an aggregated result containing these new metrics is used in a comparison, the 'overall' min and max values can still be compared without issue.
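To make the difference concrete, a tiny illustrative example (the numbers are made up; this is not code from the PR):

```python
per_execution = {"min": [10, 12, 8], "max": [95, 120, 110]}  # one value per test execution

# previous behaviour: average the per-execution extremes
avg_min = sum(per_execution["min"]) / len(per_execution["min"])  # 10.0
avg_max = sum(per_execution["max"]) / len(per_execution["max"])  # ~108.33

# behaviour after this change: the true extremes across every execution,
# published under the clearer names overall_min / overall_max
overall_min = min(per_execution["min"])  # 8
overall_max = max(per_execution["max"])  # 120
```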

Issues Resolved

#684

Testing

  • New functionality includes testing

make test


By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@gkamat (Collaborator) left a comment

Please consider adding (or updating) a test as well.

    weighted_metrics[metric]['overall_max'] = max(item_values)
elif item_key == 'median':
    weighted_sum = sum(value * iterations for value in item_values)
    total_iterations = iterations * len(item_values)
Collaborator:

Isn't this already available from above and does not need to be re-computed?

@OVI3D0 (Member, Author):

True, I went ahead and moved this calculation outside of the loop.

Comment on lines 213 to 216
else:
    weighted_sum = sum(value * iterations for value in item_values)
    total_iterations = iterations * len(item_values)
    weighted_metrics[metric][item_key] = weighted_sum / total_iterations
Collaborator:

The elif is probably not needed here, since the code in the two arms is the same?

Collaborator:

+1, we can just have a comment in the else branch, right after the else, stating what cases this is for:

else:
    # for items like median

@OVI3D0 (Member, Author):

Got it, I added a comment to clarify what cases this is for
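Putting the two suggestions from this thread together, the refactored branch could look roughly like this (a sketch only; the function name, shape, and input format are assumed rather than taken from the actual aggregator code):

```python
def combine_statistics(item_values_by_key, iterations):
    """Hypothetical illustration: the weighted-mean denominator is computed once
    outside the branches, and the former 'median' elif is folded into the else
    arm with a clarifying comment.

    item_values_by_key is assumed to map statistic names to per-execution values,
    e.g. {"min": [10, 12], "max": [95, 120], "median": [40, 48]}.
    """
    combined = {}
    # every key holds one value per execution, so the denominator never changes
    num_executions = len(next(iter(item_values_by_key.values())))
    total_iterations = iterations * num_executions

    for item_key, item_values in item_values_by_key.items():
        if item_key == 'min':
            combined['overall_min'] = min(item_values)
        elif item_key == 'max':
            combined['overall_max'] = max(item_values)
        else:
            # for items like median: use the iteration-weighted mean
            weighted_sum = sum(value * iterations for value in item_values)
            combined[item_key] = weighted_sum / total_iterations
    return combined
```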

@@ -464,16 +464,16 @@ def _write_results(self, metrics_table, metrics_table_console):
                        data_plain=metrics_table, data_rich=metrics_table_console)

     def _publish_throughput(self, baseline_stats, contender_stats, task):
-        b_min = baseline_stats.metrics(task)["throughput"]["min"]
+        b_min = baseline_stats.metrics(task)["throughput"].get("overall_min") or baseline_stats.metrics(task)["throughput"]["min"]
Collaborator:

It would probably be better to set the overall_min key prior, so dealing with a special case won't be necessary.

@OVI3D0 (Member, Author):

I think this logic is necessary, since the results publisher is also used for normal test executions, which will not contain 'overall' min/max values.
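In other words, the `.get(...) or ...` fallback lets one code path serve both kinds of results. A minimal sketch (the helper name and the stats dicts below are illustrative, not real OSB output):

```python
def resolve_throughput_min(throughput_stats):
    # prefer the aggregated 'overall_min' when present; fall back to the
    # per-execution 'min' that a normal (non-aggregated) test produces
    return throughput_stats.get("overall_min") or throughput_stats["min"]

aggregated = {"overall_min": 8, "overall_max": 120}  # aggregated result: new keys
single_run = {"min": 10, "max": 110}                 # regular test execution: original keys

assert resolve_throughput_min(aggregated) == 8
assert resolve_throughput_min(single_run) == 10
```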

         b_unit = baseline_stats.metrics(task)["throughput"]["unit"]

-        c_min = contender_stats.metrics(task)["throughput"]["min"]
+        c_min = contender_stats.metrics(task)["throughput"].get("overall_min") or contender_stats.metrics(task)["throughput"]["min"]
Collaborator:

This change was made in the ComparisonResultsPublisher, but should it also be made in the SummaryResultsPublisher?

@OVI3D0 (Member, Author):

I don't think so, since the summary results publisher isn't used by the aggregator class. That said, if we wanted to publish results after aggregation, this would be necessary.

@IanHoang (Collaborator) left a comment

Could you add tests to results_publisher to confirm the changes work?

@IanHoang (Collaborator) left a comment

Left a few comments

Commit: … publisher

Signed-off-by: Michael Oviedo <mikeovi@amazon.com>
@OVI3D0 (Member, Author) commented Nov 12, 2024

> Could you add tests to results_publisher to confirm the changes work?

I added a test for this, but let me know if I should add more!

I also updated the unit tests for the aggregator class.
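For illustration only, a unit test for the aggregated statistics could look roughly like the following; it reuses the hypothetical combine_statistics sketch from earlier in this conversation and is not the test actually added in this PR:

```python
import unittest

class AggregatedMinMaxTest(unittest.TestCase):
    def test_overall_min_max_span_all_executions(self):
        combined = combine_statistics(
            {"min": [10, 12, 8], "max": [95, 120, 110], "median": [40, 48, 44]},
            iterations=5,
        )
        # overall_min/overall_max are the true extremes across all executions
        self.assertEqual(combined["overall_min"], 8)
        self.assertEqual(combined["overall_max"], 120)
        # median remains the iteration-weighted mean
        self.assertAlmostEqual(combined["median"], (40 + 48 + 44) / 3)
```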

Signed-off-by: Michael Oviedo <mikeovi@amazon.com>