
Implement check-result: test check failure affects test result by default #3239

Open · wants to merge 6 commits into base: main

Conversation

@martinhoyer (Collaborator) commented Sep 25, 2024

Trying to address #3185

Pull Request Checklist

  • implement the feature
  • write the documentation
  • extend the test coverage
  • update the specification
  • modify the json schema
  • mention the version
  • include a release note

@martinhoyer added the status | need help, status | need tests, status | need docs, and area | results labels Sep 25, 2024
@martinhoyer martinhoyer self-assigned this Sep 25, 2024
@martinhoyer martinhoyer changed the title Have failure in check result propagate to the test result Implement check-result: test check failure affects test result by default Sep 26, 2024
@martinhoyer (Collaborator, Author) commented Sep 26, 2024

Rebased onto @happz's result-store-original-result #3147 and tried to use beakerlib for the first time. It feels weird to do all the assertgreps, but I've merely been following the existing examples.

Still a proof of concept, but at least I finally figured out that the "after-test" checks were not available without the change in execute/internal.py.

@happz (Collaborator) commented Sep 26, 2024

@martinhoyer if you change the base branch to the one from #3147, the diff should reduce a bit.

@martinhoyer (Collaborator, Author):

tmt.utils.GeneralError: Test check 'dmesg' was not found in check registry.

perplexing

@psss psss added this to the 1.38 milestone Oct 1, 2024
@martinhoyer martinhoyer removed the status | need help Extra attention is needed label Oct 1, 2024

@happz (Collaborator) commented Oct 9, 2024

> Trying to implement the check results as discussed yesterday. Started from scratch. Still don't understand specs, schemas and docs generation. Why can't we just have one place to define things?
>
> > We can. It somehow started separately, in an organic way, as the project grew, and the amount of work necessary to turn the tide and introduce a single source of truth is not trivial. The work is ongoing, and one possible answer might be that I, who also dislike the currently fractured state of things, am unable to change it all in one night.
>
> Right, but isn't that a sunk cost fallacy? I'm not saying you should do it alone in one night, but perhaps have a defined "state" to strive for and think about whether it wouldn't be worth working towards in the long run.

Yeah, that was just an example. There is a long-term goal, it just takes many small steps to get there. Just recently plugin docs began to be generated from plugin sources. That's useful on its own, but it also served as an experiment on what we would need to render plugin schemas from plugin sources. And so on: we're moving toward a much simpler state of things, with fewer sources of truth; it's just very slow.

@martinhoyer martinhoyer force-pushed the check-results branch 2 times, most recently from 8a3d12b to 1bec7f6 Compare October 11, 2024 19:28
@martinhoyer (Collaborator, Author):

Not at all sure about what's happening in execute/internal

@@ -493,15 +493,3 @@ definitions:
type: string
# yamllint disable-line rule:line-length
pattern: "^\\d{2,}-\\d{2}-\\d{2}T\\d{2}:\\d{2}:\\d{2}\\.\\d+\\+\\d{2}:\\d{2}$"
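For reference, the pattern above accepts timestamps with mandatory fractional seconds and a positive UTC offset only. A quick standalone sanity check (the sample timestamps are illustrative):

```python
import re

# Same pattern as in the schema, without the YAML escaping.
pattern = r"^\d{2,}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}\.\d+\+\d{2}:\d{2}$"

# Matches: fractional seconds and a positive offset are present.
print(bool(re.match(pattern, "2024-10-11T19:28:00.123456+02:00")))

# Does not match: fractional seconds are required by the pattern.
print(bool(re.match(pattern, "2024-10-11T19:28:00+02:00")))
```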

@martinhoyer (Collaborator, Author):

Does it make sense to move it to test.yaml, or is check expected to be used elsewhere in the future?


I am hoping we can move the triggering of checks to the plan, with the expectation that the check will operate on each test in the plan. But I am not sure whether that work has been green-lit yet.

@@ -30,8 +30,10 @@
{% else %}
Default: {% for default_item in actual_default %}``{{ default_item.pattern | default(default_item) }}``{% if not loop.last %}, {% endif %}{% endfor %}
{% endif %}
{% elif actual_default.__class__.__name__ == 'CheckResultInterpret' %}
@martinhoyer (Collaborator, Author):

An ugly workaround.
@happz, do you have any cleaner solution to point me to, please? I spent way more time than I'd like to admit trying to solve this.

@happz (Collaborator):

Yeah, this is not very nice :/ What fields forced you to take this road? result is the new one, but it's a string, just like how, so I don't immediately recall what the problem might be. What's logged by the script?

@happz (Collaborator):

Nevermind, I can try it out.

@martinhoyer (Collaborator, Author) commented Oct 16, 2024

It says CheckResultInterpret.RESPECT is the 'actual_default'. I've tried adding exporter=lambda result: result.value to the Check.result field, but to no avail.
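The underlying issue can be reproduced in isolation: an enum member is not a plain string, so a serializer has to export its .value explicitly. A minimal sketch (Check and to_spec here are simplified stand-ins, not the actual tmt container API):

```python
import dataclasses
import enum


class CheckResultInterpret(enum.Enum):
    RESPECT = "respect"
    XFAIL = "xfail"
    INFO = "info"


@dataclasses.dataclass
class Check:
    """Simplified stand-in for a tmt check definition."""

    how: str
    result: CheckResultInterpret = CheckResultInterpret.RESPECT

    def to_spec(self) -> dict:
        # The enum member must be exported as its plain string value,
        # otherwise consumers see 'CheckResultInterpret.RESPECT'.
        return {"how": self.how, "result": self.result.value}


print(Check(how="dmesg").to_spec())  # {'how': 'dmesg', 'result': 'respect'}
```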

@happz (Collaborator):

Yep, got it, it's an enum, I suppose it's the first one in plugins, and we can solve it once and for all :)

diff --git a/docs/scripts/generate-plugins.py b/docs/scripts/generate-plugins.py
index a37d83c1..9368f731 100755
--- a/docs/scripts/generate-plugins.py
+++ b/docs/scripts/generate-plugins.py
@@ -1,6 +1,7 @@
 #!/usr/bin/env python3
 
 import dataclasses
+import enum
 import sys
 import textwrap
 from typing import Any
@@ -106,6 +107,12 @@ def container_intrinsic_fields(container: ContainerClass) -> list[str]:
     return field_names
 
 
+def is_enum(value: Any) -> bool:
+    """ Find out whether a given value is an enum member """
+
+    return isinstance(value, enum.Enum)
+
+
 def _create_step_plugin_iterator(registry: tmt.plugins.PluginRegistry[tmt.steps.Method]):
     """ Create iterator over plugins of a given registry """
 
@@ -184,6 +191,7 @@ def main() -> None:
         STEP=step_name,
         PLUGINS=plugin_generator,
         REVIEWED_PLUGINS=REVIEWED_PLUGINS,
+        is_enum=is_enum,
         container_fields=tmt.utils.container_fields,
         container_field=tmt.utils.container_field,
         container_ignored_fields=container_ignored_fields,
diff --git a/docs/templates/plugins.rst.j2 b/docs/templates/plugins.rst.j2
index 5b99149c..5969cc4c 100644
--- a/docs/templates/plugins.rst.j2
+++ b/docs/templates/plugins.rst.j2
@@ -30,7 +30,7 @@
             {% else %}
     Default: {% for default_item in actual_default %}``{{ default_item.pattern | default(default_item) }}``{% if not loop.last %}, {% endif %}{% endfor %}
             {% endif %}
-        {% elif actual_default.__class__.__name__ == 'CheckResultInterpret' %}
+        {% elif is_enum(actual_default) %}
     Default: ``{{ actual_default.value }}``
         {% else %}
             {% set _ = LOGGER.warn("%s/%s.%s: could not render default value, '%s'" | format(STEP, plugin_id, field_name, actual_default), shift=0) %}
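The is_enum helper from the diff can be exercised standalone (the enum definition below is a simplified stand-in for tmt's CheckResultInterpret):

```python
import enum
from typing import Any


class CheckResultInterpret(enum.Enum):
    RESPECT = "respect"
    XFAIL = "xfail"
    INFO = "info"


def is_enum(value: Any) -> bool:
    """Find out whether a given value is an enum member"""
    return isinstance(value, enum.Enum)


# Any enum member is detected, so the template no longer needs to
# hard-code the 'CheckResultInterpret' class name.
print(is_enum(CheckResultInterpret.RESPECT))  # True
print(is_enum("respect"))                     # False
print(CheckResultInterpret.RESPECT.value)     # respect
```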

@martinhoyer (Collaborator, Author):

Nice, thanks.

@martinhoyer (Collaborator, Author):

@psss @happz I've marked the 'outdated' discussions as resolved, as it got quite messy with all the refactoring - sorry. I believe all the points have been incorporated though.

@martinhoyer added the ci | full test label Oct 16, 2024
@martinhoyer (Collaborator, Author):

/packit build

@martinhoyer added the status | need help label Oct 17, 2024
@psss added the priority | must label Oct 18, 2024
@@ -38,7 +38,7 @@ rlJournalStart
run "errr" "/test/error" "" 2
run "pass" "/test/xfail-fail" "fail" 0
run "fail" "/test/xfail-pass" "pass" 1
@martinhoyer (Collaborator, Author):

Are we ok with this? I see no reason to add a note when the original and actual results are the same, no?


return _result.interpret_result(invocation.test.result)
interpret_checks = {check.how: check.result for check in invocation.test.check}
@martinhoyer (Collaborator, Author):

This way doesn't really work when multiple check.how/check.name values are the same. Surely there has to be a better way to add/match a CheckResultInterpret to each Check?
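The collision is easy to demonstrate in isolation: keying the mapping by check.how silently drops all but the last entry when two checks share the same how (values illustrative):

```python
# Two checks of the same kind, with different result interpretations.
checks = [
    {"how": "dmesg", "result": "respect"},
    {"how": "dmesg", "result": "xfail"},
]

# Later keys overwrite earlier ones, so the 'respect' entry is lost.
interpret_checks = {check["how"]: check["result"] for check in checks}
print(interpret_checks)  # {'dmesg': 'xfail'}
```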

@happz (Collaborator):

I believe we spoke about a way: there is a trivial mapping between a result and its parent test; TestInvocation stores results & points to the test. We did not need to transition from a check result to a check, therefore there is no such mapping yet; we need to establish one. My proposal was to add a dedicated mapping to the TestInvocation class which would map CheckResult to Check instances, and it would be populated in _run_checks_for_test() (most likely), as that is the place where check results are born. Something like this:

diff --git a/tmt/steps/execute/__init__.py b/tmt/steps/execute/__init__.py
index 407b4ac0..7f77d179 100644
--- a/tmt/steps/execute/__init__.py
+++ b/tmt/steps/execute/__init__.py
@@ -16,6 +16,7 @@ import fmf.utils
 
 import tmt
 import tmt.base
+import tmt.checks
 import tmt.log
 import tmt.steps
 import tmt.utils
@@ -175,6 +176,9 @@ class TestInvocation:
     results: list[Result] = dataclasses.field(default_factory=list)
     check_results: list[CheckResult] = dataclasses.field(default_factory=list)
 
+    check_result_to_check: dict[CheckResult, tmt.checks.Check] = dataclasses.field(
+        default_factory=dict)
+
     check_data: dict[str, Any] = field(default_factory=dict)
 
     return_code: Optional[int] = None
@@ -954,6 +958,8 @@ class ExecutePlugin(tmt.steps.Plugin[ExecuteStepDataT, None]):
                 result.end_time = format_timestamp(timer.end_time)
                 result.duration = format_duration(timer.duration)
 
+                invocation.check_result_to_check[result] = check
+
             results += check_results
 
         return results

It's not clean, it's not nice, I'd like to make it look and feel like the test invocation, but I don't have that ready and I promise to polish it later. With this, the following should work: use this mapping to collect check result/interpretation pairs, and use the right one for the given check result when calling the interpretation method:

interpret_checks = {
    check_result: check.result for check_result, check in invocation.check_result_to_check.items()
}

...

check_result.interpret_check_result(interpret_checks[check_result])
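The proposal above can be sketched end-to-end with simplified mock classes (not the actual tmt containers): mapping by object identity keeps two same-named checks distinct where a {how: result} dict would collapse them.

```python
import dataclasses


# eq=False keeps object-identity hashing, so two checks or results with
# identical fields remain distinct dictionary keys.
@dataclasses.dataclass(eq=False)
class Check:
    """Simplified stand-in for tmt.checks.Check."""
    how: str
    result: str  # interpretation: "respect" / "xfail" / "info"


@dataclasses.dataclass(eq=False)
class CheckResult:
    """Simplified stand-in for tmt's CheckResult."""
    name: str
    outcome: str


# Two checks sharing the same 'how', as in the problematic case above.
check_a = Check(how="dmesg", result="respect")
check_b = Check(how="dmesg", result="xfail")
result_a = CheckResult(name="dmesg", outcome="fail")
result_b = CheckResult(name="dmesg", outcome="fail")

# Populated where check results are born (per the proposal,
# in _run_checks_for_test()).
check_result_to_check = {result_a: check_a, result_b: check_b}

# Each check result now resolves to its own check's interpretation.
interpret_checks = {cr: c.result for cr, c in check_result_to_check.items()}
print(interpret_checks[result_a], interpret_checks[result_b])  # respect xfail
```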

if interpret == CheckResultInterpret.INFO:
self.result = ResultOutcome.INFO

elif interpret == CheckResultInterpret.XFAIL and self.event != CheckEvent.BEFORE_TEST:
@martinhoyer (Collaborator, Author):

I was thinking about adding logic that would check both before and after, and make the xfail 'pass' only if the after check (or both) failed, but I wasn't sure if it's needed. ?

@martinhoyer (Collaborator, Author):

On second thought, this does need to be improved.
So, what are the expectations?
before fails > pass
before passes > pass
after fails > pass, or after fails > fail?
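For illustration, here is one possible reading of these expectations as a standalone sketch. The enums are simplified, not the actual tmt classes, and the before/after semantics shown are just one of the options being asked about (before-test checks keep their outcome; after-test checks are inverted):

```python
import enum


class ResultOutcome(enum.Enum):
    PASS = "pass"
    FAIL = "fail"


class CheckEvent(enum.Enum):
    BEFORE_TEST = "before-test"
    AFTER_TEST = "after-test"


def interpret_xfail(outcome: ResultOutcome, event: CheckEvent) -> ResultOutcome:
    """Expected-failure logic: invert the outcome for after-test checks
    only; before-test checks keep their original outcome."""
    if event is CheckEvent.BEFORE_TEST:
        return outcome
    if outcome is ResultOutcome.FAIL:
        return ResultOutcome.PASS
    if outcome is ResultOutcome.PASS:
        return ResultOutcome.FAIL
    return outcome


# An after-test check that failed, under xfail, counts as a pass.
print(interpret_xfail(ResultOutcome.FAIL, CheckEvent.AFTER_TEST).value)  # pass
```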
