-
Notifications
You must be signed in to change notification settings - Fork 4.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Regression Test Failure Comment #10682
Conversation
Success status of the run: Commit: 6efb166, The full report is available as an artifact. |
239dc06
to
e3f020c
Compare
Success status of the run: Failed Commit: 905d413, The full report is available as an artifact. |
Status of the run: Failed Commit: e03333a, The full report is available as an artifact. |
Status of the run: Succeeded Commit: e03333a, The full report is available as an artifact. Dataset:
|
Status of the run: Failed Commit: b389ec1, The full report is available as an artifact. |
Status of the run: Failed Commit: b389ec1, The full report is available as an artifact. Dataset:
|
Status of the run: Failed Commit: 08eb052, The full report is available as an artifact. Dataset:
|
Status of the run: Failed Commit: 7a1c76b, The full report is available as an artifact. |
Hey @markus-hinsche! 👋 To run model regression tests, comment with the Tips 💡: The model regression test will be run on Tips 💡: Every time when you want to change a configuration you should edit the comment with the previous configuration. You can copy this in your comment and customize:
|
/modeltest include:
- dataset: ["financial-demo", "helpdesk-assistant"]
config: ["Sparse + BERT + DIET(seq) + ResponseSelector(t2t)"] |
The model regression tests have started. It might take a while, please be patient. Used configuration can be found in the comment. |
Status of the run: Failed Commit: 7a1c76b, The full report is available as an artifact. Dataset:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM 🍏
@@ -675,9 +675,25 @@ jobs: | |||
needs: | |||
- model_regression_test_cpu | |||
- model_regression_test_gpu | |||
if: always() && (needs.model_regression_test_cpu.result == 'success' || needs.model_regression_test_gpu.result == 'success') | |||
if: ((needs.model_regression_test_cpu.result != 'skipped') != (needs.model_regression_test_gpu.result != 'skipped')) && always() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why the "!=" instead of "||" here? (Guess doesn't make a difference anyway because it's either cpu or gpu who are executed and so both will never be true at the same time)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
||
gives OR behavior, while !=
gives an XOR behavior.
Explanation:
A run starts just to read the comments, but doesn't do anything else, e.g., https://github.com/RasaHQ/rasa/runs/4802907417?check_suite_focus=true.
If we write ||
, the combine_reports
would start (I ran into this issue in an earlier commit in the PR at hand). This is also connected to how the always() expression works.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah ok 👍 (what I had meant is that xor and or would effectively be the same because it seemed that both would not be non-skipped at the same time)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@markus-hinsche can we merge this then? :)
Motivation: Since #10574, summary comments after a MR test run are gone, if a job is failing
Proposed changes:
status:model-regression-tests
andrunner: gpu
) are now removed in both cases: success and failureresult.json
artifact contains (only) results from those jobs that succeededset_job_success_status
job to set the overall run status to failed if one of the jobs failedChecked:
Status (please check what you already did):
black
(please check Readme for instructions)