-
Notifications
You must be signed in to change notification settings - Fork 19
Add @EXPECTED_RESULTS@ tag. #4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Could be worth coming up with a syntax that naturally extends to multiple test groups. We have an unofficial tool https://github.com/nordicolympiad/testdata_tools/pull/13/files that parses
|
Unless there's a good reason to change, I'd prefer to stick to this name as it is already in use in DOMjudge and also by some problem setters. |
We should use short hand AC/WA/RTE/TLE |
drop mixed result |
This tag implies that the submission may get any of the listed verdicts as | ||
final verdict. | ||
|
||
If `@EXPECTED_RESULTS@: ` is found in a submission in any of the other |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This doesn't match the @EXPECTED_VERDICTS@
above.
@RagnarGrootKoerkamp I think we agreed on calling the mixed-bag-directory Mind bumping this PR (and fixing @eldering 's comment) and perhaps @niemela can take a look? |
So the semantics would be:
What about submissions that are WA or AC based on randomness? Should they be in accepted or rejected? |
When are they useful?
Feels like that should be |
I would just dump such a submission in |
Replying to your original question, I agree that those are the semantics we want. I'd say to only allow it in |
Regarding having Reasons for doing this:
It would be nice to officially support this. That's as easy as allowing |
I am late to this conversation; and much less experienced than many of you. (I have, however, given this topic some thought, and used my own script at analyzetestgroups.py quite a bit. It focusses on Here’s an idea for a very different approach. It
The idea is that the I want to be able to specify the allowed verdicts per submission and per named test group. (An allowed verdict can be a list, such as If one right, this allows arbitrarily fine-grained specification (say, specifying that this submission is guaranteed to fail on sample 3, but gets AC for the rest of the sample group.) This would work (and be useful), independently of “we use test groups internally to organise test cases logically during development for pass/fail problems” or “we use test groups for graded problems with scoring and expose them to the solver.” Here is an example, using fictional (and not complete thought-through) syntax: submissions:
accepted/th.py:
sample: AC # redundant bc AC is default
secret: AC # redundant
partially_accepted/quadratic_time.py:
secret:
huge: [TLE, RTE] # times out on huge testcases in data/secret/huge
*: AC # may be redundant as well
partially_accepted/simple_graph.py:
sample:
'3': WA # whereas '1' and '2' are implicitly AC
secret:
non-simple: WA
with-loops: WA
*: AC # redundant
partially_accepted/greedy.py # typical submission in a 4-testgroup scoring/grading problem
secret:
group1: AC
group2: AC
group3: WA
group4:
- TLE
- RTE For scoring problems with grade |
Will close this. @thorehusfeldt is working on an updated suggestion. |
As discussed.
Suggestions on wording are welcome. I think the directory name should be one of:
mixed_results
mixed_verdicts
multiple_results
(although only one would still be OK)multiple_verdicts