Java: Automodel: Add Candidates for Regression Testing #13954

kaeluka · 2023-08-14T10:58:37Z

This adds additional candidates that are useful for regression testing.

For regression testing, we want to also look at candidates that we used to extract and then modeled using automodel.

To achieve that, we add the candidates to the extraction query, but also add metadata so that we can skip them during non-testing uses.

…for regression testing

...emetry/AutomodelApplicationModeExtraction/AutomodelApplicationModeExtractCandidates.expected

tausbn

Short and sweet! 👍

I have a small suggestion for how to make this more flexible (in case we need it), but otherwise this looks good to me.

java/ql/src/Telemetry/AutomodelApplicationModeExtractCandidates.ql

jhelie · 2023-08-15T07:50:00Z

⚠️ This is based on another PR (#13886) that is currently under review and should be merged first. It is sufficient to review starting with commit 551b34e

drive-by comment: I do that quite often too and I find that selecting the previous PR's branch as the merge target for the second PR is quite handy to keep things clear.

jhelie · 2023-08-15T07:57:33Z

Thanks @kaeluka! Just to double check my understanding: this design does mean that during normal operation (i.e. not regression testing) we need to implement a filter in the script that will skip the candidates with an ai-{foo} provenance, correct?

In practice I imagine that the majority of candidates will not have an AI-provenance and so that skipping will be the exception (and so won't impact performance) but ooi have we also considered a 2 queries design whereby we would add a AutomodelApplicationModeExtractCandidatesRegression.ql query?

…AiModeled meta data property

kaeluka · 2023-08-16T07:28:01Z

Just to double check my understanding: this design does mean that during normal operation (i.e. not regression testing) we need to implement a filter in the automodel.py script that will skip the candidates with an ai-{foo} provenance, correct?

Yes! 👍

In practice I imagine that the majority of candidates will not have an AI-provenance and so that skipping will be the exception (and so won't impact performance) but ooi have we also considered a 2 queries design whereby we would add a AutomodelApplicationModeExtractCandidatesRegression.ql query?

I have, and have discussed that in our standup on Monday, remember? Two queries will mean increased maintenance load, even though the two queries would be almost exactly the same.

kaeluka · 2023-08-16T07:29:37Z

drive-by comment: I do that quite often too and I find that selecting the previous PR's branch as the merge target for the second PR is quite handy to keep things clear.

Thanks! I considered that but have never done it before because I wasn't sure how the UI would present that! Will do that next time ;)

Edit: now that the 'base' branch was merged, I could update this PR and it only shows the new commits as intended.

jhelie · 2023-08-16T15:50:34Z

Shall we have similar changes for framework mode in the same PR?

jhelie · 2023-08-16T15:53:16Z

Two queries will mean increased maintenance load, even though the two queries would be almost exactly the same.

It feels a little weird to always extract all candidates instead of just the ones we need (seems like the wrong away around) but we can easily modify that later if need be.

In the meantime we should probably avoiding merging this until we have the filter implemented in the script.

jhelie · 2023-08-23T13:49:23Z

I've just realised that the positive and negative extraction queries do not include the new alreadyAiModeled field. I'll try to add it in since t's more robust to have the same extracted features for examples and candidates (this is what allow us to easily classify candidates, regardless of whether or not they are known models, which can be useful for some regression testing).

edit: come to think of it I'm not entirely of the strength of the use case - we can handle the diff script side easily.

jhelie · 2023-08-23T14:15:18Z

In the meantime we should probably avoiding merging this until we have the filter implemented in the script.

This is actually not true - it will only be an issue when we attempt bumping our codeql dependency.

jhelie · 2023-08-29T08:59:32Z

Shall we have similar changes for framework mode in the same PR?

@tausbn thanks for the pointers on how to update the tests, I think the commit I just pushed took care of this - if you're happy with it I think this PR is now ready to be merged.

tausbn

As far as I can tell, all of your test cases have alreadyAiModeled set to false (as witnessed by the file://:1:1:1:1 bit that precedes them -- i.e. the empty string).

This seems... not right? I would expect at least one of the test cases to actually populate this with a non-empty string (like we do for the application mode tests, where in one case it's "ai-manual").

jhelie · 2023-08-29T13:25:03Z

No there is one that has the ai-manual provenance: de76c07 (so that's indeed the same situation as for the application mode)

See the 8th example in AutomodelFrameworkModeExtractCandidates.expected.

tausbn · 2023-08-29T13:31:02Z

No there is one that has the ai-manual provenance: de76c07 (so that's indeed the same situation as for the application mode)

See the 8th example in AutomodelFrameworkModeExtractCandidates.expected.

Ah, so there is! Never mind, then. 🙂

tausbn

jhelie · 2023-08-29T13:32:58Z

Great thanks for having a quick look, let's merge it then! 🚢

Java: Automodel application mode: include candidates that are useful …

551b34e

…for regression testing

kaeluka requested a review from a team as a code owner August 14, 2023 10:58

github-actions bot added the Java label Aug 14, 2023

kaeluka requested a review from tausbn August 14, 2023 10:59

kaeluka added the no-change-note-required This PR does not need a change note label Aug 14, 2023

Java: Automodel framework mode: use new interface

bc55afc

kaeluka commented Aug 14, 2023

View reviewed changes

...emetry/AutomodelApplicationModeExtraction/AutomodelApplicationModeExtractCandidates.expected Outdated Show resolved Hide resolved

tausbn requested changes Aug 14, 2023

View reviewed changes

java/ql/src/Telemetry/AutomodelApplicationModeExtractCandidates.ql Show resolved Hide resolved

Java: Automodel framework mode: track exact ai- provenance in already…

808dc3e

…AiModeled meta data property

Merge branch 'main' into kaeluka/add-provenance-to-metadata

44a9cf9

Java: Automodel Framework Mode: Add Candidates for Regression Testing

de76c07

jhelie requested a review from tausbn August 29, 2023 08:59

jhelie changed the title ~~Java: Automodel Application Mode: Add Candidates for Regression Testing~~ Java: Automodel: Add Candidates for Regression Testing Aug 29, 2023

tausbn requested changes Aug 29, 2023

View reviewed changes

jhelie requested a review from tausbn August 29, 2023 13:25

tausbn approved these changes Aug 29, 2023

View reviewed changes

jhelie merged commit 41726f5 into main Aug 29, 2023

jhelie deleted the kaeluka/add-provenance-to-metadata branch August 29, 2023 13:33

Java: Automodel: Add Candidates for Regression Testing #13954

Java: Automodel: Add Candidates for Regression Testing #13954

Uh oh!

Conversation

kaeluka commented Aug 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

tausbn left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jhelie commented Aug 15, 2023

Uh oh!

jhelie commented Aug 15, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kaeluka commented Aug 16, 2023

Uh oh!

kaeluka commented Aug 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jhelie commented Aug 16, 2023

Uh oh!

jhelie commented Aug 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jhelie commented Aug 23, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jhelie commented Aug 23, 2023

Uh oh!

jhelie commented Aug 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tausbn left a comment

Choose a reason for hiding this comment

Uh oh!

jhelie commented Aug 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tausbn commented Aug 29, 2023

Uh oh!

tausbn left a comment

Choose a reason for hiding this comment

Uh oh!

jhelie commented Aug 29, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

kaeluka commented Aug 14, 2023 •

edited

Loading

jhelie commented Aug 15, 2023 •

edited

Loading

kaeluka commented Aug 16, 2023 •

edited

Loading

jhelie commented Aug 16, 2023 •

edited

Loading

jhelie commented Aug 23, 2023 •

edited

Loading

jhelie commented Aug 29, 2023 •

edited

Loading

jhelie commented Aug 29, 2023 •

edited

Loading