Modelling for Test Assertions #18787

jmchilton · 2024-09-06T15:32:12Z

Summary

The source of truth for the documentation around test assertions from the Galaxy Tool XSD file (including semantically important things like are parameters required and their xsd datatype) has been pulled out of the XSD and put into the Python modules via docstrings and Annotated parameters (stack overflow on how to use Annotated). This now centralized documentation and typing information is used to generate Pydantic models for assertion lists. The JSON versions of assertion lists is used by the Planemo test format (https://planemo.readthedocs.io/en/latest/test_format.html) and in experimental YAML tool definitions. Validating these is an important part of a hardened workflow tool chain.

Why not XSD?

Out-of-sync

I've always been a little uncomfortable with the assertion documentation in galaxy.xsd. The plugins were meant to be defined as isolated Python modules. Spreading out the documentation in this other format is a bit problematic because the two sources of truth can easily become out of sync - and indeed had in some ways (see bug fixes below).

Developer Familiarity

While I do think XSD is very clever and kind of wonderful to develop against - it is a bit of a dying format and is almost certainly less familiar to nearly anyone interested in developing test assertions than Python. I hope centralizing these things makes it easier for new developers to more rapidly develop and document test assertions. This point pairs well with the new test cases that make it really easy to test assertion parsing. The developer experience should be a lot better despite the assertions themselves being more generically useful.

Not Intrinsically Tied to XML

These test assertions are very readily ported to JSON and useful there. Indeed, the Pydantic models provide stronger validation and typing. For this reason, I think placing the "source of truth" documentation in XSD is a bit less than ideal.

But...

I am very confident this switch is the right choice, but I will admit there is a "but". We are losing some minor things by not using hand crafted XSD. The structured of the XSD itself and how XSD abstractions are used is a bit less than with the auto-generated code. The file is a bit bigger now and a bit more redundant. This is the nature of auto-generated code and I think is fine - the benefits of centralization described above are worth the cost and then some.

Typing Improvements

In addition to improving the documentation of the Python code, I also made a pass at making all the types more specific where it makes sense. For instance, various image assertion XSD attributes could have used the type xs:nonNegativeInteger and were instead using xs:integer. I also added validators to get similar specificity out of Pydantic. In some places, I was able to go even farther with Pydantic. One instance is the center_of_mass parameter that has the format <float>, <float>. It is trivial to validate this format in Pydantic but would likely be more difficult to validate in XSD. Additionally, the Pydantic version of validation compiles all embedded regular expressions to ensure they are correct. Despite the differences in validation quality between XML and JSON - all the declaration of typing used to do this is centralized in the module definitions themselves. Maintaining the single source of truth I would hope for from a "plugin".

Tests & Bug Fixes

I've added a file that has positive and negative validation test cases for I think every assertion test case in both JSON (for the generated pydantic) and XML (for the generated XSD). These test cases are really trivial to write - one just needs to add new snippets to either positive or negative validation lists.

These test cases found a few issues around XSD. Recursive XML testing (from asserts/xml.py ) definitions were documented in the XSD and would work in tools but I think would not validate before. Only the archive assertion was setup correctly in the XSD for these recursive assertions.

Linting the XSD

In order to do automatic code generation for the XSD well, I've auto formatted it with xmllint. The project is various familiar with code formatting and the advantages it provides for Python and Typescript - I just wanted to note that the XSD is now being formatted in the some way. The Makefile target I added is format-xsd

Applications

I've started work on Pydnatic models for the Planemo test format (as an alternative approach to galaxyproject/planemo#1417) at https://github.com/jmchilton/galaxy/pull/new/test_format. This PR doesn't include that work but it provides the infrastructure to do the hardest parts and ensure the test format stays in line and on par with the tool XML assertions over the long term. I think the validation that we do using Pydantic goes beyond what is done for XSD so I think the validation available via this approach will be stronger that that going down the XSD -> jsonschema route. JSON schema for these models can still be generated from the Pydantic using the following command:

. .venv/bin/activate; PYTHONPATH=lib python -c "from galaxy.tool_util.verify.assertion_models import assertion_list; import json; print(json.dumps(assertion_list.model_json_schema(), indent=2))"

Additionally, there are bits and pieces of progress toward dynamic tools in Galaxy (for workflows) and YAML tools. For the dynamic tools - we probably want to use validated YAML - so I think these models will be very important to ensure those things are hardened and can be maintained long term.

Resyncing

The XSD and Pydantic can be resync-d against the Python implementation of the assert plugins using.

. .venv/bin/activate; PYTHONPATH=lib python lib/galaxy/tool_util/verify/codegen.py

These validation artifacts can be validated against the new tests using the pytest command:

pytest test/unit/tool_util/test_assertion_models.py

How to test the changes?

(Select all options that apply)

I've included appropriate automated tests.
This is a refactoring of components with existing test coverage.

License

I agree to license these and all my past contributions to the core galaxy codebase under the MIT license.

bernt-matthias

Pretty cool.

bernt-matthias · 2024-09-09T16:59:28Z

lib/galaxy/tool_util/verify/assertion_models.py

+    return v
+
+
+has_line_line_description = """The full line of text to search for in the output."""


What is the intention of defining the description text separately?

I thought the generated code would read more clean if the potentially large strings were defined on their own. Probably a pretty arbitrary choice either way.

bernt-matthias · 2024-09-09T16:59:56Z

lib/galaxy/tool_util/verify/assertion_models.py

+
+has_line_max_description = """Maximum number (default: infinity), can be suffixed by ``(k|M|G|T|P|E)i?``"""
+
+has_line_negate_description = """A boolean that can be set to true to negate the outcome of the assertion."""


This description should be the same for all assertions that implement negate.

The docs have a single source of truth in the parameters types - I don't think it is worth while to try to de-duplicate the strings here. Does that make sense?

bernt-matthias · 2024-09-09T17:06:08Z

lib/galaxy/tool_util/xsd/galaxy.xsd

-      <xs:group ref="TestAssertionsJson" minOccurs="0" maxOccurs="unbounded"/>
-      <xs:group ref="TestAssertionsH5" minOccurs="0" maxOccurs="unbounded"/>
-      <xs:group ref="TestAssertionsImage" minOccurs="0" maxOccurs="unbounded"/>
+      <xs:group ref="TestAssertion"/>


Is there a way to ensure that this part is not edited?

I'll see if I can regenerated these with some comments.

Also we could put the definitions in their own file. That would probably be a better design in the abstract.

Edit: I don't know what ramifications this would have - it is nice to have a single URL to download for tooling. I'll add the comments.

I've regenerated the docs with a lot more warnings about things being auto-generated and not to modify them. I've also rearranged things so the XSD has comments about where modules are defined so the docs can be updated.

jmchilton added kind/enhancement kind/refactoring cleanup or refactoring of existing code, no functional changes area/testing area/workflows area/tool-framework labels Sep 6, 2024

jmchilton force-pushed the assertion_overhaul branch 3 times, most recently from 8fc578f to 347639a Compare September 6, 2024 21:00

bernt-matthias reviewed Sep 9, 2024

View reviewed changes

jmchilton force-pushed the assertion_overhaul branch 3 times, most recently from 7032070 to b93e8a9 Compare September 13, 2024 14:37

xmllint on galaxy.xsd

d7f2776

jmchilton force-pushed the assertion_overhaul branch 3 times, most recently from 2ca12f9 to b7e9957 Compare September 13, 2024 20:06

jmchilton added 2 commits September 13, 2024 16:23

Modeling from assertions.

b70d83d

Re-generate galaxy.xsd with assertion parameters.

90d682f

jmchilton force-pushed the assertion_overhaul branch from b7e9957 to 90d682f Compare September 13, 2024 20:23

jmchilton marked this pull request as ready for review September 15, 2024 15:48

github-actions bot added this to the 24.2 milestone Sep 15, 2024

mvdbeek approved these changes Sep 16, 2024

View reviewed changes

jmchilton merged commit 5acc518 into galaxyproject:dev Sep 17, 2024
52 of 55 checks passed

bernt-matthias mentioned this pull request Oct 8, 2024

Assert that data_column parameters have a valid data_ref #18949

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modelling for Test Assertions #18787

Modelling for Test Assertions #18787

jmchilton commented Sep 6, 2024 •

edited

Loading

bernt-matthias left a comment

bernt-matthias Sep 9, 2024

jmchilton Sep 12, 2024

bernt-matthias Sep 9, 2024

jmchilton Sep 12, 2024

bernt-matthias Sep 9, 2024

jmchilton Sep 12, 2024

jmchilton Sep 12, 2024 •

edited

Loading

jmchilton Sep 12, 2024

		return v


		has_line_line_description = """The full line of text to search for in the output."""


		has_line_max_description = """Maximum number (default: infinity), can be suffixed by ``(k\|M\|G\|T\|P\|E)i?``"""

		has_line_negate_description = """A boolean that can be set to true to negate the outcome of the assertion."""

Modelling for Test Assertions #18787

Modelling for Test Assertions #18787

Conversation

jmchilton commented Sep 6, 2024 • edited Loading

Summary

Why not XSD?

Out-of-sync

Developer Familiarity

Not Intrinsically Tied to XML

But...

Typing Improvements

Tests & Bug Fixes

Linting the XSD

Applications

Resyncing

How to test the changes?

License

bernt-matthias left a comment

Choose a reason for hiding this comment

bernt-matthias Sep 9, 2024

Choose a reason for hiding this comment

jmchilton Sep 12, 2024

Choose a reason for hiding this comment

bernt-matthias Sep 9, 2024

Choose a reason for hiding this comment

jmchilton Sep 12, 2024

Choose a reason for hiding this comment

bernt-matthias Sep 9, 2024

Choose a reason for hiding this comment

jmchilton Sep 12, 2024

Choose a reason for hiding this comment

jmchilton Sep 12, 2024 • edited Loading

Choose a reason for hiding this comment

jmchilton Sep 12, 2024

Choose a reason for hiding this comment

jmchilton commented Sep 6, 2024 •

edited

Loading

jmchilton Sep 12, 2024 •

edited

Loading