Add test mode to run and dev #962

Merged: 49 commits, Nov 25, 2022

Conversation

@freakboy3742 (Member) commented Nov 8, 2022

Take 3. See #949 and #961 for the previous 2 attempts.

Adds a --test option to briefcase run and briefcase dev. When in --test mode:

  • the app entry point is changed from <app_name> to tests.<app_name>
  • the list of application sources is extended to include test_sources
  • the list of application dependencies is extended to include test_requires
  • the log runner then filters the output stream of the running app, looking for a sentinel value indicating test success/failure (a sketch of such a filter is shown below).
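
For illustration, here is a minimal sketch of what such an output filter might look like; the function name and sentinel patterns below are hypothetical, not Briefcase's actual implementation:

import re

# Hypothetical sentinel patterns; the real values used by Briefcase may differ.
SUCCESS_SENTINEL = re.compile(r"tests? (passed|ok)", re.IGNORECASE)
FAILURE_SENTINEL = re.compile(r"tests? (failed|errored)", re.IGNORECASE)


def check_test_outcome(tail: str):
    """Inspect the most recent output; return True/False on a sentinel match, else None."""
    if FAILURE_SENTINEL.search(tail):
        return False
    if SUCCESS_SENTINEL.search(tail):
        return True
    return None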

This approach has the following advantages:

  • It is much faster on iOS. I don't know what Xcode is doing when it starts a test suite, but this approach starts the test suite up to 2 minutes faster.
  • It requires minimal platform-specific modifications. We don't need to work out how to drive native unit test frameworks on every platform; we can just use "normal" app execution.
  • It can be used on "app" targets, not just project targets.

The downside is that:

  • We lose access to any platform-specific test automation tooling. For example, Xcode has frameworks for taking screenshots etc. during a test; we won't be able to use those (at least, not easily).
  • We need to manually manage the process of starting an application in the background if we want to programmatically drive an application.

Implemented for:

If you specify a template_branch='testcase2' at the app level, these templates will be picked up.

The most common example of usage will be a pytest suite, where all the test code is contained in a tests folder. This would be configured in the app's pyproject.toml as:

test_sources = ['tests']
test_requires = ['pytest']

The code copied in as part of test_sources should contain an <app_name>.py that will start the test suite. This script should contain:

import os
from pathlib import Path

import pytest


if __name__ == "__main__":
    # Run from the project root, so that pytest can find the tests folder
    os.chdir(Path(__file__).parent.parent)
    result = pytest.main(["-vv", "--color=no", "tests"])

Alternatively, a unittest suite with automatic discovery would look like:

import unittest
from pathlib import Path

if __name__ == "__main__":
    loader = unittest.TestLoader()
    suite = loader.discover(Path(__file__).parent.parent / "tests")
    runner = unittest.TextTestRunner()
    runner.run(suite)

And a unittest suite based on explicit code imports would look like:

import unittest
from pathlib import Path

from .test_module1 import *
from .test_module2 import *

if __name__ == "__main__":
    unittest.main()

Other notes:

  • run --test will implicitly do a full update and build on every execution. This ensures the test code and dependencies are always available when running as a test; however, it also means that if you do a plain run directly after run --test, the test dependencies will still be in the app, and on mobile platforms the app will still launch as the test app. So it is advisable to do a run -u the first time after running run --test. An illustrative command sequence is shown below.
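
An illustrative sequence of commands for this workflow (as described above):

briefcase run --test   # implicitly updates and builds, then runs the test suite
briefcase run -u       # first normal run afterwards: update the app so it no longer runs as the test app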

Fixes #973

PR Checklist:

  • All new features have been tested
  • All new features have been documented
  • I have read the CONTRIBUTING.md file
  • I will abide by the code of conduct

@mhsmith (Member) commented Nov 8, 2022

For Android, the easiest way I can think of to set the entry point is for Briefcase to create a resource file, e.g. app/src/main/res/values/briefcase.xml, containing a string resource, and for MainActivity to get this resource using getString.

The name of the file isn't significant: all files in the values directory are simply combined when building the app. But using a separate file would make it easier for Briefcase to manage it without conflicting with anything else.

@rmartin16 (Member) left a comment

I don't have any large concerns.

The necessary configuration is a bit opaque to me; will the default templates be updated to perhaps provide a usable example of implementing tests?

I don't know if this would have changed things, but Rich was already recording all the captured output. The same Rich API we use to create the log file can be used to obtain a copy of past output.
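
(For reference, a minimal generic sketch of Rich's recording API - not Briefcase's actual logging code:)

from rich.console import Console

console = Console(record=True)    # record everything printed via this console
console.print("Hello from the app")
captured = console.export_text()  # obtain a copy of the past output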

I haven't been able to get my test suite to run on Android, but I'm presuming that's user error for now.

Finally, I'm particularly disappointed at all the missed opportunities to use the walrus operator in this PR ;)

try:
    # Set up the log stream
    kwargs = self._prepare_log_stream(app=app, test_mode=test_mode)

    # Start the app in a way that lets us stream the logs
    log_popen = self.tools.subprocess.Popen(
Member

To help draw the difference from macOS, I'd rename all these Popen objects from log_popen to app_popen. It gets confusing to me what the Popen objects really represent in the different codepaths.

Member

This same change also probably makes sense for Flatpak and Windows.

Comment on lines 112 to 118
# Look for the success/failure conditions in the tail
if self.failure_filter and self.failure_filter(tail):
    self.success = False
    self.log_popen.send_signal(signal.SIGINT)
elif self.success_filter and self.success_filter(tail):
    self.success = True
    self.log_popen.send_signal(signal.SIGINT)
Member

What's the purpose of sending SIGINT here? Why shouldn't the process finish normally?

Member Author

Primarily because on iOS and Android, it won't ever finish. iOS and Android apps don't ever truly exit - it's assumed that the app will start, and will persist indefinitely; on macOS, the log stream will persist.

However, we might be able to use the stop_func in the Android/iOS case; we're already using stop_func in the macOS case. Let me dig into this a bit.

Member Author

I've taken another look at this, and I think I've found an alternate approach. Instead of sending SIGINT, we can tell the streamer to stop. The process will then fall back on the normal subprocess.cleanup(). We'll still potentially be terminating the process, but the logic is all contained in the subprocess module.

In the process of working this out, I was also able to build a stop_func implementation for iOS, so we now have a safety mechanism there, too. In normal operation, that safety mechanism won't be triggered; but if someone manages to build an app that fully crashes, the log will terminate along with the app, which brings iOS into alignment with every other platform.

Comment on lines 191 to +195
if run_app:
    self.logger.info("Starting in dev mode...", prefix=app.app_name)
    env = self.get_environment(app)
    return self.run_dev_app(app, env, **options)
if test_mode:
    self.logger.info(
        "Running test suite in dev environment...", prefix=app.app_name
    )
Member

The user experience for briefcase dev --test for an app that's improperly configured is a little rough.

Do you think some guardrails here, and masking the exception with more obvious feedback to the user, would be appropriate?

❯ briefcase dev --test

[testmode] Running test suite in dev environment...
Traceback (most recent call last):
  File "/home/russell/.pyenv/versions/3.10.8-debug/lib/python3.10/runpy.py", line 129, in _get_module_details
    spec = importlib.util.find_spec(mod_name)
  File "/home/russell/.pyenv/versions/3.10.8-debug/lib/python3.10/importlib/util.py", line 94, in find_spec
    parent = __import__(parent_name, fromlist=['__path__'])
ModuleNotFoundError: No module named 'tests'

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "/home/russell/.pyenv/versions/3.10.8-debug/lib/python3.10/runpy.py", line 220, in run_module
    mod_name, mod_spec, code = _get_module_details(mod_name)
  File "/home/russell/.pyenv/versions/3.10.8-debug/lib/python3.10/runpy.py", line 138, in _get_module_details
    raise error(msg.format(mod_name, type(ex).__name__, ex)) from ex
ImportError: Error while finding module specification for 'tests.testmode' (ModuleNotFoundError: No module named 'tests')

Member Author

Hrm - I agree this isn't a great UX. However, there are some limits on what we can do here, because there's functionally no difference between user-space code raising an ImportError, and the main execution script raising an ImportError. We could catch the ImportError, and then interrogate to see if the underlying cause is the module that was used to start the app; however, that kind of interrogation makes me really nervous - being "smart" in this kind of way has burned me in the past.

However - I noticed that I'm inadvertently eating the error condition; the mainline essentially ignores BriefcaseTestSuiteError, assuming the method that raised it has done all the error handling, so when your test suite raises an error, that's not caught as a "couldn't start test suite" error. Catching that error at least gives us the chance to raise a slightly more meaningful "Unable to start test suite 'tests.myapp'" message, and capture the log; test suite success/failure is still a "successful" run.

Does that look any better?

@freakboy3742 (Member Author)

> I don't have any large concerns.

> The necessary configuration is a bit opaque to me; will the default templates be updated to perhaps provide a usable example of implementing tests?

That's the idea. Adding a default test suite to the briefcase template is on my todo list for today.

> I don't know if this would have changed things, but Rich was already recording all the captured output. The same Rich API we use to create the log file can be used to obtain a copy of past output.

It would be - except for the iOS/Android case. Since those test apps never truly exit, we need to have a streaming log filter to identify the failure conditions. Plus, it gives us the opportunity to strip off some of the excessive logging noise added by the macOS log stream.

> I haven't been able to get my test suite to run on Android, but I'm presuming that's user error for now.

Yeah - that needs some hand-holding for now, plus a manual modification to the gradle template. @mhsmith pushed some updates last night which provide a temporary workaround for the manual modifications. If you want a working example, beeware/Python-support-testbed#6 is the best point of reference.

> Finally, I'm particularly disappointed at all the missed opportunities to use the walrus operator in this PR ;)

:-P

@mhsmith (Member) commented Nov 23, 2022

I think all of my comments have been addressed, except for this one about the --no-auto-update argument.

@freakboy3742 (Member Author)

@rmartin16 FYI - I've added beeware/briefcase-template#39; this will bootstrap a pytest test suite into a new project. The template contains an option for a unittest suite; for now, you need to manually roll out the template to get that option. We could add the extra question to this PR; but (a) I don't think it hurts to strongly encourage pytest, and (b) we really need the longer term fix in the form of a fix for #392.

@freakboy3742 (Member Author)

I've had a quick word with @rmartin16 on Discord, and he's given the go-ahead to merge this. We're certain to have tweaks as we fine-tune the testing behavior as we gain experience, but it will be easier to handle those as PRs on this base.

If nothing else: when beeware/briefcase-template#39 lands, we'll be able to extend CI to run the minimal test suite, which will give us the first proof that Briefcase is always producing an executable that can execute.

Development

Successfully merging this pull request may close these issues: Refactor app output streaming code

3 participants