Add script to run all tests that take less than 1 min each #39

jafingerhut · 2025-01-21T18:46:20Z

This PR also adds the --verbosity DEBUG option to CI builds, which I expect will help diagnose future CI build failures.

Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

Also add a note to the README about how it seems that VirtualBox VMs can run Tofino tests faster if you reduce them to only 1 VCPU. Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

…ne-tofino1-test

fruffy · 2025-02-07T14:56:59Z

ci/run-one-test.sh

+timeout 10800 ./run_p4_tests.sh -p ${TESTNAME} --arch tofino |& sed 's/^/tests: /'
+
+echo "Killing bf_switchd and tofino-model processes ..."
+sudo killall bf_switchd


This will break any parallel setup. I would only kill the process returned by the shell.

Python has better facilities for this imho and ChatGPt etc works really well for these kinds of boilerplate scripts.

@vgurevich Have you run multiple Tofino models and bf_switchd processes on the same base OS in parallel successfully? Without containers or VMs or other things like that to separate them from each other?

If that works, great.

I tried using bash mechanisms to capture the PID of the sudo run_*.sh runs, but killing those only killed the sudo process, not the run_*.sh script running as root, so was not effective.

Note: The existing tests that I am aware expect that a set of veth interfaces have been created first, before the test starts running, and the test assumes that whichever of those veth interfaces it wants to use, are available for its sole use.

Thus any two tests that both use veth2 will conflict with each other if you attempt to run them both without Linux network namespaces or other tricks like that.

And that is assuming that the Tofino model and driver processes don't conflict with each other in other ways besides this, which might be the case, but it seems like a trip down the rabbit hole to run parallel tests on the same system. If you really want to do testing in parallel, it would be far easier to set up and easier to maintain if you divided up the tests N ways and ran the different subsets of all tests on N different systems in parallel.

@fruffy I am fairly sure it is possible to enable running multiple tests in parallel on a system, but doing so would be significantly more development time to create than what is in this PR. It isn't just replacing the uses of killall. I added comments to these trying to make it very explicit that these scripts only support running one test at a time on a system.

I added another script that runs all Tofino1 and Tofino2 tests that take at most about 5 minutes of time each, which is most of them. They take about 90 minutes to run. If I enabled all of the tests, the longest 5 or 6 would push the total up to about 5 hours. This seems like a reasonable length of test suite to run nightly or weekly, rather than pre-commit.

Great!

We definitely had parallel scripts for the Tofino PTF tests using network namespaces. Which is why I was concerned about this command (if we end up using these commands it for the tests).

…ne-tofino1-test

Also add a script that runs all Tofino1 and Tofino2 tests. Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

…ino1-test

failing the run-one-test.sh script if that does happen. Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

fruffy · 2025-04-10T12:46:55Z

ci/run-multiple-tests.sh

@@ -0,0 +1,318 @@
+#! /bin/bash


Can we try to combine this with #94? I am curious to see whether we can now run tests in parallel.

I made a commit a moment ago to update this Bash script to use your new run-test.py program to run individual tests. We'll see how it goes.

…ino1-test

…gerhut/open-p4studio into add-script-to-run-one-tofino1-test

…l tests Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

jafingerhut · 2025-04-11T03:55:16Z

@fruffy In the logs for the ubuntu-22.04 test running in CI, you can download them and then do: egrep '(ERROR|exit_status)' log-file and I see output like this:

2025-04-11T02:07:55.2865294Z Test ARCH=tofino exit_status=0 p4_14 basic_switching
2025-04-11T02:08:19.9937195Z Test ARCH=tofino exit_status=0 p4_16 bri_handle
2025-04-11T02:08:24.5327170Z 2025-04-11 02:08:24,532 - INFO - [switchdOutputThread] - switchd: 2025-04-11 02:08:24.532227 BF_SWITCHD ERROR - ERROR: bf_sys_dma_pool_create failed(-1) for dev_id 0 subdev_id 0 pool BF_DMA_CPU_PKT_RECEIVE_0_dev_0_0_Pool
2025-04-11T02:08:24.8915089Z 2025-04-11 02:08:24,891 - ERROR - [MainThread] - Process [switchd] (PID: 119763) is not alive (rc: 1).
2025-04-11T02:08:28.9256753Z Test ARCH=tofino exit_status=1 p4_16 bri_with_pdfixed_thrift
2025-04-11T02:08:34.5193524Z 2025-04-11 02:08:34,518 - INFO - [switchdOutputThread] - switchd: 2025-04-11 02:08:34.518823 BF_SWITCHD ERROR - ERROR: bf_sys_dma_pool_create failed(-1) for dev_id 0 subdev_id 0 pool BF_DMA_CPU_PKT_RECEIVE_0_dev_0_0_Pool
2025-04-11T02:13:35.8594007Z 2025-04-11 02:13:35,859 - ERROR - [MainThread] - Tests failed with exit code 1.
2025-04-11T02:13:35.8597081Z 2025-04-11 02:13:35,859 - ERROR - [MainThread] - Process [switchd] (PID: 121989) is not alive (rc: 1).
2025-04-11T02:13:39.8877294Z Test ARCH=tofino exit_status=1 p4_14 chksum
2025-04-11T02:13:44.2712849Z 2025-04-11 02:13:44,270 - INFO - [switchdOutputThread] - switchd: 2025-04-11 02:13:44.270755 BF_SWITCHD ERROR - ERROR: bf_sys_dma_pool_create failed(-1) for dev_id 0 subdev_id 0 pool BF_DMA_CPU_PKT_RECEIVE_0_dev_0_0_Pool
2025-04-11T02:13:44.8150662Z 2025-04-11 02:13:44,814 - ERROR - [MainThread] - Process [switchd] (PID: 125311) is not alive (rc: 1).
2025-04-11T02:13:48.8496136Z Test ARCH=tofino exit_status=1 p4_14 default_entry
2025-04-11T02:13:53.3384067Z 2025-04-11 02:13:53,338 - INFO - [switchdOutputThread] - switchd: 2025-04-11 02:13:53.337969 BF_SWITCHD ERROR - ERROR: bf_sys_dma_pool_create failed(-1) for dev_id 0 subdev_id 0 pool BF_DMA_CPU_PKT_RECEIVE_0_dev_0_0_Pool
2025-04-11T02:13:53.9365409Z 2025-04-11 02:13:53,936 - ERROR - [MainThread] - Process [switchd] (PID: 127537) is not alive (rc: 1).
2025-04-11T02:13:57.9648178Z Test ARCH=tofino exit_status=1 p4_14 deparse_zero

Every line with exit_status is output by my bash script named run-multiple-tests.sh, after each run of one test.

The other lines with errors about bf_sys_dma_pool_create failing, I never see those when I run these same tests on my local Ubuntu 22.04 VM. I do not know yet what causes those errors, but from the log files here, it appears that whenever that happens, it is causing the test to fail.

Most of these tests pass on my local system. I see 11 out of 109 tests failing on my local system, vs. 107 out of 109 failing in CI.

jafingerhut added 12 commits January 15, 2025 20:18

Enable batch-install.sh to run on Ubuntu 22.04

39d35f3

Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

Merge remote-tracking branch 'upstream/main' into main

ecc22c1

Merge branch 'main' of https://github.com/p4lang/open-p4studio into main

5f5c312

Merge branch 'main' of https://github.com/p4lang/open-p4studio

113e4f1

Merge branch 'main' of github.com:jafingerhut/open-p4studio

4becb30

Merge branch 'main' of https://github.com/p4lang/open-p4studio

d94cdd5

Merge branch 'main' of https://github.com/p4lang/open-p4studio

5a28bf0

Add a simple bash script to run one Tofino1 test on the model

db290bb

Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

Merge remote-tracking branch 'upstream/main'

14807ca

Merge branch 'main' into add-script-to-run-one-tofino1-test

b801b5a

Add DEBUG verbosity to CI builds for more details of failures

25c26f3

Also add a note to the README about how it seems that VirtualBox VMs can run Tofino tests faster if you reduce them to only 1 VCPU. Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

Merge remote-tracking branch 'upstream/main' into add-script-to-run-o…

72e5ca0

…ne-tofino1-test

fruffy reviewed Feb 7, 2025

View reviewed changes

jafingerhut added 3 commits February 27, 2025 01:32

Merge remote-tracking branch 'upstream/main' into add-script-to-run-o…

6a24ef6

…ne-tofino1-test

Enable run-one-test.sh to run Tofino2 as well as Tofino1 tests

264096a

Also add a script that runs all Tofino1 and Tofino2 tests. Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

Try running all short and medium tests in CI

cd7f3c5

Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

jafingerhut changed the title ~~Add script to run one tofino1 test~~ Add scripts to run one Tofino1 or 2 test, and to run all tests that take less than 5 mins each Feb 27, 2025

Fix typo

94a216d

Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

fruffy approved these changes Feb 28, 2025

View reviewed changes

jafingerhut mentioned this pull request Apr 2, 2025

Move the assembler to P4C. #70

Merged

jafingerhut added 2 commits April 2, 2025 13:58

Merge remote-tracking branch 'up/main' into add-script-to-run-one-tof…

c4cfa8a

…ino1-test

Add check for run_tofino_model.sh process exiting quickly

703f6c3

failing the run-one-test.sh script if that does happen. Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

jafingerhut mentioned this pull request Apr 3, 2025

Add the Tofino model source code as a package in the open P4Studio #65

Merged

4 tasks

fruffy reviewed Apr 10, 2025

View reviewed changes

jafingerhut added 5 commits April 10, 2025 20:33

Merge remote-tracking branch 'up/main' into add-script-to-run-one-tof…

7ce2d5b

…ino1-test

Merge branch 'add-script-to-run-one-tofino1-test' of github.com:jafin…

e6e0452

…gerhut/open-p4studio into add-script-to-run-one-tofino1-test

Change running of multiple tests to use new run-test.py for individua…

c5b4cfc

…l tests Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

Fix typo and make arch an optional arg for run-test.py

1601cd1

Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

Change run-multiple-tests.sh to have non-0 exit status if any tests fail

2829aca

Signed-off-by: Andy Fingerhut <andy_fingerhut@alum.wustl.edu>

jafingerhut changed the title ~~Add scripts to run one Tofino1 or 2 test, and to run all tests that take less than 5 mins each~~ Add script to run all tests that take less than 5 mins each Apr 11, 2025

Merge branch 'main' into add-script-to-run-one-tofino1-test

ca50e6e

jafingerhut changed the title ~~Add script to run all tests that take less than 5 mins each~~ Add script to run all tests that take less than 1 min each Apr 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add script to run all tests that take less than 1 min each #39

Add script to run all tests that take less than 1 min each #39

Uh oh!

jafingerhut commented Jan 21, 2025 •

edited

Loading

Uh oh!

fruffy Feb 7, 2025

Uh oh!

jafingerhut Feb 7, 2025

Uh oh!

jafingerhut Feb 26, 2025

Uh oh!

jafingerhut Feb 27, 2025

Uh oh!

fruffy Feb 28, 2025

Uh oh!

fruffy Apr 10, 2025

Uh oh!

jafingerhut Apr 10, 2025

Uh oh!

jafingerhut commented Apr 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add script to run all tests that take less than 1 min each #39

Are you sure you want to change the base?

Add script to run all tests that take less than 1 min each #39

Uh oh!

Conversation

jafingerhut commented Jan 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fruffy Feb 7, 2025

Choose a reason for hiding this comment

Uh oh!

jafingerhut Feb 7, 2025

Choose a reason for hiding this comment

Uh oh!

jafingerhut Feb 26, 2025

Choose a reason for hiding this comment

Uh oh!

jafingerhut Feb 27, 2025

Choose a reason for hiding this comment

Uh oh!

fruffy Feb 28, 2025

Choose a reason for hiding this comment

Uh oh!

fruffy Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

jafingerhut Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

jafingerhut commented Apr 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jafingerhut commented Jan 21, 2025 •

edited

Loading