New test system #262

guygastineau · 2022-06-10T04:57:10Z

This is a pretty huge PR. The new test system is more robust, it has better output for local testing, and it provides enough info for our new test runner (not merged yet check exercism/scheme-test-runner#32) to output fully v2 compatible test results.

This track was neglected for a long time. Part of that is my fault. I tried to break as few things as possible while getting things up-to-date. Some of the track documentation is out of date due to this PR, but we already had bit-rot in our docs. It sounds like some community members are interested in helping with documentation. We'll see how that goes. At the very least, I will try to get some of the nitty gritty details hammered out in some better documentation.

I squashed repetitive actions like working out dual compatibility for a few exercises, and I did some LARGE updates via automation, so these 5 commits are touching a lot of files. The test-migration.ss probably doesn't need to be used again exactly, but it has some useful procedures. If nothing else, it will have its useful parts absorbed into outils.ss and track.ss. For now, I think it is fine to leave it in the repository.

I hope this is somewhat self-explanatory. I tried to leave good commit messages, but I worked out the majority of the updated test facility, ie. test-util.ss, while tinkering with my own forth solution, so I don't have commit message for that process here. The majority of the code is really the same. I made some tweaks for robustness and better reporting, and now the tests are all accepted as lists that get evaled with a reflected appened for test_code reporting. In future, I expect we will just use rest args to pack in the source and optional task relations, but adding this later won't be any harder than adding it now, so I am putting it off.

test-util.ss modifies the original test running code. It is isolated from the spec descriptions (maybe it should be a real library eventually?), and it provides enough information to satisfy the newer requirements for test-runner reporting (for v3 of the runner spec). This also has better usability for the local tests. By default the actual value returned by user defined functions is presented for failing tests, so you can see what is happening without re-entering everything in an repl. It could be extended to be repl friendly as well for users who want to stay in geiser for example. All exception should also be caught and converted to lists that are returned with failures (except for test-error). This should put an end to the mysterious failing tests with unhelpful exception errors. Now, so long as the user code loads it will return all errors encountered during tests as the actual value in the failure report. Maybe we should actually wrap (load solution) with an exception handler to try making better reporting on failing user code? This might be very helpful for recovering in such cases for the test-runner repo. Finished with first draft test-migration.ss modify ci.ss to run all directory tests easily. Add a procedure to redeploy test-util.ss to all exercises.

In a recent PR I approved work that reimplemented existing solutions that didn't work with our current infrastructure. Our CI used an exercise list kept statically in the Makefile, but it was already out of date. The new CI runs for all practice exercises (with a blacklist), so these exercises weren't tested in the CI for the PRs. This compound commit contains the fixes needed by these recent exercises to pass the new CI for both guile and chez. Make sum-of-multiples work for both guile and chez. Make the strain example solution work for both guile and chez. Make the trinary example solution work for both guile and chez. Make the triangle example solution work for both guile and chez.

The CI now runs on all exercises except for the deprecated list-ops and robot-name and the guile-only acronym. All of these changes only work against the tests I am about to commit, but i figured these changes would be better off isolated.

I used configlet to format the exercise configs too after the changes. At this point I believe everything is read to go except for the fact that the test-runner hasn't been updated yet. We should merge the test-runner PR before mergin this.

ErikSchierboom

I don't really know Scheme, so I'll your judgment. Thanks for the huge PR!

guygastineau · 2022-06-16T12:54:02Z

Thank you for the quick review, @ErikSchierboom . Luckily, the new test-runner has already helped the old tests pass where it was crashing before when the implementation wasn't compatible with one of the scheme implementations. Now we will get real v2 reports from these new test and our test runner.

guygastineau added 5 commits June 9, 2022 23:52

Final tweaks for the CI.

f4e7d17

The CI now runs on all exercises except for the deprecated list-ops and robot-name and the guile-only acronym. All of these changes only work against the tests I am about to commit, but i figured these changes would be better off isolated.

Migrate all exercises with the test-migration.ss code.

50df62a

guygastineau force-pushed the new-test-system branch from 5d75611 to 3400047 Compare June 10, 2022 05:02

This was referenced Jun 11, 2022

Errors in the auto testing system, submission always fails #259

Closed

Results v3 exercism/scheme-test-runner#32

Merged

guygastineau requested a review from jitwit June 15, 2022 11:43

ErikSchierboom approved these changes Jun 16, 2022

View reviewed changes

ErikSchierboom added the x:rep/large Large amount of reputation label Jun 16, 2022

guygastineau merged commit 2064dd5 into exercism:main Jun 16, 2022

guygastineau deleted the new-test-system branch June 16, 2022 12:56

ErikSchierboom mentioned this pull request Jun 16, 2022

New breakage! exercism/scheme-test-runner#33

Closed

This was referenced Jul 14, 2022

Document reputation label exercism/docs#347

Merged

Consider correct reputation amounts for different labels exercism/exercism#6440

Open

Consider correct reputation amounts for different labels exercism/exercism#6441

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

New test system #262

New test system #262

Uh oh!

guygastineau commented Jun 10, 2022

Uh oh!

ErikSchierboom left a comment

Uh oh!

guygastineau commented Jun 16, 2022

Uh oh!

Uh oh!

Uh oh!

New test system #262

New test system #262

Uh oh!

Conversation

guygastineau commented Jun 10, 2022

Uh oh!

ErikSchierboom left a comment

Choose a reason for hiding this comment

Uh oh!

guygastineau commented Jun 16, 2022

Uh oh!

Uh oh!