feat(type-coverage-generation): Model type coverage batch generation #390

sam-or · 2023-09-27T01:44:17Z

Pull Request Checklist

New code has 100% test coverage
(If applicable) The prose documentation has been updated to reflect the changes introduced by this PR
(If applicable) The reference documentation has been updated to reflect the changes introduced by this PR
Pre-Commit Checks were ran and passed
Tests were ran and passed

Description

This PR implements an alternate batch generation process. The goal is to generate a minimal set of examples of a model that achieves full coverage of the forms that model can take.
A very simple example:

class Model(pydantic.BaseModel):
  data: int | str
  
list(ModelFactory.coverage())
# >>>
# [Model(data=1234), Model(data="abc123")]

Close Issue(s)

…ration

JacobCoffee · 2023-09-27T01:53:05Z

Please see the suggested sourcery refactorings, it cant merge into forks for some reason:
61deae0

sam-or · 2023-09-27T02:12:17Z

Please see the suggested sourcery refactorings, it cant merge into forks for some reason: 61deae0

done

guacs · 2023-09-30T06:56:25Z

@sam-or is this ready for review or are you still working on it?

sam-or · 2023-09-30T07:58:12Z

@sam-or is this ready for review or are you still working on it?

There a still a couple of tests failing that I won’t get to look at until mid next week. But other than fixing those there’s hopefully not too much else that needs doing, so it should be good for review.

sam-or · 2023-10-02T21:35:13Z

tests seem to be passing now, should be good for review

guacs · 2023-10-03T01:47:53Z

tests seem to be passing now, should be good for review

Could you merge from main and run pdm run lint?

guacs · 2023-10-04T03:24:51Z

@sam-or Sorry for the delay. I'll take a look this weekend :)

polyfactory/factories/base.py

polyfactory/utils/model_coverage.py

tests/test_type_coverage_generation.py

guacs · 2023-10-08T09:33:53Z

I did review the code and I think it's great, but I'll be honest, I'm not sure I'm seeing the benefit of this feature too much. The reason for that is because once #397 is done, then hypothesis will do a form of this kind of coverage. Also, what is the expected output of the following:

@dataclass
class Bar:
    bar_val: int | str | bool


@dataclass
class Foo:
    foo_val: int | str
    bar: Bar


FooFactory = DataclassFactory.create_factory(Foo)
coverage = list(FooFactory.coverage())

print(len(coverage)) # current output = 3

Currently, the output is 3, but I'm expecting there to be 6. That is, with bar_value of bar in Foo having int, str, and bool when the type of foo_val is int and then the same when the type of foo_val is str. For more complex models, the number of variations will increase very quickly. Or am I misunderstanding the intent of this feature?

sam-or · 2023-10-08T10:45:49Z

Thank you very much for your review. I realise I have probably not explained the intention very well. The reason for wanting this feature over something like hypothesis (which I am currently using) is that I wanted something that would always generate examples of a model with every option of what that type could be, every single time that a batch is generated. The other goal is to achieve this with the minimum number of examples in a batch. With hypothesis this is not guaranteed, nor is it guaranteed with other methods based on randomness. Tools like hypothesis are amazing but I don't wish to rely on a statistical approach to this "coverage", which is why I wanted to move away from fuzzing to a more targeted method to generating the kind of test data that is best for my use case. Please let me know what your thoughts are on this, perhaps this feature is a bit niche so I understand your reservations.

So to answer your question about expected output, we would be expecting 3 examples because the highest variation of any model in your example is bar_val: int | str | bool:

Foo(foo_val=123, bar_var=Bar(bar_val=321)) # foo_val: int, bar_val: int
Foo(foo_val="abc", bar_var=Bar(bar_val="def")) # foo_val: str, bar_val: str
Foo(foo_val=456, bar_var=Bar(bar_val=True)) # foo_val: int (wraps around), bar_val: bool

As you can see it covers every value in the union types of foo_val and bar_val with the smallest number of examples - not considering every permutation of the union types which would yield 6 examples (and yes grow very quickly for complex models)

guacs · 2023-10-08T12:27:31Z

Aah okay now I get what you were trying to do. I do think it's helpful, but also like you said it might be a bit too niche of a feature for us to merge and then maintain.

Thoughts, @litestar-org/members?

sam-or · 2023-10-08T23:25:09Z

Perhaps to try to sell it a bit more I'll try to explain the benefits I see in this feature;

There is a testing speed advantage with larger more complex models to generating a minimal number of examples that still achieves a high percentage of code coverage
Consistency in testing, I'm sure anyone who has used hypothesis enough has run into flaky tests and this also aims to resolve that
Useful for loading up a test database with data to test things like searching and migrations with more consistent coverage and confidence that data in every form has been tested (for some deeply nested models, it is very unlikely that hypothesis will generate examples that cover all the branches in the nested types)

I do hope that others might find it as useful as I will.

I'm also more than happy to continue to spend time on this in the future, to help maintain and improve it

guacs · 2023-10-18T11:35:00Z

@sam-or is this ready? If so, could you add documentation for this as well??

sam-or · 2023-10-18T21:32:05Z

Yes it is, I'll add documentation now

guacs · 2023-10-21T02:59:39Z

@sam-or sorry for the delay! I wanted to take another look into this properly and I'll definitely do it in a few days :)

guacs

@sam-or first of all, sorry for the delay! I have left a few comments and once those are resolved, I think this is good to merge :)

docs/examples/model_coverage/test_example_1.py

docs/examples/model_coverage/test_example_2.py

docs/usage/model_coverage.rst

polyfactory/factories/base.py

polyfactory/utils/model_coverage.py

tests/test_type_coverage_generation.py

- Add missing docstrings - Move error handling around CoverageContainerCallable to inside - Formatting issue in documentation

polyfactory/utils/model_coverage.py

guacs

Thanks for this! Just one small comment regarding adding docstrings. Also, it'd be great if you could just tag me or request another review once you've made the changes. If not, I might not know whether the PR is ready without manually looking to see if it's ready for another review.

polyfactory/utils/model_coverage.py

github-actions · 2023-11-12T06:10:25Z

Documentation preview will be available shortly at https://litestar-org.github.io/polyfactory-docs-preview/390

sam-or · 2023-11-12T06:14:01Z

Thanks for this! Just one small comment regarding adding docstrings. Also, it'd be great if you could just tag me or request another review once you've made the changes. If not, I might not know whether the PR is ready without manually looking to see if it's ready for another review.

Ah yep no worries, will do. I've added that docstring to CoverageContainerCallable, hopefully it's good to go now?

guacs

@sam-or sorry for the delay and thank you for the work you've done :)

sam-or added 2 commits September 27, 2023 01:19

feat(type-coverage-gen): Initial implementation of type coverage gene…

d689ce0

…ration

fix: revert change to .pre-commit-config.yaml

84c9c51

sourcery-ai bot mentioned this pull request Sep 27, 2023

feat(type-coverage-generation): Model type coverage batch generation (Sourcery refactored) #391

Closed

sam-or added 2 commits September 27, 2023 01:54

fix: Update NoneType importing for older python versions

e97b1c1

fix: apply sourcery refactor

8df692f

sam-or added 7 commits September 27, 2023 03:43

fix: import ParamSpec from typing_extensions

9734172

fix: Skip tests on py versions < 3.10

373fea4

fix: revert changes to .pre-commit-config.yaml

ae38e54

chore: Create devcontainer.json

1fb0608

fix: remove .devcontainer dir

f2289d0

fix: Add missing test skip for older python versions

d9adc27

test: Add test for post generated in coverage generation

20f4813

sam-or marked this pull request as ready for review September 30, 2023 07:58

sam-or requested review from a team as code owners September 30, 2023 07:58

sam-or added 3 commits October 3, 2023 02:11

Merge remote-tracking branch 'upstream/main' into coverage

176689a

test: Simplify type coverage generation tests

7f71339

test: Add back min python3.10 version condition

f93a498

guacs reviewed Oct 8, 2023

View reviewed changes

sam-or added 2 commits October 16, 2023 03:55

Merge remote-tracking branch 'upstream/main' into coverage

2c0a71a

fix: revert pre-commit conf change

deb72a1

sam-or force-pushed the coverage branch from e55dddf to deb72a1 Compare October 16, 2023 04:56

sam-or added 3 commits October 18, 2023 22:30

docs(type-coverage-gen): Add docs for coverage gen

85e2525

docs: Fix formatting in coverage docs

88923de

Merge branch 'main' into coverage

006307d

Merge branch 'main' into coverage

a304a99

guacs requested changes Oct 31, 2023

View reviewed changes

sam-or added 11 commits November 2, 2023 01:21

docs: Move profile coverage exmaple into test func

2df1a26

docs: Update social group example to use test func

f989775

fix: Address review comments

b337bc0

- Add missing docstrings - Move error handling around CoverageContainerCallable to inside - Formatting issue in documentation

test: Remove 3.10 requirement for coverage tests

be5e712

test: Move CustomInt definition outside of test

eb075e0

test: disable ruff UP006 in test file

5299535

test: fix social group test in docs example

93f8658

test: fix social group test in doc example

d70c526

test: Change hint dict to Dict in coverage test

d6e2ce8

test: Fix tuple annotation in coverage tests

d0cf706

Merge branch 'main' into coverage

3201015

JacobCoffee reviewed Nov 6, 2023

View reviewed changes

polyfactory/utils/model_coverage.py Outdated Show resolved Hide resolved

chore: fix formatting in docstring

ba355e5

guacs requested changes Nov 12, 2023

View reviewed changes

polyfactory/utils/model_coverage.py Show resolved Hide resolved

chore: Add docstring to CoverageContainerCallable

9c84551

guacs approved these changes Nov 12, 2023

View reviewed changes

guacs merged commit b1e8b5e into litestar-org:main Nov 12, 2023
19 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(type-coverage-generation): Model type coverage batch generation #390

feat(type-coverage-generation): Model type coverage batch generation #390

sam-or commented Sep 27, 2023 •

edited

Loading

JacobCoffee commented Sep 27, 2023

sam-or commented Sep 27, 2023

guacs commented Sep 30, 2023

sam-or commented Sep 30, 2023

sam-or commented Oct 2, 2023

guacs commented Oct 3, 2023

guacs commented Oct 4, 2023

guacs commented Oct 8, 2023 •

edited

Loading

sam-or commented Oct 8, 2023 •

edited

Loading

guacs commented Oct 8, 2023

sam-or commented Oct 8, 2023 •

edited

Loading

guacs commented Oct 18, 2023

sam-or commented Oct 18, 2023

guacs commented Oct 21, 2023

guacs left a comment

guacs left a comment

github-actions bot commented Nov 12, 2023

sam-or commented Nov 12, 2023

guacs left a comment

feat(type-coverage-generation): Model type coverage batch generation #390

feat(type-coverage-generation): Model type coverage batch generation #390

Conversation

sam-or commented Sep 27, 2023 • edited Loading

Pull Request Checklist

Description

Close Issue(s)

JacobCoffee commented Sep 27, 2023

sam-or commented Sep 27, 2023

guacs commented Sep 30, 2023

sam-or commented Sep 30, 2023

sam-or commented Oct 2, 2023

guacs commented Oct 3, 2023

guacs commented Oct 4, 2023

guacs commented Oct 8, 2023 • edited Loading

sam-or commented Oct 8, 2023 • edited Loading

guacs commented Oct 8, 2023

sam-or commented Oct 8, 2023 • edited Loading

guacs commented Oct 18, 2023

sam-or commented Oct 18, 2023

guacs commented Oct 21, 2023

guacs left a comment

Choose a reason for hiding this comment

guacs left a comment

Choose a reason for hiding this comment

github-actions bot commented Nov 12, 2023

sam-or commented Nov 12, 2023

guacs left a comment

Choose a reason for hiding this comment

sam-or commented Sep 27, 2023 •

edited

Loading

guacs commented Oct 8, 2023 •

edited

Loading

sam-or commented Oct 8, 2023 •

edited

Loading

sam-or commented Oct 8, 2023 •

edited

Loading