Skip to content

sort_fuzz testing DX improvements #16233

Open
Open
@blaginin

Description

@blaginin

Is your feature request related to a problem or challenge?

datafusion/core/tests/fuzz_cases is a very cool tool! It helped to spot #16228. However, once the issue was discovered, it tool several hours to understand a problem and create a reproducible example

Partially that's because debug info to reproduce the problem right now is very big and not very useful:

Image

Describe the solution you'd like

Ideas I have to improve the DX:

  • Store dataset in a file instead of printing to stdout. The current version is too big and a bit messy - and it also makes reproducing issues harder.
  • Streamline seeds. There are some hardcoded seeds, but it's unclear how to call a function with arguments to get the same error-inducing dataset. It'd help to include a hint on how to generate a test batch for debugging later.
  • Add docs on how to extend, use, and maintain the suite (including the modules above).

Describe alternatives you've considered

No response

Additional context

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    development-processRelated to development process of DataFusionenhancementNew feature or request

    Type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions