Skip to content

[FEATURE]: Add quality checks workflow #173

Open
@mwojtyczka

Description

@mwojtyczka

Is there an existing issue for this?

  • I have searched the existing issues

Problem statement

When DQX is installed in the workspace, we provide profiling workflow/job to generate quality rule candidates. After that user can use the generated checks to do quality checks. However, performing the actual quality checking requires code level integration. In some cases, users may want to just do quality checking on the existing input data without performing any transformations.

Proposed Solution

Add quality checks workflow/job, that can be run to assess data quality of a dataset and write out quarantine and output to locations as specified in the config. This won't give the possibility to perform any data transformations as part of the quality checking but it would be useful in a lot of scenarios.

User journey:

  1. (option) Profiler job to profile the data and generate quality rule candidates
  2. Run Quality Check job to assess data quality of the input data, and persist the results as per config
  3. Refresh quality dashboard

Additional Context

No response

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions