Skip to content

Design data extract and data merge #36

@pawelru

Description

@pawelru

Motivation

Data extract/data merge has been criticised by many parties mainly for opaqueness and complicated API. The main goal of the refactor should be:

  1. Remove independent filtering on the selector level, to avoid scenarios like below where ADLB is merged to ADLB. No other data-visualization tools offers such a functionality. Possible data transformation (reshape or filter) should be done once for dataset before variables selection. Independent filtering also complicates API call to initialise encoding panel. data_extract_spec sometimes covers more than hanf of the code space:

    a <- data_extract_spect("ADSL", select_spec(...))
    b <- data_extract_spec("ADLB", filter_spec(<PARAMCD>), filter_spec(<AVISIT>), select_spec(...))
    c <- data_extract_spec("ADLB", select_spec(...))
    Screenshot 2022-05-18 at 06 37 53

    Selecting variables to the module should be minimalized to specifying variables only (without data transformation). Still open question whether we allow to select from different datasets (support automatic merge) or allow to select from one dataset only (no automatic merge).

    a <- select_variables(<select from numeric vars in ADSL>)
    b <- select_variables(<AVAL from ADLB>)
    c <- select_variables(<select from factor vars in ADSL>)
  2. If we decide to keep offering automatic merge we need to provide UI element informing that there is merge and how it's performed (used keys, shape of the final ANL)

Solutions in the comments.

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

Status

Todo

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions