ard_compare draft #437 #527

malanbos · 2025-12-01T15:24:30Z

What changes are proposed in this pull request?

New function to compare two ARDs: ard_compare

It comes with the top-level ard_compare function, plus two helper scripts, ard_compare_helpers.R and check_environment.R, to modularize it.

Pre-review Checklist (if item does not apply, mark it as complete)

All GitHub Action workflows pass with a ✅
PR branch has pulled the most recent updates from master branch: usethis::pr_merge_main()
If a bug was fixed, a unit test was added.
Code coverage is suitable for any new functions/features (generally, 100% coverage for new code): devtools::test_coverage()
Request a reviewer

Reviewer Checklist (if item does not apply, mark it as complete)

If a bug was fixed, a unit test was added.
Run pkgdown::build_site(). Check the R console for errors, and review the rendered website.
Code coverage is suitable for any new functions/features: devtools::test_coverage()

When the branch is ready to be merged:

Update NEWS.md with the changes from this pull request under the heading "# cards (development version)". If there is an issue associated with the pull request, reference it in parentheses at the end of the update (see NEWS.md for examples).
All GitHub Action workflows pass with a ✅
Approve Pull Request
Merge the PR. Please use "Squash and merge" or "Rebase and merge".

Optional Reverse Dependency Checks:

Install checked with pak::pak("Genentech/checked") or pak::pak("checked")

# Check dev versions of `cardx`, `gtsummary`, and `tfrmt` which are in the `ddsjoberg` R Universe
Rscript -e "options(checked.check_envvars = c(NOT_CRAN = TRUE)); checked::check_rev_deps(path = '.', n = parallel::detectCores() - 2L, repos = c('https://ddsjoberg.r-universe.dev', 'https://cloud.r-project.org'))"

# Check CRAN reverse dependencies but run tests skipped on CRAN
Rscript -e "options(checked.check_envvars = c(NOT_CRAN = TRUE)); checked::check_rev_deps(path = '.', n = parallel::detectCores() - 2, repos = 'https://cloud.r-project.org')"

# Check CRAN reverse dependencies in a CRAN-like environment
Rscript -e "options(checked.check_envvars = c(NOT_CRAN = FALSE), checked.check_build_args = '--as-cran'); checked::check_rev_deps(path = '.', n = parallel::detectCores() - 2, repos = 'https://cloud.r-project.org')"

ddsjoberg

Thank you @malanbos for this!

For now, let's skip env handling. I think it's more complex than we need, and we can revisit it in the future. Let me know if you'd like to discuss.

ddsjoberg · 2025-12-11T00:26:02Z

R/ard_compare.R

+#'
+#' ard_compare(ard_base, ard_modified)$stat
+#'
+ard_compare <- function(x, y, key_columns = NULL) {


All of our functions that begin with ard_*() create new ARDs. Let's name the function compare_ard().

ddsjoberg · 2025-12-11T00:28:14Z

R/ard_compare.R

+#'
+#' ard_compare(ard_base, ard_modified)$stat
+#'
+ard_compare <- function(x, y, key_columns = NULL) {


Let's make the default value keys = c(all_ard_groups(), all_ard_variables(), any_of(c("variable", "variable_level", "stat_name"))).

ddsjoberg · 2025-12-11T00:29:31Z

R/ard_compare.R

+#'
+#' ard_compare(ard_base, ard_modified)$stat
+#'
+ard_compare <- function(x, y, key_columns = NULL) {


Let's add an argument of the columns to compare, compare = any_of(c("stat_label", "stat", "stat_fmt")).

(Is there anything else we should compare by default?)
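To make the two proposed defaults concrete, here is a minimal base-R sketch of which columns they might resolve to on a toy data frame. It does not call cards: `resolve_defaults()` is a hypothetical helper, and the regular expression is only an approximation of what `all_ard_groups()` selects (assumed here to be columns named like `group1` and `group1_level`). The `intersect()` calls mimic how `any_of()` silently ignores names that are absent.

```r
# Hypothetical helper: approximates the proposed default selections in base R.
resolve_defaults <- function(x) {
  nms <- names(x)
  # Assumption: grouping columns are named group1, group1_level, group2, ...
  group_cols <- grep("^group[0-9]+(_level)?$", nms, value = TRUE)
  # any_of()-style selection: keep only the names that are actually present
  other_keys <- intersect(c("variable", "variable_level", "stat_name"), nms)
  list(
    keys = c(group_cols, other_keys),
    compare = intersect(c("stat_label", "stat", "stat_fmt"), nms)
  )
}

# Toy data frame shaped loosely like an ARD (it has no stat_fmt column)
toy <- data.frame(
  group1 = "ARM", group1_level = "Placebo",
  variable = "AGE", variable_level = NA,
  stat_name = "mean", stat_label = "Mean", stat = 75.2
)

resolve_defaults(toy)
```

Because the toy data has no `stat_fmt` column, that name simply drops out of the compare set rather than raising an error, which is the behaviour the `any_of()` wrapper is meant to give.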

ddsjoberg · 2025-12-11T00:30:03Z

R/ard_compare.R

+  check_class(x, cls = "card")
+  check_class(y, cls = "card")
+
+  .validate_environment_metadata(x, y, call = get_cli_abort_call())


Let's remove the env checking for now. It's quite complicated.

ddsjoberg · 2025-12-11T00:37:04Z

R/ard_compare.R

+
+  .validate_environment_metadata(x, y, call = get_cli_abort_call())
+
+  primary_x <-


Here we can evaluate the keys and compare columns with

keys <- .process_keys_arg(x, y, keys = {{ keys }})
compare <- .process_compare_arg(x, y, compare = {{ compare }})

# outside the function we define these functions
.process_keys_arg <- function(x, y, keys) {
  # keep only the key columns that are present in both ARDs
  keys <- intersect(cards_select({{ keys }}, data = x), cards_select({{ keys }}, data = y))
  .check_not_empty(keys)
  cli::cli_inform("The comparison {.arg keys} are {.emph {.val {keys}}}.")
  keys
}

.process_compare_arg <- function(x, y, compare) {
  # add checks and return evaluated compare vector...
}

.check_not_empty <- function(x, arg_name = rlang::caller_arg(x)) {
  if (rlang::is_empty(x)) {
    cli::cli_abort("The {.arg {arg_name}} argument cannot be empty.")
  }
  invisible(x)
}

ddsjoberg · 2025-12-11T04:01:58Z

R/ard_compare.R

+  fmt_column <- if ("fmt_fun" %in% names(x) || "fmt_fun" %in% names(y)) {
+    "fmt_fun"
+  } else if ("fmt_fn" %in% names(x) || "fmt_fn" %in% names(y)) {
+    "fmt_fn"
+  } else {
+    "fmt_fun"
+  }


Let's just use the columns provided in the compare argument to assess which comparisons to make. We can compare all columns in the same way.

ddsjoberg · 2025-12-11T04:10:59Z

R/ard_compare.R

+    y_selected <- .ensure_column(y_selected, column)
+  }
+
+  # .check_rows_not_in_x_y(x_selected, y_selected, key_columns)


Here we can initialize an empty list of results.

results <- rlang::rep_named(c("rows_in_x_not_y", "rows_in_y_not_x"), list(NULL))
results[["compare"]] <- rlang::rep_named(compare, list(NULL))

In this example the "compare" element will also be a named list. The names are the columns that we compare.

We could then follow this up with calls to functions that will populate these parts of the list, e.g.

# returns the results of the anti join of x and y on the key columns
results[["rows_in_x_not_y"]] <- .compare_rows(x, y)
# same as above, but reversed
results[["rows_in_y_not_x"]] <- .compare_rows(y, x)
# loop through the columns we will compare and return a named list of data frames,
# where each data frame contains the rows that are not equal between x and y.
# The data frame will have the key columns and the two columns compared (from x and y).
results[["compare"]] <- .compare_columns(x, y, compare)
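As a rough illustration of the two helpers described above, here is a base-R sketch on plain data frames. The function names, the extra `keys` argument, and the `.x`/`.y` suffixes are assumptions made for the example, not the package's implementation. It also uses atomic columns only, whereas real ARDs store `stat` as a list column, which `merge()` may not handle.

```r
# Illustrative stand-ins only; names and signatures are assumptions.

# Rows of x whose key combination does not appear in y (an anti join).
compare_rows <- function(x, y, keys) {
  make_id <- function(d) do.call(paste, c(unname(as.list(d[keys])), sep = "\x1f"))
  x[!make_id(x) %in% make_id(y), , drop = FALSE]
}

# For each compared column, the key columns plus the x and y values
# for rows present in both inputs whose values differ.
compare_columns <- function(x, y, keys, compare) {
  both <- merge(x[c(keys, compare)], y[c(keys, compare)],
                by = keys, suffixes = c(".x", ".y"))
  out <- lapply(compare, function(col) {
    cols <- paste0(col, c(".x", ".y"))
    same <- vapply(
      seq_len(nrow(both)),
      function(i) identical(both[[cols[1]]][[i]], both[[cols[2]]][[i]]),
      logical(1)
    )
    both[!same, c(keys, cols), drop = FALSE]
  })
  stats::setNames(out, compare)
}

x <- data.frame(variable = c("AGE", "AGE", "SEX"),
                stat_name = c("mean", "sd", "n"),
                stat = c(75.2, 8.1, 100))
y <- data.frame(variable = c("AGE", "AGE", "BMI"),
                stat_name = c("mean", "sd", "mean"),
                stat = c(75.2, 8.3, 24.9))
keys <- c("variable", "stat_name")

results <- list(
  rows_in_x_not_y = compare_rows(x, y, keys),
  rows_in_y_not_x = compare_rows(y, x, keys),
  compare = compare_columns(x, y, keys, compare = "stat")
)
results
```

In this toy case the SEX row is reported as present only in `x`, the BMI row as present only in `y`, and the AGE/sd row is the single `stat` mismatch.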

ddsjoberg · 2025-12-11T04:17:55Z

R/ard_compare.R

+
+  names(mismatch_list) <- names(comparison_targets)
+
+  mismatch_list


Lastly, the function will return the results object, and add a class onto this list.

After we get this settled, we will write a print method for the class to make it nice.
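One possible shape for that last step, sketched in base R. The class name `ard_comparison`, the constructor, and the print layout are all placeholders chosen for illustration; nothing in the review fixes them yet.

```r
# Hypothetical constructor: tags the results list with an S3 class.
new_ard_comparison <- function(results) {
  structure(results, class = "ard_comparison")
}

# Hypothetical print method: a short summary instead of the raw list.
print.ard_comparison <- function(x, ...) {
  cat("ARD comparison\n")
  cat("  rows in x not y:", NROW(x$rows_in_x_not_y), "\n")
  cat("  rows in y not x:", NROW(x$rows_in_y_not_x), "\n")
  for (col in names(x$compare)) {
    cat("  mismatches in", col, ":", NROW(x$compare[[col]]), "\n")
  }
  invisible(x)
}

# Toy results object with the three elements discussed in the review
res <- new_ard_comparison(list(
  rows_in_x_not_y = data.frame(variable = "SEX"),
  rows_in_y_not_x = data.frame(variable = character()),
  compare = list(
    stat = data.frame(variable = "AGE", stat.x = 8.1, stat.y = 8.3)
  )
))

print(res)
```

Returning the object invisibly from the print method follows the usual S3 convention, so the result can still be assigned or piped after printing.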

malanbos added 5 commits November 11, 2025 23:46

initial ard_compare()

4a6fec3

remove unneeded check

214de0b

split out checking of environments for ard_compare

e836002

split out functions to separate helper file

ce571cb

Allowed ard_compare() to proceed without aborting when ARDs have non-…

e73c917

…overlapping key values, enabling mismatches to surface through the full join comparison.

malanbos requested a review from ddsjoberg December 1, 2025 15:24

qualify utils::head

ea53426

ddsjoberg requested changes Dec 11, 2025

Merge branch 'main' into main

654f965
