- Bug fixes to
inspect_types()
for pairwise comparison plots - Updated tests for
inspect_types()
for pairwise comparisons - Migrate to
vdiffr
1.0.0 for graphics tests - Fixed
#32,
improved appearance of
inspect_cat()
plots when number of categories is small. Thanks to Ivo Kwee for the suggestion. - Fixed
#40,
ensuring
inspect_num()
histogram bins are correct in comparisons of data frames. Thanks to Roel Verbelen for the report.
- Add
include_int
option ininspect_cat()
to allow treatment of integer columns as categorical. - Improved p-values associated with binned categorical and numeric
comparisons. This is now based on a modified chi-squared test and is
labelled as
pval
in the resulting output. - Fixed
#27
ensuring plots for
inspect_cat()
respect any filtering or sorting of the summary output prior toshow_plot()
. Thanks to Roel Verbelen for the report. - Additional detail in
inspect_type()
comparison of two dataframes to make it easier to see which columns and types differ.
- Minor change, ensuring all functions use
return
properly.
- Important change: the
show_plot
argument has been removed from allinspect_*()
functions. To generate visualisations of data frame summaries, please use the more flexibleshow_plot(inspect_*())
or via the pipeinspect_*() %>% show_plot()
. show_plot()
improvement that nudges points that might otherwise have coincided for dataframe comparisons of imbalance (for example, withinspect_imb(df1, df2) %>% show_plot()
)- Plots for grouped summaries:
inspect_cor()
,inspect_na()
and `inspect. inspect_cor()
slight speed up for dataframes with large numbers of columns.inspect_cor()
can be filtered prior to plotting, for exampleinspect_cor(starwars) %>% filter(abs(corr) > 0.2) %>% show_plot()
. Thanks to Roel Verbelen for the suggestion- Fixed bug causing
inspect_imb()
to fail on certain types of factor columns. Thanks to Roel Verbelen for the report. show_plot()
has new argumentslabel_size
,label_angle
andlabel_color
. Each provide adjustments to text annotation where applicable. Thanks to Bartosz Bursa for the suggestion.- changes to text annotation to improve how
coord_flip()
works on resulting plots. Thanks to Roel Verbelen for the report.
- Added
bytes
column toinspect_mem()
output, for downstream numeric comparison and consistency withinspectpd
. - Added
pcnt_nna
column toinspect_cor()
output containing the percentage of pairwise complete observations used calculated correlations. Thanks to Theo Broekman for the suggestion. - Fixed bug causing order of grouping variable in grouped
inspect_
statements to be incorrect. Thanks to the report from Theo Broekman. - Removed erroneous print statement from
inspect_num()
.
- Updates to documentation throughout.
inspect_*
functions now returns results by group grouped dataframes.- Added option for
inspect_num() %>% show_plot()
to show histograms with color palettes specified by thecol_palette
argument. - Fixed bug causing
inspect_imb()
to sometimes fail when factors present. Thanks to Doug Friedman for the report.
- Fixed error causing
inspect_num()
to fail when columns contained allNA
values. Thanks to Ryan Tanner for the report - Speed-up of
inspect_cor()
for large data frames with many numeric columns. - Added approximate confidence intervals and tests for
method = 'kendall'
andmethod = 'spearman'
ininspect_cor()
.
- Fix issue causing
inspect_na() %>% show_plot()
to fail when 0NA
present. Thanks to the report by Metin Yazici. show_plot()
now returns aggplot2
object rather than printing the plot - thanks to Garrick Aden-Buie for the suggestion.- Dramatic speed up of
inspect_cat
plotting by avoiding text labels for small regions. - Added
tech
dataset. - Fix for text annotation of
inspect_cat()
plots when labels are empty strings. By default""
will be shown. Thanks to Michael Swenson for the report inspect_cor(method = ...)
argument added, thanks to suggestion from George Dontas. Options forpearson
,spearman
andkendall
. Note that confidence intervals and tests currently only supported forpearson
.- Fix error when duplicate factor labels present in
inspect_cat()
&inspect_imb()
.
text_labels
autoscale size usingggfittext::geom_fit_text()
. For an example seeinspect_cat()
. Thanks to David Wilkins for the PR.- 6 different color palettes supported in
show_plot()
viacol_palette
argument. Colorblind friendly option specified viashow_plot(col_palette = 1)
- thanks to Richard Careaga for the suggestion. inspect_imb()
.include_na
option for categorical columns that are 100% missing, or constant are underlined in plot for easier comprehension.
inspect_cor()
- Points and whiskers changed to coloured bands for single dataframe summaries - these are easier to see when CIs are narrow.
- Points changed to bars for
inspect_cor()
comparison plots - makes it easier to see smaller differences in correlations. NA
correlations omitted frominspect_cor()
comparison when plotted. Ordering of correlations reversed to be consistent with returned tibble.
show_plot()
function (show_plot
argument ininspect_
functions will be dropped in a future version)high_cardinality
argument inshow_plot()
for combining unique or near-unique categories for plottinginspect_cat()
.progress
bars shown when processing larger datasets- Improvements to plots throughout
- Initial CRAN release