Skip to content

Error: segfault from C stack overflow #5243

Closed
@bhive01

Description

I have a data analysis pipeline who's last step is knitting an HTML report with all of the plots I've made. I've been running the latest tidyverse from github because I like across() and where(), but one dataset has routinely been giving me the error in the title and no other information.
I don't know how to debug this error or dig into the code to understand what function is causing the error. I've narrowed it down to the one file, but the minimal example I have is not minimal in any way and the data I have is proprietary so I've put it here for now... 😬 😬 😬
https://drive.google.com/file/d/1lhz7kOTZTKT7Dt0D5bcmbwSldiOLwe6F/view?usp=sharing

I've posted my session info here because it's huge:
https://gist.github.com/bhive01/86f54f459b287784274c16f0586f08bc
My tibble looks like this:

> moa
# A tibble: 3 x 16
  file                        check_map       SPECIES            COMNM     LCCD  TRLID   MATNM     data              TRT_order bar_color corr_df          pca_title                          PCAs             corr_mat corr_plot var_data        
  <chr>                       <list>          <chr>              <chr>     <chr> <chr>   <chr>     <list>            <list>    <list>    <list>           <chr>                              <list>           <list>   <list>    <list>          
1 Strawberry (NZS1802)_Maste… <tibble [5 × 5… Fragraria x anana… Strawber… NZPN  NZS1802 Albion    <tibble [1,483 ×… <chr [5]> <chr [5]> <tibble [60 × 4… NZPN   Strawberry - NZS1802   Alb… <tibble [120 × … <rcorr>  <gg>      <tibble [478 × …
2 Strawberry (NZS1802)_Maste… <tibble [5 × 5… Fragraria x anana… Strawber… NZPN  NZS1802 Camarosa  <tibble [131 × 1… <chr [5]> <chr [5]> <tibble [17 × 1… NZPN   Strawberry - NZS1802   Cam… <tibble [48 × 2… <rcorr>  <gg>      <tibble [189 × …
3 Strawberry (NZS1802)_Maste… <tibble [5 × 5… Fragraria x anana… Strawber… NZPN  NZS1802 San Andr… <tibble [1,233 ×… <chr [5]> <chr [5]> <tibble [50 × 5… NZPN   Strawberry - NZS1802   San… <tibble [130 × … <rcorr>  <gg>      <tibble [518 × …

This tibble is processed by the following functions moa %>% render_html_reports(.), which call these functions.

render_one <- function(df, COMNM, LCCD, TRLID) {
	rmarkdown::render(
		here('PerTrialReportDev.Rmd'),
		output_file = here("DEVoutputs", glue::glue("{COMNM}"), glue::glue("{LCCD}"), glue::glue("{TRLID}"), glue::glue("{LCCD}_{TRLID}_Experiment_Data_Summary.html")),
		params = list(TRLID = TRLID, input_data = df, trial_description = glue::glue("{LCCD}_{COMNM}-{TRLID}"))
	)
}

render_html_reports <- function(df) {
	df %>% 
		nest(trl_data = -c(SPECIES, COMNM, LCCD, TRLID))  %>%
		{pwalk(list(.$trl_data, .$COMNM, .$LCCD, .$TRLID), ~render_one(..1, ..2, ..3, ..4))}
			
	return(df)
}

The Rmd file is here:
https://gist.github.com/bhive01/6f68c3301dbfb75ba0063db2abbc7f64

The trial that I'm running has the most visualizations out of all, but I'm watching the RAM on my machine and it's not running out of RAM (R using 3 of 32 GB). I'm on a newish Mac Mini.

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions