Description
Describe the bug
When running the pipeline, a problem with a single sample causes the entire run to fail. After removing the problematic sample and resuming the run, the pipeline takes significantly longer to recover and reuse cached results than earlier versions did. Although the cache is flagged, the pipeline appears to redo parts of the analysis instead of reusing the cached results.
Impact
This issue is causing much longer run times and inefficient resource usage when recovering from an error. It slows down the workflow considerably, especially when working with large datasets.
To Reproduce
Environment: Local (Docker profile used)
Pipeline version: cdcgov/phoenix v2.1.1
Command run: nextflow run main.nf -profile docker -entry PHOENIX --input <samplesheet.csv> --kraken2db
Custom config file: Not used
Text of the error:
No specific error text upon resuming. After removing the faulty sample, resuming is significantly slower, and parts of the analysis that had already completed are re-run even though the cache is flagged.
Files that caused the error:
Available upon request if needed.
Expected behavior
After removing the problematic sample and resuming, the pipeline should quickly reuse the cached results and only rerun necessary steps, similar to behavior in earlier versions.
Screenshots
N/A
Logs
Logs (.nextflow.log and the relevant .command.err / .command.out files) can be provided if needed.
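To quantify how much work is actually being redone on resume, the log could be summarized with something like the rough sketch below. It assumes that .nextflow.log records each task on a line containing `[xx/xxxxxx] Cached process > NAME (tag)` or `[xx/xxxxxx] Submitted process > NAME (tag)`; that line shape is an assumption and may differ between Nextflow versions. The function names (`summarize_resume`, `print_summary`) are made up for this illustration.

```python
import re
from collections import Counter

# Assumed shape of task lines in .nextflow.log (not confirmed for every
# Nextflow version):
#   ... - [ab/123456] Cached process > SOME_PROCESS (sample1)
#   ... - [cd/789abc] Submitted process > SOME_PROCESS (sample2)
TASK_LINE = re.compile(
    r"\[(?P<hash>[0-9a-f]{2}/[0-9a-f]{6})\] "
    r"(?P<state>Cached|Submitted) process > (?P<name>.+)$"
)


def summarize_resume(log_lines):
    """Count cached vs. submitted (re-run) tasks per process name."""
    counts = {"Cached": Counter(), "Submitted": Counter()}
    for line in log_lines:
        match = TASK_LINE.search(line.rstrip())
        if not match:
            continue  # not a task line
        # Drop the trailing "(tag)" so tasks are grouped by process.
        process = re.sub(r"\s*\([^()]*\)$", "", match.group("name"))
        counts[match.group("state")][process] += 1
    return counts


def print_summary(counts):
    """Print one tab-separated row per process."""
    processes = sorted(set(counts["Cached"]) | set(counts["Submitted"]))
    for process in processes:
        cached = counts["Cached"][process]
        submitted = counts["Submitted"][process]
        print(f"{process}\tcached={cached}\tsubmitted={submitted}")
```

For example, `print_summary(summarize_resume(open(".nextflow.log")))` run in the launch directory after a resumed run would list, per process, how many tasks were taken from the cache and how many were submitted again. A process with a high submitted count for samples that had already finished would point to where the cache is not being reused.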
Additional context
In previous versions of the pipeline, resuming after error recovery was much faster and more efficient. This change in behavior might be related to recent updates in the pipeline structure or Nextflow version handling.