Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Millions of .bam files created during POLCA #329

Open
drew-olson-tmru opened this issue Sep 20, 2023 · 2 comments
Open

Millions of .bam files created during POLCA #329

drew-olson-tmru opened this issue Sep 20, 2023 · 2 comments

Comments

@drew-olson-tmru
Copy link

Hi,

I ran POLCA with default parameters:
polca.sh -a $ASMFILE -r 'read1.fastq.gz read2.fastq.gz' -t 64 -m 20

However, I cancelled the job after 20 hours on a single fungal genome (~45Mb) and after over 11 million intermediate files were created (mostly .bam files). Is this supposed to happen? I have executed the code from both my HPC server installation and from a conda environment and I obtained the same result. This seems like this is not supposed to happen as this is supposed to be a fast and accurate tool to use. Are the intermediate files not being disposed of properly during the job?

Any help would be apreciated!

@Ifigene
Copy link

Ifigene commented Dec 13, 2023

Dear developers,

I have the same problem. I have also used the default parameters and version 4.0.9.

Can you please help?

@Ifigene
Copy link

Ifigene commented Dec 19, 2023

The solution I have found is to remove the option -m for memory.
Then masurca outputs the "normal" amount of files!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants