-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Incredibly small assembly #54
Comments
HI, Best, |
The largest contig is 94Kb and the next largest is in the 20Kb range. There are bacteria inside the fungal sample, but I expect at least 3Mb for those genomes. genomescope2.0 estimates 24Mb, which corresponds to my haslr assembly. I have not used this genome size estimator before so I will include the output for the illumina forward reads to see what you think. |
property min max |
Hi, |
pandora.minia.41.log I do not have access to a computer with enough memory to run WenganD at the moment. Our compute cluster is moving to a new scheduler soon and we are being advised to wait until after the transition before requesting new software. |
I see in the logs that minia generated 4.8 Million contigs with a total assembly length of 1.8Gb, from this numbers I can conclude that the minia assembly is extremely fragmented (Average contig length of 400bp), by default Wengan discard contigs shorter than 500bp, moreover contigs larger than 2kb are used to build the assembly backbone. Although you can modify the 2kb parameter(-M 2000), the minimum recommended is 1kb (-M 1000) but I think that will be not enough if you assemble the short reads with minia. Then, most contigs are being discarded due to these length constraints, you end up with a much shorter assembly. My recommendation is to try WenganA or WenganD, as your genome is not large, WenganD might be able to finish in a machine with 50-60Gb RAM. Alex |
I switched to my institution's compute cluster to use WenganD. Are you familiar with this error message? gzip: stdout: Broken pipe Below is my submission script. /lustre/haven/user/wbrewer5/wengan/wengan.pl -x ontraw -a D |
Well, the message is not very informative. Perhaps the job was killed? |
I am trying to assemble a fungal (maybe some bacterial hitchhikers) genome with 12.16 GB zipped Nanopore sequencing and 12.5 GB zipped Illumina reads. The current issue is that my first assembly using Wengan is only 651Kb in length. I will post my code and output below. Which log files are helpful in finding the issue?
wengan.pl -x ontraw
-a M
-s pand2_fwd.fastq.gz,pand2_rev.fastq.gz
-l pandora_clean_nanopore.fastq.gz
-p pandora
-t 8
-g 110
pandora.liger.log
The text was updated successfully, but these errors were encountered: