-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Extra large kmer #22
Comments
Hi @apaytuvi, Thanks for using cuttlefish! I'll incorporate the capability of using extra-large k-mers into cuttlefish; but that might take a little time. In the meantime, I can try posting a hack in a separate branch for you to try it out. Would it work you? Regards. |
That would be great. Thank you so much! |
Hi @apaytuvi: we've found some bug(s) in the initial k-mer enumeration phase of cuttlefish, only occurring with huge |
Thanks Jamshed for these efforts! No problem, I'll wait. Thanks again.
…________________________________
From: Jamshed Khan ***@***.***>
Sent: Sunday, October 30, 2022 9:08:56 PM
To: COMBINE-lab/cuttlefish ***@***.***>
Cc: apaytuvi ***@***.***>; Mention ***@***.***>
Subject: Re: [COMBINE-lab/cuttlefish] Extra large kmer (Issue #22)
Hi @apaytuvi<https://github.com/apaytuvi>: we've found some bug(s) in the initial k-mer enumeration phase of cuttlefish, only occurring with huge k-values (e.g. with k >= 1000)—hence the delay! I'll get back to this once we could address the issue.
—
Reply to this email directly, view it on GitHub<#22 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ACVVIQPBSXA2G2IECMQW3O3WF3IVRANCNFSM6AAAAAARK6RQIQ>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
It seems the bug has been solved and this feature should be available. Could you please confirm that @jamshed? Thanks a lot! |
Hi @apaytuvi: sorry for the delay in response! I've pushed a new branch, extra-large-k, with the required support. This needs to be compiled from source, as instructed here. But the cmake -DINSTANCE_COUNT=256 -DCMAKE_INSTALL_PREFIX=../ .. Currently this supports Let me know if you could test it successfully! |
Dear cuttlefish authors,
Thank you for this useful tool. I have a large database of genomes and I want to reduce the redundancy to reduce the computational time, improve speed and reduce RAM usage of a mapping against such a big database. I tried cuttlefish, and is useful but I would like a larger kmer, let's say, e.g. 1000. Why? Long-read technology requires long sequences for a correct mapping, but by setting low kmer lengths such as 127 most sequences remain that size, which is clearly not enough for long-read mapping. Do you have a suggestion for that?
Thanks,
The text was updated successfully, but these errors were encountered: