-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow .gz files as input #2
Comments
Hi Shahab, I wonder whether this has yet been implemented? I'm working on a huge dataset and not having to unzip all fastq files would be super useful. Many thanks, Ben |
Hi Ben, ultimately inputs need to be decompressed since skmer runs jellyfish internally, and jellyfish doesn't support compressed inputs. I can add .gz input support to skmer, but be ware that effectively it would decompress the input, write it to a temp disk space, and then remove it after the processing, something that can be done using a wrapper (ex. bash) script too. I'll try to implement and test this when I find some time to work on it. I'll post here once done. |
Hi Shahab, Thank you for providing these insights. It might be quite a computational burden if decompressing (and after running Skmer re-compressing) needs to be done in a single job, and maybe your suggestion of a wrapper script makes more sense. I've done that for now on my on task. But if you can find a straightforward way of dealing with compressed files using parallelisation (e.g. using pigz) then it might still be worthwhile as it would make Skmer a bit more user-friendly. |
No description provided.
The text was updated successfully, but these errors were encountered: