min_signal_length when creating training data #32

Open
aroelo opened this issue Oct 9, 2019 · 0 comments


aroelo commented Oct 9, 2019

Hi, I am creating a training set for deepbinner and noticed that many of my reads are excluded because their signal length is too short.

When I run the porechop command (deepbinner porechop porechop.out /path/to/fast5_dir > raw_training_data), 455782 of the 650483 reads (about 70%) are skipped for being too short.

I see that the default value of min_signal_length is 20000. I am thinking about lowering it so I don't lose as many reads, but I am unsure how this would affect the performance of deepbinner.

Is it a strict requirement for the signal length to be that long? I couldn't find more information about this parameter in the documentation.
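
To make the trade-off concrete, here is a minimal sketch of how one might tabulate the number of reads that survive at different thresholds. It is not part of deepbinner: the function name `reads_kept` and the example lengths are hypothetical, and it assumes a read is kept when its signal length is greater than or equal to the threshold, which may not match deepbinner's exact comparison.

```python
from bisect import bisect_left


def reads_kept(signal_lengths, thresholds):
    """Return {threshold: (kept_count, kept_fraction)} for each threshold.

    Assumption: a read is kept when its signal length >= threshold.
    """
    ordered = sorted(signal_lengths)
    total = len(ordered)
    summary = {}
    for threshold in thresholds:
        # bisect_left counts the reads strictly shorter than the threshold.
        too_short = bisect_left(ordered, threshold)
        kept = total - too_short
        summary[threshold] = (kept, kept / total if total else 0.0)
    return summary


# Hypothetical signal lengths, for illustration only (not real data).
example_lengths = [4500, 8000, 12000, 15000, 19999, 20000, 26000, 41000]

for threshold, (kept, fraction) in reads_kept(
    example_lengths, [5000, 10000, 20000]
).items():
    print(f"threshold {threshold}: keeps {kept} reads ({fraction:.0%})")
```

With real data, the lengths would come from the raw signal of each fast5 file. A table like this shows how many reads each candidate threshold recovers, but it does not show how shorter signals affect classification accuracy.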
