Improve runtime for predict_library by running DeepLC and IM2Deep predictions only once #243

rodvrees · 2025-03-12T12:18:59Z

Key changes:

Changes to `predict_library` function:

Added conversion of search space to PSMList and filtering of PSMs by mz in predict_library function (ms2pip/core.py).
Moved steps to add retention time and ion mobility predictions if specified (ms2pip/core.py).

Improvements in `search_space` handling:

Introduced to_psm_list method to convert search space to PSMList (ms2pip/search_space.py).
Modified build method to ensure the number of processes is correctly set and used for parallelization (ms2pip/search_space.py).

Timing of runs on releases and this branch, on a small fasta subset of human proteome (semi-tryptic) (~240.000 PSMs, so 3 batches):
releases: 3099.83s user 1460.12s system 1312% cpu 5:47.51 total
this branch: 2405.46s user 596.67s system 1334% cpu 3:44.90 total

…brary

rodvrees added 2 commits March 12, 2025 12:53

Refactor deeplc and im2deep and batching

221fc4b

Remove IM and RT parameters from predict_batch call inside predict_li…

59da00c

…brary

rodvrees marked this pull request as draft March 12, 2025 12:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve runtime for predict_library by running DeepLC and IM2Deep predictions only once #243

Improve runtime for predict_library by running DeepLC and IM2Deep predictions only once #243

rodvrees commented Mar 12, 2025 •

edited

Loading

Improve runtime for predict_library by running DeepLC and IM2Deep predictions only once #243

Are you sure you want to change the base?

Improve runtime for predict_library by running DeepLC and IM2Deep predictions only once #243

Conversation

rodvrees commented Mar 12, 2025 • edited Loading

Changes to predict_library function:

Improvements in search_space handling:

rodvrees commented Mar 12, 2025 •

edited

Loading

Changes to `predict_library` function:

Improvements in `search_space` handling: