Skip to content

Improve runtime for predict_library by running DeepLC and IM2Deep predictions only once #243

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 2 commits into
base: releases
Choose a base branch
from

Conversation

rodvrees
Copy link
Member

@rodvrees rodvrees commented Mar 12, 2025

Key changes:

Changes to predict_library function:

  • Added conversion of search space to PSMList and filtering of PSMs by mz in predict_library function (ms2pip/core.py).
  • Moved steps to add retention time and ion mobility predictions if specified (ms2pip/core.py).

Improvements in search_space handling:

  • Introduced to_psm_list method to convert search space to PSMList (ms2pip/search_space.py).
  • Modified build method to ensure the number of processes is correctly set and used for parallelization (ms2pip/search_space.py).

Timing of runs on releases and this branch, on a small fasta subset of human proteome (semi-tryptic) (~240.000 PSMs, so 3 batches):
releases: 3099.83s user 1460.12s system 1312% cpu 5:47.51 total
this branch: 2405.46s user 596.67s system 1334% cpu 3:44.90 total

@rodvrees rodvrees marked this pull request as draft March 12, 2025 12:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

1 participant