This repository was archived by the owner on Apr 26, 2023. It is now read-only.
This repository was archived by the owner on Apr 26, 2023. It is now read-only.
On optimization of access to hard disk. xmipp-extract-particles (and in geenral any extract particle protocol) #1798
Open
Description
Describe the bug
xmipp extract particles is in launching state for two hours before starting the first step
To Reproduce
Steps to reproduce the behavior:
- project 2018_10_10_rocio-arranz_cct_oct2018 (jaime)
- project contains around 6000 movies
- execute xmipp extract particles
- before creating the first step the protocol import the coordinates (in this case 1.500.000). This takes 90 minutes. during these 90 minutes the state of the protocol is launching and the
user does not get any feedback.
Coments
- I have modified the protocol so it prints a line each 50 processed movies. Nevertheless this is an step that should be optimized. Not sure this affects when running in streaming mode
- If I repeat the experiment with 200 movies (and 48000 coordinates) it takes 9 seconds
- processing the first 200 movies using the 6000 movies set takes more than 2 minutes. So accessing to the pos files when there are many is very time consuming.
Suggestions
- Reduce the number of accesses to disk -> batch programming, process several pos files together