
Processing large Whole Slide Images with MESMER - Potential memory issues? #582

Open
rjesud opened this issue Feb 10, 2022 · 8 comments
Labels: bug (Something isn't working)

rjesud commented Feb 10, 2022

Hi,
Thanks for developing this useful tool! I’m eager to apply it in our research.
I am encountering what I believe is an out-of-memory crash when processing a large digital pathology whole slide image (WSI). This specific image is 76286 x 44944 (XY) at 0.325 microns/pixel. While this is large, it is not atypical in our WSI datasets. Other images in our dataset can be larger.
I have done benchmarking with the data and workflow described in another ticket here: #553. I can confirm results and processing time (6min) as shown here: #553 (comment). So it appears things are working as expected.
So, I am not sure where I am hitting an issue with my image. I assumed that the image was being sent to GPU in memory-efficient batches. Perhaps this is occurring at the pre-processing or post-processing stage? Should I explore a different workflow for Mesmer usage with images of this size?

I am using:
DeepCell 0.11.0
TensorFlow 2.5.1
cuDNN 8.2.1
CUDA Toolkit 11.3.1
GPU: Quadro P6000 computeCapability: 6.1
coreClock: 1.645GHz coreCount: 30 deviceMemorySize: 23.88GiB deviceMemoryBandwidth: 403.49GiB/s

CPU: 16 cores, 128 GB RAM

Seems to be talking to GPU:
[screenshot]

Unfortunately, does not complete:
[screenshot]

rjesud added the bug label on Feb 10, 2022
@msschwartz21 (Member)

Hi rjesud,
I expect @ngreenwald may have some specific advice to add, but in the meantime here are a few things to look at.

One notable difference between your data and the benchmarks described in #553 is that you specify image_mpp=0.325. The model was trained with 0.5, so the first thing the application does is to resize your image to match the resolution that the model was trained with. If you can, please try the options described below and let us know the results so that we can diagnose the problem.

  1. Run predictions on your data without specifying the resize. The results may not be perfect, but it will confirm that the rest of the system is working correctly: labeled_image = app.predict(im, compartment='whole-cell')
  2. Run the resize function in isolation and see if it completes in a reasonable time (a consolidated sketch of both checks follows this list):
    from deepcell_toolbox.utils import resize

    shape = im.shape
    image_mpp = 0.325
    model_mpp = app.model_mpp

    scale_factor = image_mpp / model_mpp
    new_shape = (int(shape[1] * scale_factor),
                 int(shape[2] * scale_factor))
    image = resize(im, new_shape, data_format='channels_last')
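
For reference, here is a minimal consolidated sketch of both checks, assuming im is already loaded as a (1, rows, cols, 2) array of nuclear + membrane channels and that the application is constructed in the standard way; adjust the names to your own pipeline:

    from deepcell.applications import Mesmer
    from deepcell_toolbox.utils import resize

    app = Mesmer()

    # Check 1: predict at native resolution (no image_mpp, so no internal resize)
    labeled_image = app.predict(im, compartment='whole-cell')

    # Check 2: run only the resize that predict() would otherwise do internally
    scale_factor = 0.325 / app.model_mpp  # image_mpp / model_mpp
    new_shape = (int(im.shape[1] * scale_factor),
                 int(im.shape[2] * scale_factor))
    resized = resize(im, new_shape, data_format='channels_last')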

@ngreenwald (Collaborator)

There's indeed a post-processing step that operates across the entire image. I'm guessing for something this large, it will make more sense to generate segmentation predictions on large patches (say 10k x 10k pixels), and then stitch them together into a single image. This isn't something that's currently supported by the application itself, but you could implement it with a for loop. If it ends up being useful, we could discuss incorporating it as an option inside the application, perhaps with some intelligent behavior at the border between patches.


rjesud commented Feb 10, 2022

@msschwartz21,

Thank you for the suggestion.

  1. Running the prediction without specifying the resize did not complete. Again, I assume it ran out of memory.
  2. Interestingly, the resize function also fails.

@ngreenwald, tile/patch-wise post-processing would be great! Can we add this as a feature request?


ngreenwald commented Feb 10, 2022

Absolutely, feel free to open a separate issue detailing the requirements and what an ideal interface would look like. No promises on having something immediately ready. In the meantime, something like this should work for you:

import numpy as np

# im is assumed to have shape (1, rows, cols, channels); app.predict returns a
# (1, rows, cols, 1) label image, so allocate a single output channel
labeled_image = np.zeros_like(im[..., :1])
step_size = 10000

for row in range(0, im.shape[1], step_size):
    for col in range(0, im.shape[2], step_size):
        labeled_image[:, row:(row + step_size), col:(col + step_size)] = \
            app.predict(im[:, row:(row + step_size), col:(col + step_size)])
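
(Each tile is segmented independently in the snippet above, so label values restart for every tile and cells that span a tile boundary will be split; the overlap idea discussed in the comments below is one way to soften the border effects.)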


rjesud commented Feb 12, 2022

@ngreenwald Thanks for this snippet! I'll give it a try. We'll probably have to get fancy with tile overlaps to solve the border cell issue.
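
One possible overlap scheme (a rough sketch, not a built-in deepcell option; the step size, overlap width, and cropping logic below are illustrative assumptions):

    import numpy as np

    step_size = 10000
    overlap = 256  # extra context on each side, roughly a few cell diameters

    rows, cols = im.shape[1], im.shape[2]
    labeled_image = np.zeros((1, rows, cols, 1), dtype=np.int32)

    for row in range(0, rows, step_size):
        for col in range(0, cols, step_size):
            # expanded window, clamped to the image bounds
            r0, r1 = max(row - overlap, 0), min(row + step_size + overlap, rows)
            c0, c1 = max(col - overlap, 0), min(col + step_size + overlap, cols)
            tile_pred = app.predict(im[:, r0:r1, c0:c1], compartment='whole-cell')
            # keep only the central (non-overlapping) region of the prediction
            top, left = row - r0, col - c0
            height = min(step_size, rows - row)
            width = min(step_size, cols - col)
            labeled_image[:, row:row + height, col:col + width] = \
                tile_pred[:, top:top + height, left:left + width]
            # note: label IDs restart at 1 in every tile; relabel afterwards if
            # globally unique IDs are needed

This gives cells near a tile edge their full context, although cells that straddle the cut line itself are still split between tiles.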

@ngreenwald (Collaborator)

Yes, although with 10k x 10k crops, the cells at the border would be quite a small percentage of the overall image, so it might be fine to just leave them as is. Up to you of course.


rjesud commented Feb 22, 2022

Hi, I have a follow-up question. Is the predict function executing normalization of the image array prior to inference? If so, is it possible to override this step and perform it outside of the function? That would be the preferred way to ensure each tile uses the same normalization scaling.

@ngreenwald (Collaborator)

Yes, you can pass a kwarg to the function to control whether normalization happens or not, see here. However, I would verify that you see a decrease in performance before substituting your own normalization. The normalization we're using, CLAHE, already adjusts across the image to some degree, so I don't know that it will necessarily cause issues to have it applied to different tiles, especially tiles as large as 10k x 10k.

The only area where I've seen issues before is if entire tiles are devoid of signal, and it's just normalizing background to background.
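
For example, something along these lines might work (a hedged sketch: my_global_normalization is a placeholder for your own scaling, and the exact preprocess keyword names should be confirmed against the mesmer_preprocess code referenced above):

    # Normalize the whole image once outside of predict(), then ask the app to
    # skip its own preprocessing for each tile. The 'normalize'/'threshold' keys
    # are assumptions based on the linked preprocessing code; check your
    # deepcell version for the exact names.
    im_norm = my_global_normalization(im)  # hypothetical user-supplied function
    labeled_image = app.predict(
        im_norm,
        compartment='whole-cell',
        preprocess_kwargs={'normalize': False, 'threshold': False},
    )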
