OCR slows down with ollama-vision models

**Describe the bug**
I use Ollama running minicpm to OCR image-based subtitles into text-based via SubtitleEdit 4.0.12. It starts with taking 0.5s to read the image and output text gradually slowing down. After ~40minutes of running it slowed down to 12s to process a single image. Pausing the OCR and restarting ollama-server reset the slowdown cycle and the process of slowdown started again from 0.5s

GPU Utilization also drops from initial ~80% to ~8% with momentary spikes to ~20% when the output is created. 

**How to reproduce**
Steps to reproduce the error:
1. Load the model (minicpm-v:8b)
2. Input image-based subtitles into Subtitle edit
3. Select OCR method Ollama-vision, select minicpm-v:8b 
4. Start OCR

**Screenshots**
![Image](https://github.com/user-attachments/assets/2ef8ebed-366f-445a-815d-4f06eba37e8b)
![Image](https://github.com/user-attachments/assets/36798963-0293-41ff-bd22-ef8cfc4af742)
![Image](https://github.com/user-attachments/assets/a7c3c691-6e7a-4927-b353-af6e5632782c)

Using latest nightly build of portable ipex-llm-ollama




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

OCR slows down with ollama-vision models #13195

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

OCR slows down with ollama-vision models #13195

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions