Problem
Long documents currently require excessive GPU memory allocation.
Current workaround
Commit fe8a934 increases GPU memory allocation, which is not scalable.
Proposed solution
Split long PDF documents into smaller chunks before processing instead of allocating more GPU memory.