server : (webui) add support for .pdf file upload #11647

dannyl1u · 2025-02-04T07:25:41Z

Allows uploading of .pdf files, uses pdf.js to parse into text and prepends the text of the uploaded pdf(s) to the prompt.

Uploaded pdf(s) can be deleted and will not be included in the prompt.

Please let me know if any changes should be made (e.g. prompt structure)

Demo video (apologies for poor video quality, GitHub only allows up to 10mb):

fileupload.mp4

ggerganov · 2025-02-04T11:11:55Z

Does it also work with other plain-text formats? For example, .txt, .h/.cpp, etc.

woof-dog · 2025-02-04T13:44:56Z

If there is an attachment button near the "Stop"/"Send", I'd really appreciate it if it's hidden by default but able to be turned on in the settings because I have to manually press "Stop" all the time and would not like to accidentally click the file attachment button.

You might consider also turning the textarea used for entering responses into a drop zone so you can drag and drop files there. That would really make the UX better for me since having to go through a file picker UI would probably take longer than opening the file and copying+pasting.

Also it seems you are basing this on an old commit, there were several changes to the textarea in the last few days, be sure to be careful rebasing - don't want to revert those other changes.

@ggerganov It appears the file input only accepts .pdf files on this PR

ngxson · 2025-02-04T17:04:16Z

This is a nice idea. But I'm still a bit hesitate about having this function built-in. Problems are:

pdf.js does not work well with PDF containing tables and images
The bundle size is quite big, +800kb gzip in this case

My speculation is that frontend-only PDF is not that good in practice, so probably we should not add this as a permanent functionality. Instead, hidden it behind a toggle in "settings" page, and using the CDN pre-built package seems to be a better solution. If users use it and really love it, we can bundle it inside llama-server later on.

In near future, what I'm thinking is to introduce a skeleton for "experimental" UI functionalities, so more things can be added in the future without risk of breaking the UI/UX. Things already on my list are:

PDF parsing
Model context protocol (discussed in another PR)
Equivalent of "canvas" on claude / chatgpt
On-browser python (Pyodide)
Or even the whole linux emulator on-browser (WebVM)

ngxson · 2025-05-15T10:36:00Z

Superseded by #13562 , which allow converting PDF to either text or image

dannyl1u added 2 commits February 3, 2025 22:45

server: (webui) file upload and pdf parsing

8d721dc

server: (webui) use Map() to store file content

b7a0c02

dannyl1u requested a review from ngxson as a code owner February 4, 2025 07:25

github-actions bot added examples server labels Feb 4, 2025

ngxson mentioned this pull request Feb 4, 2025

Feature Request: (webui) Implement a experimental features on webui #11662

Closed

6 tasks

ngxson mentioned this pull request May 15, 2025

webui : handle PDF input (as text or image) + convert pasted long content to file #13562

Merged

ngxson closed this May 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

server : (webui) add support for .pdf file upload #11647

server : (webui) add support for .pdf file upload #11647

Uh oh!

dannyl1u commented Feb 4, 2025 •

edited

Loading

Uh oh!

ggerganov commented Feb 4, 2025

Uh oh!

woof-dog commented Feb 4, 2025

Uh oh!

ngxson commented Feb 4, 2025

Uh oh!

ngxson commented May 15, 2025

Uh oh!

Uh oh!

server : (webui) add support for .pdf file upload #11647

server : (webui) add support for .pdf file upload #11647

Uh oh!

Conversation

dannyl1u commented Feb 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ggerganov commented Feb 4, 2025

Uh oh!

woof-dog commented Feb 4, 2025

Uh oh!

ngxson commented Feb 4, 2025

Uh oh!

ngxson commented May 15, 2025

Uh oh!

Uh oh!

dannyl1u commented Feb 4, 2025 •

edited

Loading