-
Hi, I'm new to this. I have installed this PDF chatbot via docker compose on Unraid, but it's not using the GPU and takes 15 to 20 minutes to generate a response. Any help and guidance on this? I want to ask if this supports Nvidia GPUs.
-
Hello @drmetro09. If you have an Nvidia GPU and you've installed the correct drivers, you should be able to use the GPU with the container. PS: If you're setting up and running everything for the very first time, you will need to wait for the model to download and the embedding model to load. Once these steps are done, the responses should be pretty fast. Also, I've added benchmarks in the readme file for reference. Happy exploring!
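For anyone else hitting this on compose: GPU passthrough usually has to be declared explicitly. Below is a minimal sketch of the relevant compose fragment, assuming the Ollama service is named `ollama`; the service name and image are illustrative and should be adapted to the project's actual `docker-compose.yml`. It also assumes the NVIDIA Container Toolkit is installed on the host.

```yaml
# Hypothetical fragment -- adapt service/image names to the project's compose file.
services:
  ollama:
    image: ollama/ollama
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all          # or a specific number of GPUs
              capabilities: [gpu]
```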
-
Hi, thanks for the quick response, but when GPU support is enabled it gives a cuBLAS error 15 and the bot doesn't give any response. I have sent you a request on Insta. Could you please guide me with this? I'm new to this.
-
The error is originating from the Ollama server module. Do you have the correct Nvidia drivers installed? If you do, could you share some screenshots of the same? And maybe share the details of your execution environment so I can get a better understanding of the issue. You could have a look at the Ollama docker setup documentation to see if it helps you. There seems to be a similar issue reported in the Ollama repo; see if you can correlate your issue to that. Also, I didn't quite understand this. :)
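As a quick sanity check before digging into the app itself, these are standard (not project-specific) commands for verifying that the driver and the container runtime can both see the GPU; the container name in the last line is a placeholder for whatever name compose gave the Ollama container.

```shell
# On the host: confirm the Nvidia driver is loaded and the GPU is visible
nvidia-smi

# Confirm Docker can pass the GPU into a container
# (requires the NVIDIA Container Toolkit; the CUDA image tag is illustrative)
docker run --rm --gpus all nvidia/cuda:12.2.0-base-ubuntu22.04 nvidia-smi

# Inspect the Ollama container logs for CUDA/cuBLAS load failures
docker logs <ollama-container-name> 2>&1 | grep -iE "cuda|cublas"
```

If the second command fails while the first succeeds, the problem is the container toolkit setup rather than the drivers or the chatbot.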
-
I have tried various Ollama models like orca-mini, llama2, and starling-lm; all behave the same.
-
Update: It's finally working now. I downgraded to v0.1.10 and everything is working fine; it's using the GPU without errors. I tried the llama2 model, haven't tried other models.
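If the downgrade is what fixed it, pinning the image tag in compose avoids silently picking the broken version back up on the next pull. A minimal sketch, assuming the Ollama service is the one being pinned and that `0.1.10` is the tag referred to above:

```yaml
# Hypothetical: pin the image to the known-good tag instead of latest
services:
  ollama:
    image: ollama/ollama:0.1.10
```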
-
Great to hear that you were able to get it working! I have personally tried different models as well. Regarding the feature/enhancement suggestions:
Cheers!
Thanks for your amazing work. I have a few suggestions listed below; I hope you would consider them for this project.