Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Multi-GPU support #2528

Open
otoloui opened this issue Jul 3, 2024 · 1 comment
Open

[Feature] Multi-GPU support #2528

otoloui opened this issue Jul 3, 2024 · 1 comment
Labels
backend gpt4all-backend issues chat gpt4all-chat issues enhancement New feature or request

Comments

@otoloui
Copy link

otoloui commented Jul 3, 2024

Bug Report

Right now, GPT4All only utilizes 1 GPU so for machines with multiple GPU's, it blocks them from having access to higher parameter count models to use.

Steps to Reproduce

  1. 2x RTX 3090 installed
  2. Download llama-3-70b
  3. Try and load the model and watch it load one GPU with 24gb and then crash, 2nd GPU is not utilized

Expected Behavior

In rigs where there are multiple GPU's the app should be able to split the models across them - enabling users to have access of higher parameter count models.

Your Environment

  • GPT4All version: [v3.0.0]
  • Operating System: Windows 11
  • Chat model used (if applicable):
@otoloui otoloui added bug-unconfirmed chat gpt4all-chat issues labels Jul 3, 2024
@theLoDD
Copy link

theLoDD commented Jul 24, 2024

I have the same issue. I cannot use big models with multiple GPU's with GPT4All. I can only use a one GPU. Version 3.0.0, Win10, cards 8xGTX1070_8GB.

@cebtenzzre cebtenzzre added enhancement New feature or request backend gpt4all-backend issues and removed bug-unconfirmed labels Aug 4, 2024
@cebtenzzre cebtenzzre changed the title GPT4All won't load models across multi-GPU's [Feature] Multi-GPU support Aug 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backend gpt4all-backend issues chat gpt4all-chat issues enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants