Skip to content

bug: nitro cuda windows low performance on machine has multiple GPUs - tested using Jan App #269

Closed
@hiento09

Description

@hiento09

Describe the bug
My windows machine has 3 GPUs, when I enabled all 3 GPUs, the token speed was slow (6-9/s) and it even not able to load tinyllama 1B. When I disabled 2 GPUs, 1 active only, the performance was back to normal

Screenshots

  • 3 GPUs active

    • Low performance
      image
    • Load tinyllama error
      image
  • 1 GPU active only, then the performance was back to normal
    image

Desktop (please complete the following information):

  • OS: Windows 11
  • Nvidia driver: 531.18
  • cuda version: 12.3
  • Nitro version: 0.1.27
  • GPU:
  • 1 RTX 4070ti
  • 2 RTX 1660ti

Metadata

Metadata

Labels

type: bugSomething isn't working

Type

No type

Projects

Status

Completed

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions