Skip to content

higher latency than TGI #335

Closed
Closed
@gravitywp

Description

@gravitywp

Is it normal to have higher latency than TGI with a low concurrency, such as 1 or 4?

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions