Skip to content

[Question]: How do I respond slowly to concurrent requests for interfaces /api/v1/chats/{chat_id}/completions? #5183

Closed
@xyk0930

Description

@xyk0930

Describe your problem

  1. The response time is about 50s when there is only one request
  2. When there are 10 concurrent requests, the last response time is 3min40s
  3. Is this because of the ragflow service itself or because the LLM is not friendly to concurrent requests?

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions