Any optimization options for H100? #2107

Closed as not planned
@Archmilio

Thank you for your hard work.

I see no significant performance difference between A100 and H100. I used the official vLLM 0.2.4 image from Docker Hub.

I set both the prompt and completion lengths to 500, and the request takes 19 seconds on both A100 and H100.
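To make runs easier to compare, the figures above can be turned into a rough throughput number. This is only a minimal sketch, assuming "500" means 500 completion tokens for a single request; the helper name `tokens_per_second` is hypothetical and not part of vLLM.

```python
def tokens_per_second(tokens: int, seconds: float) -> float:
    """Return the generation rate for one request, in tokens per second."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return tokens / seconds


# Assumption: 500 completion tokens, 19 s wall-clock time (as reported above).
completion_tokens = 500
latency_s = 19.0

rate = tokens_per_second(completion_tokens, latency_s)
print(f"{rate:.1f} tokens/s")  # prints "26.3 tokens/s"
```

Under that assumption, both GPUs come out at the same rate, which is the gap (or lack of one) being asked about.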

Are there any settings to optimize performance on H100?

Labels: performance, stale
