Optimum-NVIDIA #279

Open
@mshannon-sil

Description

https://huggingface.co/blog/optimum-nvidia

They're seeing up to 28x faster inference when using Optimum-NVIDIA. We should also check whether it can be used in tandem with BetterTransformer.
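
To check the speedup on our own models rather than relying on the blog's numbers, a small timing harness might be enough. This is only a rough sketch: `measure_throughput` and `speedup` are hypothetical helpers (not part of Optimum-NVIDIA, BetterTransformer, or transformers), and the `generate` callable is assumed to wrap whichever backend is under test and to return the generated tokens for one prompt.

```python
import time
from typing import Callable, Sequence


def measure_throughput(
    generate: Callable[[str], Sequence],
    prompts: Sequence[str],
    warmup: int = 1,
) -> float:
    """Return generated tokens per second for a backend-specific callable.

    `generate` is a hypothetical wrapper: it takes one prompt and returns
    the sequence of generated tokens for that prompt.
    """
    # Warm-up calls are not timed, so one-off setup cost is excluded.
    for prompt in prompts[:warmup]:
        generate(prompt)

    start = time.perf_counter()
    total_tokens = sum(len(generate(prompt)) for prompt in prompts)
    elapsed = time.perf_counter() - start
    return total_tokens / elapsed if elapsed > 0 else float("inf")


def speedup(baseline_tps: float, candidate_tps: float) -> float:
    """Ratio of candidate throughput to baseline throughput."""
    return candidate_tps / baseline_tps
```

Running the same prompts through a stock transformers wrapper, an Optimum-NVIDIA wrapper, and (if they turn out to be compatible) a combined one would give directly comparable tokens-per-second figures.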

Metadata

Assignees

No one assigned

    Labels

    optimization: Model training/inferencing optimization

    Type

    No type

    Projects

    Status

    📋 Backlog

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests
