Integrate TransformerEngine #1098

@Quentin-Anthony

Description

TransformerEngine is needed for fp8 training. It also adds some nice fp16/bf16 optimizations for Ampere and newer architectures that we can make use of regardless of whether we train in fp8.

https://github.com/EleutherAI/TransformerEngine
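
To make the hardware split concrete, here is a rough sketch of which of the two benefits a given GPU could pick up. The helper name `te_features` and the constants are hypothetical, and the thresholds are assumptions not stated in this issue: Ampere is taken as CUDA compute capability 8.0, and fp8 is assumed to need 8.9 (Ada) or 9.0 (Hopper) and above.

```python
from typing import Dict, Tuple

# Assumed thresholds (not from the issue text):
# Ampere starts at compute capability 8.0; fp8 tensor cores at 8.9 (Ada) / 9.0 (Hopper).
AMPERE_MIN: Tuple[int, int] = (8, 0)
FP8_MIN: Tuple[int, int] = (8, 9)


def te_features(capability: Tuple[int, int]) -> Dict[str, bool]:
    """Hypothetical helper: report which benefits apply to a GPU.

    `capability` is a (major, minor) CUDA compute capability tuple.
    Tuples compare element by element, so (9, 0) >= (8, 9) is True.
    """
    return {
        "fp16_bf16_optimizations": capability >= AMPERE_MIN,
        "fp8": capability >= FP8_MIN,
    }


if __name__ == "__main__":
    for name, cap in [("V100", (7, 0)), ("A100", (8, 0)), ("H100", (9, 0))]:
        print(name, te_features(cap))
```

On a real machine the tuple could come from `torch.cuda.get_device_capability()`. Under these assumptions an A100 would get the fp16/bf16 path only, which is why the integration seems worth doing even before fp8-capable hardware is available.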
