Skip to content
This repository was archived by the owner on Aug 7, 2024. It is now read-only.
This repository was archived by the owner on Aug 7, 2024. It is now read-only.

[Perf] Explore more performant Fp8 Casting #83

Closed
@drisspg

Description

@drisspg

Summary

There are two components to this, non_saturated casting and saturated casting.

Non-Saturated casting

  • We are currently using bit logic to cast from fp32 to fp8 where as there exists intrinsics to perform the same, see Nikitas comment below.
  • Currently for fp16 -> fp8 casting we actually first rescaled fp16 to fp32 and then recast to fp8.

Saturated Casting

Metadata

Metadata

Assignees

No one assigned

    Labels

    PerfIssues related to perf optimizations

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions