Skip to content

Commit

Permalink
[ROCm][Quantization][Kernel] Use FP8 FNUZ when OCP flag is 0 or undef…
Browse files Browse the repository at this point in the history
…ined (vllm-project#13851)

Signed-off-by: Hollow Man <hollowman@opensuse.org>
  • Loading branch information
HollowMan6 authored Feb 27, 2025
1 parent f959039 commit a31614e
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions csrc/quantization/fp8/amd/quant_utils.cuh
Original file line number Diff line number Diff line change
Expand Up @@ -24,12 +24,12 @@ __inline__ __device__ Tout scaled_vec_conversion(const Tin& x,
return x;
}

#if HIP_FP8_TYPE_FNUZ
using fp8_type = __hip_fp8_e4m3_fnuz;
using fp8x2_type = __hip_fp8x2_e4m3_fnuz;
#elif HIP_FP8_TYPE_OCP
#if HIP_FP8_TYPE_OCP
using fp8_type = __hip_fp8_e4m3;
using fp8x2_type = __hip_fp8x2_e4m3;
#else
using fp8_type = __hip_fp8_e4m3_fnuz;
using fp8x2_type = __hip_fp8x2_e4m3_fnuz;
#endif

// fp8 -> half
Expand Down

0 comments on commit a31614e

Please sign in to comment.