Skip to content

Commit

Permalink
[Bugfix] Fix CUDA version check for mma warning suppression (vllm-pro…
Browse files Browse the repository at this point in the history
  • Loading branch information
tlrmchlsmth authored and jimpang committed Jul 24, 2024
1 parent 74e6232 commit eba0837
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion csrc/quantization/marlin/sparse/common/mma.h
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@

#pragma once
#include "base.h"
#include <cudaTypedefs.h>

namespace marlin_24 {

Expand All @@ -26,7 +27,7 @@ namespace marlin_24 {
// | Advisory: Modifier ‘.sp::ordered_metadata’ should be used on instruction
// | ‘mma’ instead of modifier ‘.sp’ as it is expected to have substantially
// | reduced performance on some future architectures
#if defined CUDA_VERSION && CUDA_VERSION >= 12500
#if defined CUDA_VERSION && CUDA_VERSION >= 12050
#define MMA_SP_INST \
"mma.sp::ordered_metadata.sync.aligned.m16n8k32.row.col.f32.f16.f16.f32 "
#else
Expand Down

0 comments on commit eba0837

Please sign in to comment.