Currently, `GgmlDType` only supports F16 and not BF16. This PR introduces support for the BF16 type. I would appreciate a check that this looks good! I have tested successfully on my machine, which has `avx` and `f16c`, and the CUDA tests also pass even though no changes were necessary there.

I also noted a potentially confusing situation, though: if a BF16 tensor is part of a `QMatMul` (and likewise for all other types in `QStorage` not supported for quantized matmul), we should perhaps dequantize and then perform the matmul using cuBLAS? This modification could be made in `QStorage::fwd`, perhaps.
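To illustrate the fallback I have in mind, here is a minimal, self-contained Rust sketch. It does not use candle's actual API; the `QTensor`, `dequantize`, and `qmatmul_fallback` names are hypothetical, and a naive f32 matmul stands in for the cuBLAS call. The idea is simply: when a dtype has no fused quantized-matmul kernel, dequantize the weights first and run an ordinary matmul.

```rust
// Hypothetical sketch (not candle's real API): dequantize-then-matmul
// fallback for dtypes without a fused quantized-matmul kernel.

// Toy symmetric 8-bit quantization: value ≈ scale * q.
struct QTensor {
    scale: f32,
    data: Vec<i8>,
    rows: usize,
    cols: usize,
}

impl QTensor {
    // Recover f32 values from the quantized representation.
    fn dequantize(&self) -> Vec<f32> {
        self.data.iter().map(|&q| q as f32 * self.scale).collect()
    }
}

// Plain f32 matmul: (m x k) * (k x n) -> (m x n).
// In the real fallback this would be a cuBLAS GEMM call.
fn matmul(a: &[f32], b: &[f32], m: usize, k: usize, n: usize) -> Vec<f32> {
    let mut out = vec![0.0f32; m * n];
    for i in 0..m {
        for j in 0..n {
            let mut acc = 0.0f32;
            for p in 0..k {
                acc += a[i * k + p] * b[p * n + j];
            }
            out[i * n + j] = acc;
        }
    }
    out
}

// Fallback path: dequantize the weights, then do a regular matmul.
fn qmatmul_fallback(x: &[f32], w: &QTensor, m: usize) -> Vec<f32> {
    let w_f32 = w.dequantize();
    // x is (m x w.rows), w dequantizes to (w.rows x w.cols).
    matmul(x, &w_f32, m, w.rows, w.cols)
}

fn main() {
    // w dequantizes to [[1.0, 2.0], [3.0, 4.0]].
    let w = QTensor { scale: 0.5, data: vec![2, 4, 6, 8], rows: 2, cols: 2 };
    let x = vec![1.0f32, 0.0, 0.0, 1.0]; // 2x2 identity
    let y = qmatmul_fallback(&x, &w, 2);
    println!("{:?}", y); // [1.0, 2.0, 3.0, 4.0]
}
```

The trade-off is that the fallback materializes the full f32 weight matrix and loses the memory/bandwidth benefit of the fused kernel, but it keeps `QMatMul` functional for every dtype rather than erroring out.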