ggml : make ggml_fp16_t private #720

ggerganov · 2024-01-31T20:10:47Z

Currently the ggml_fp16_t typedef is exposed in the public API:

Lines 317 to 332 in 6b14d73

    
           #if defined(__ARM_NEON) && defined(__CUDACC__) 
        
               typedef half ggml_fp16_t; 
        
           #elif defined(__ARM_NEON) && !defined(_MSC_VER) 
        
               typedef __fp16 ggml_fp16_t; 
        
           #else 
        
               typedef uint16_t ggml_fp16_t; 
        
           #endif 
        
               // convert FP16 <-> FP32 
        
               GGML_API float       ggml_fp16_to_fp32(ggml_fp16_t x); 
        
               GGML_API ggml_fp16_t ggml_fp32_to_fp16(float x); 
        
               GGML_API void ggml_fp16_to_fp32_row(const ggml_fp16_t * x, float * y, int n); 
        
               GGML_API void ggml_fp32_to_fp16_row(const float * x, ggml_fp16_t * y, int n);

Since this type is platform specific, it would make sense to hide it by moving it in ggml-impl.h.
We will still expose an API for F16 <-> F32 conversions, but it sill operate on void * instead of ggml_fp16_t

The text was updated successfully, but these errors were encountered:

slaren · 2024-01-31T20:16:40Z

We could probably define it to uint16_t always, and only cast it to __fp16 in the ARM code that can take advantage of that. The CUDA one is weird, it should only be used when compiling ggml-cuda.cu, which never uses this type anyway.

ggerganov · 2024-01-31T20:31:38Z

Ah yes, doing so would make this change much simpler

ggerganov added the refactoring Refactoring label Jan 31, 2024

ggerganov self-assigned this Feb 22, 2024

ggerganov mentioned this issue Feb 22, 2024

ggml : always define ggml_fp16_t as uint16_t ggerganov/llama.cpp#5666

Merged

ggerganov closed this as completed Feb 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml : make ggml_fp16_t private #720

ggml : make ggml_fp16_t private #720

ggerganov commented Jan 31, 2024

slaren commented Jan 31, 2024

ggerganov commented Jan 31, 2024

ggml : make ggml_fp16_t private #720

ggml : make ggml_fp16_t private #720

Comments

ggerganov commented Jan 31, 2024

slaren commented Jan 31, 2024

ggerganov commented Jan 31, 2024