ggml : add ggml_gelu_erf() #13667

ngxson · 2025-05-20T16:48:03Z

I couldn't add it as a param because ggml_gelu is a unary op, and I'm not sure how to add a new param to a unary op. (Please tell me if I should still add it as a param instead.)

Despite the name "not approximated", I'm actually using an approximation (more accurate than tanh) in the Metal impl, because Metal has no built-in erf() function. The GNU impl of erf() also seems to be based on a complicated approximation, so I think all systems now use some sort of approximation; it's just more complicated than tanh, so it gives a better result.

So, maybe ggml_gelu_na is not the best name.

I haven't tested this with ultravox, will report back the result a bit later.

ggerganov · 2025-05-20T16:57:31Z

> So, maybe ggml_gelu_na is not the best name.

ggml_gelu_erf()?

ngxson · 2025-05-20T18:20:13Z

I retested ultravox and it now gives the correct result on both Metal and CPU.

Also, just to note: the original whisper impl from OpenAI seems to use this type of GELU (with approximation = none by default), so this change will probably further increase the precision of whisper.cpp. We just need to change all the gelu calls to gelu_erf (including the one in the conv block). cc @danbev if you are interested in doing a test.

ngxson · 2025-05-20T18:22:38Z

Here is the GGML output (conv + transformer + post layer norm):

after output norm.shape = [1280, 512]
after output norm.data: [
     [
      [ -0.1812,  -0.4413,  -0.0398, ...,  -0.3310,   0.0529,  -0.4211],
      [ -0.2426,  -0.7433,   0.0328, ...,  -0.1948,  -0.2125,  -0.3387],
      [ -0.1757,  -0.6223,  -0.1927, ...,  -0.3855,  -0.0607,  -0.3789],
      ..., 
      [ -0.2057,   0.0134,  -0.1241, ...,   0.0104,   0.3419,  -0.1225],
      [ -0.1576,  -0.3597,  -0.0085, ...,   0.0309,  -0.0150,  -0.2440],
      [ -0.1471,  -0.1170,  -0.1523, ...,   0.1099,   0.1881,  -0.2656],
     ],
    ]
after output norm sum.shape = [1]
after output norm sum.data: [
     [
      [-478.1041],
     ],
    ]

And the Python output:

tensor([[[-0.1812, -0.4412, -0.0399,  ..., -0.3313,  0.0527, -0.4211],
         [-0.2425, -0.7431,  0.0330,  ..., -0.1949, -0.2127, -0.3389],
         [-0.1757, -0.6225, -0.1928,  ..., -0.3854, -0.0605, -0.3790],
         ...,
         [-0.2057,  0.0133, -0.1241,  ...,  0.0102,  0.3418, -0.1227],
         [-0.1576, -0.3597, -0.0084,  ...,  0.0308, -0.0150, -0.2442],
         [-0.1471, -0.1172, -0.1523,  ...,  0.1097,  0.1881, -0.2657]]]) tensor(-477.8432)

ggml/include/ggml.h

ggml/src/ggml-cpu/ggml-cpu.c

* ggml : add ggml_gelu_na (not approximated)
* fix naming order
* rename na --> erf
* apply review suggesions
* revert naming order

ggml : add ggml_gelu_na (not approximated)

c78210a

ngxson requested a review from ggerganov May 20, 2025 16:48

github-actions bot added the ggml and Apple Metal labels May 20, 2025

fix naming order

bddf57b

rename na --> erf

65730bc

ngxson changed the title ~~ggml : add ggml_gelu_na (not approximated)~~ ggml : add ggml_gelu_erf() May 20, 2025

ngxson mentioned this pull request May 20, 2025

mtmd : add ultravox audio input #13623

Merged

ggerganov approved these changes May 21, 2025


ggml/include/ggml.h (outdated, resolved)

ggml/src/ggml-cpu/ggml-cpu.c (outdated, resolved)

ngxson added 2 commits May 21, 2025 11:05

apply review suggesions

e3b7b98

revert naming order

f59a199

ngxson merged commit cf4cb59 into ggml-org:master May 21, 2025
46 checks passed

infil00p pushed a commit to baseweight/llama.cpp that referenced this pull request May 22, 2025

ggml : add ggml_gelu_erf() (ggml-org#13667)

d6406e4

* ggml : add ggml_gelu_na (not approximated)
* fix naming order
* rename na --> erf
* apply review suggesions
* revert naming order

This was referenced May 23, 2025

ggml : fix the order of ggml_unary_op #13718

Merged

ggml: fix GGML_UNARY_OP_NAME order to align with enum ggml_unary_op #13717

Closed

ggml : add ggml_gelu_erf() CUDA kernel #13719

Merged
