mtmd : fix glm-edge redundant token count #13139

ngxson · 2025-04-27T17:40:58Z

libmtmd need to exclude the 2 redundant BOI/EOI embeddings as they are now processed as text token.

For unknown reason, the provided jinja chat template (without [gMASK]<sop>) doesn't work. Using GLM4 template with [gMASK]<sop>, the model does response normally.

mtmd : fix glm-edge redundant token count

b33c312

ngxson requested a review from ggerganov April 27, 2025 17:40

github-actions bot added the examples label Apr 27, 2025

ngxson added 4 commits April 27, 2025 22:01

Merge branch 'master' into xsn/mtmd_rm_glm_eoi_boi

971aa25

fix chat template

4d01373

temporary disable GLMEdge test chat tmpl

ed3a496

Merge branch 'master' into xsn/mtmd_rm_glm_eoi_boi

cdcf561

ggerganov approved these changes Apr 28, 2025

View reviewed changes

github-actions bot added the testing Everything test related label Apr 28, 2025

ngxson merged commit 4e87962 into ggml-org:master Apr 28, 2025
48 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

mtmd : fix glm-edge redundant token count #13139

mtmd : fix glm-edge redundant token count #13139

Uh oh!

ngxson commented Apr 27, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

mtmd : fix glm-edge redundant token count #13139

mtmd : fix glm-edge redundant token count #13139

Uh oh!

Conversation

ngxson commented Apr 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ngxson commented Apr 27, 2025 •

edited

Loading