Commit d86c30c
fix: Use correct GGUF metadata keys for softcapping
The GGUF metadata stores softcapping as:
- gemma2.attn_logit_softcapping
- gemma2.final_logit_softcapping
After stripping the architecture prefix, the mapping keys should be
'attn_logit_softcapping' and 'final_logit_softcapping', not
'attention.logit_softcapping' which doesn't exist in GGUF metadata.
Also adds the final_logit_softcapping mapping, which was missing.
Signed-off-by: Christina <truffle@gmail.com>
1 parent cecafdd commit d86c30c
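The key mismatch described in the commit message can be illustrated with a minimal, self-contained sketch. It is not the code changed by this commit: the helper `map_gguf_metadata`, both mapping dictionaries, and the metadata values are hypothetical stand-ins chosen for illustration. Only the GGUF key names and the stripped key names come from the message above.

```python
# Hypothetical sketch of prefix stripping followed by key mapping.
# None of these names are taken from the changed files.

# Mapping as described before the fix: the stripped key never matches.
OLD_KEY_MAP = {
    "attention.logit_softcapping": "attn_logit_softcapping",
}

# Mapping as described after the fix: both softcapping keys are covered.
NEW_KEY_MAP = {
    "attn_logit_softcapping": "attn_logit_softcapping",
    "final_logit_softcapping": "final_logit_softcapping",
}


def map_gguf_metadata(metadata, architecture, key_map):
    """Strip the '<architecture>.' prefix and keep only keys found in key_map."""
    prefix = architecture + "."
    config = {}
    for key, value in metadata.items():
        if not key.startswith(prefix):
            continue  # not an architecture-scoped key
        stripped = key[len(prefix):]
        if stripped in key_map:
            config[key_map[stripped]] = value
    return config


# Illustrative metadata; the numeric values are assumptions, not read from a file.
example_metadata = {
    "gemma2.attn_logit_softcapping": 50.0,
    "gemma2.final_logit_softcapping": 30.0,
    "gemma2.block_count": 42,
}

old_result = map_gguf_metadata(example_metadata, "gemma2", OLD_KEY_MAP)
new_result = map_gguf_metadata(example_metadata, "gemma2", NEW_KEY_MAP)
print(old_result)
print(new_result)
```

With the old mapping the lookup finds nothing, so both softcapping values would be silently dropped. With the corrected keys both values are carried through.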
2 files changed, +13 -7 lines changed
- src/transformers/integrations
- tests/quantization/ggml