
[Feat]: Add support for meta llama hf model conversion #286

Merged
merged 1 commit into from
Aug 14, 2023

Conversation

Nick-infinity
Contributor

Description:
Llama 2 HF models have their weights stored under different names.

Signed-off-by: Nikhil Gupta <nikhilg.me@gmail.com>
@Nick-infinity
Contributor Author

Just parking it here for others. A merge is not required.

@karpathy
Owner

I'll take it because I expect a lot of finetunes will be in HF format. Ty.

@karpathy karpathy merged commit 013e012 into karpathy:master Aug 14, 2023
@mukel
Contributor

mukel commented Aug 15, 2023

The HF models apply a permutation to wq and wk, so the RoPE is different from the original Llama 2 models. The permutation must be undone when exporting from HF to llama2.c's simple .bin files.
It took me quite some time to figure this out; the symptom is that the model starts degenerating and printing gibberish after ~20 or more tokens.

See https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/convert_llama_weights_to_hf.py#L113-L115
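For illustration, here is a minimal NumPy sketch of the head-interleaving permutation described above and its inverse. The `permute` shape logic mirrors the helper in the linked `convert_llama_weights_to_hf.py` (which uses torch `view`/`transpose`); the `unpermute` function and the toy dimensions are assumptions added here to demonstrate the round trip, not code from either repository.

```python
import numpy as np

def permute(w, n_heads):
    """HF-style permutation of an attention projection matrix (wq or wk).

    For each head, the rows are regrouped so that the two rotary halves
    of the head dimension are interleaved the way HF's RoPE expects.
    """
    dim1, dim2 = w.shape
    return (w.reshape(n_heads, dim1 // n_heads // 2, 2, dim2)
             .transpose(0, 2, 1, 3)
             .reshape(dim1, dim2))

def unpermute(w, n_heads):
    """Inverse permutation: recovers the original Llama 2 row layout
    from an HF checkpoint's wq/wk, as needed when exporting to llama2.c."""
    dim1, dim2 = w.shape
    return (w.reshape(n_heads, 2, dim1 // n_heads // 2, dim2)
             .transpose(0, 2, 1, 3)
             .reshape(dim1, dim2))

# Round trip: unpermute(permute(w)) restores w exactly.
w = np.arange(8 * 8, dtype=np.float32).reshape(8, 8)  # toy 2-head example
assert np.array_equal(unpermute(permute(w, n_heads=2), n_heads=2), w)
```

Applying `permute` (instead of `unpermute`) to HF wq/wk weights during export would leave RoPE misaligned, which matches the gibberish symptom described above.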

@Nick-infinity
Contributor Author

Nick-infinity commented Aug 15, 2023 via email

@karpathy
Owner

... PR fix welcome

@calvintwr

calvintwr commented Aug 17, 2023

I don't think this is working. #314 @Nick-infinity @karpathy

@atamurad
Contributor

The HF models apply a permutation to wq and wk, so the RoPE is different from the original Llama 2 models. The permutation must be undone when exporting from HF to llama2.c's simple .bin files. It took me quite some time to figure this out; the symptom is that the model starts degenerating and printing gibberish after ~20 or more tokens.

See https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/convert_llama_weights_to_hf.py#L113-L115

@mukel thanks for tracking this down, fixed it here: #326

vinhtran2611 pushed a commit to vinhtran2611/llama2.c that referenced this pull request Jan 20, 2024
[Feat]: Add support for meta llama hf model conversion