Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Lora] Cannot convert baichuan2 qlora to GGUF #3519

Closed
lizhiling12345 opened this issue Oct 7, 2023 · 6 comments
Closed

[Lora] Cannot convert baichuan2 qlora to GGUF #3519

lizhiling12345 opened this issue Oct 7, 2023 · 6 comments
Labels

Comments

@lizhiling12345
Copy link

Error: unrecognized sub-layer self_attn.W_pack in tensor base_model.model.model.layers.0.self_attn.W_pack.lora_A.weight

@slaren
Copy link
Collaborator

slaren commented Oct 7, 2023

Can you test #3333?

@lizhiling12345
Copy link
Author

Can you test #3333?

i use llama.cpp-lora-falcon,still error,
python convert-lora-to-ggml.py /adapter_model baichuan
Error: could not map tensor name base_model.model.model.layers.0.self_attn.W_pack.lora_A.weight
Note: the arch parameter must be specified if the model is not llama
image

@lizhiling12345
Copy link
Author

Can you test #3333?

model is baichuan2,not baicun,can you teach me,how to fit baichuan2
image
image

@slaren
Copy link
Collaborator

slaren commented Oct 8, 2023

Looks like supporting baichuan loras is going to be a bit tricky than that, because the attention weights need to be unpacked. Ie. the W_pack needs to be unpacked into attn_q, attn_k and attn_v, like convert-baichuan-hf-to-gguf.py does. I don't really know how that works, so I can't help here, sorry.

@lizhiling12345
Copy link
Author

Looks like supporting baichuan loras is going to be a bit tricky than that, because the attention weights need to be unpacked. Ie. the W_pack needs to be unpacked into attn_q, attn_k and attn_v, like convert-baichuan-hf-to-gguf.py does. I don't really know how that works, so I can't help here, sorry.

Thanks for your reply

@slaren slaren mentioned this issue Nov 18, 2023
12 tasks
@github-actions github-actions bot added the stale label Mar 19, 2024
Copy link
Contributor

github-actions bot commented Apr 4, 2024

This issue was closed because it has been inactive for 14 days since being marked as stale.

@github-actions github-actions bot closed this as completed Apr 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants