[Question] Reason behind removing lm_head in modules #13

Open
NanoCode012 opened this issue May 25, 2023 · 4 comments

NanoCode012 commented May 25, 2023

Hello,

Thank you for the amazing repo. I was curious about this code below.

qlora/qlora.py

Lines 221 to 222 in e381744

if 'lm_head' in lora_module_names: # needed for 16-bit
    lora_module_names.remove('lm_head')

Why is lm_head removed? What does the comment "needed for 16-bit" mean? Does it mean that targeting this module when training in fp16 (or similar) is incorrect?
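For context, here is a rough sketch (paraphrased, not the exact body at the pinned commit) of how a qlora-style module-name collector ends up with lm_head only in the 16-bit case: the 4-/8-bit paths match only the bitsandbytes quantized Linear classes, which lm_head is not, while a full 16-bit run matches every plain nn.Linear, including lm_head.

```python
import bitsandbytes as bnb
import torch

def find_linear_module_names(model, bits):
    # Collect the short names of all Linear layers so they can be used
    # as LoRA target_modules (sketch of the qlora.py pattern).
    if bits == 4:
        cls = bnb.nn.Linear4bit     # only quantized layers match; lm_head does not
    elif bits == 8:
        cls = bnb.nn.Linear8bitLt
    else:
        cls = torch.nn.Linear       # 16-bit run: every Linear matches, lm_head included
    names = set()
    for name, module in model.named_modules():
        if isinstance(module, cls):
            names.add(name.split('.')[-1])
    # The removal is therefore only "needed for 16-bit": that is the only
    # case where lm_head lands in the set at all.
    names.discard('lm_head')
    return list(names)
```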

@mallorbc

Had the same thought. Have you figured it out? I didn't see anything in the paper either. If you want to add new tokens, you need to target the lm_head anyways.
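
To make the new-token case concrete, here is a hedged sketch (model id and hyperparameters are placeholders, not taken from this repo): resizing the vocabulary re-initializes the new rows of both the input embedding and lm_head, and one common way to train them with PEFT is modules_to_save rather than listing lm_head in target_modules.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_id = "huggyllama/llama-7b"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Adding tokens grows both embed_tokens and lm_head; the new rows are
# randomly initialized and only learn if those modules are trained.
tokenizer.add_tokens(["<my_new_token>"])
model.resize_token_embeddings(len(tokenizer))

config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    # Fully train the resized layers instead of adapting them with LoRA.
    modules_to_save=["embed_tokens", "lm_head"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()
```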

@mallorbc

@artidoro @TimDettmers some insight on this would be greatly appreciated.

@anhdungitvn

Could someone share any observations or evaluations regarding the use of 'lm_head' as a target module?
Thanks!

@dandingsky

dandingsky commented Dec 9, 2023

Not sure if this is related to this issue, but I found that when applying LoRA to Llama-2 with target_modules=['lm_head', 'q_proj', 'v_proj'] in LoraConfig, the LoRA adapter on lm_head is removed when the model is distributed across GPUs. I was trying to apply LoRA to both embed_tokens and lm_head, but PEFT or DeepSpeed seems to forbid LoRA on these two modules without any explicit warning.
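
For reference, a minimal sketch of the setup described above (model id and LoRA ranks are placeholders); listing the lora_* parameters attached to lm_head before and after the model is sharded across GPUs (e.g. by DeepSpeed) is one way to check whether the adapter is silently dropped.

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")

config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["lm_head", "q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)

# Print every LoRA parameter attached to lm_head; run this again after
# the model has been wrapped/sharded by the distributed engine to see
# whether these parameters are still present.
for name, _ in model.named_parameters():
    if "lm_head" in name and "lora_" in name:
        print(name)
```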
