Skip to content

SDXL seems to not train self_attn layers in Text Encoders #1952

@ghost

Description

Hello, I noticed that the recent version only trains MLP in Text Encoders, whereas existing LoRAs or LoRAs trained with GUI version of kohya-ss (that uses older version) seem to train all layers. Is it a mistake on my side? I couldn't find any option to control it.

This is what usually gets trained:

lora_te1_text_model_encoder_layers_0_mlp_fc1.alpha
lora_te1_text_model_encoder_layers_0_mlp_fc1.lora_down.weight
lora_te1_text_model_encoder_layers_0_mlp_fc1.lora_up.weight
lora_te1_text_model_encoder_layers_0_mlp_fc2.alpha
lora_te1_text_model_encoder_layers_0_mlp_fc2.lora_down.weight
lora_te1_text_model_encoder_layers_0_mlp_fc2.lora_up.weight
lora_te1_text_model_encoder_layers_0_self_attn_k_proj.alpha
lora_te1_text_model_encoder_layers_0_self_attn_k_proj.lora_down.weight
lora_te1_text_model_encoder_layers_0_self_attn_k_proj.lora_up.weight
lora_te1_text_model_encoder_layers_0_self_attn_out_proj.alpha
lora_te1_text_model_encoder_layers_0_self_attn_out_proj.lora_down.weight
lora_te1_text_model_encoder_layers_0_self_attn_out_proj.lora_up.weight
lora_te1_text_model_encoder_layers_0_self_attn_q_proj.alpha
lora_te1_text_model_encoder_layers_0_self_attn_q_proj.lora_down.weight
lora_te1_text_model_encoder_layers_0_self_attn_q_proj.lora_up.weight
lora_te1_text_model_encoder_layers_0_self_attn_v_proj.alpha
lora_te1_text_model_encoder_layers_0_self_attn_v_proj.lora_down.weight
lora_te1_text_model_encoder_layers_0_self_attn_v_proj.lora_up.weight

This is what I see in my attempts to use newest version of sdxl_train_network.py:

lora_te1_text_model_encoder_layers_0_mlp_fc1.alpha
lora_te1_text_model_encoder_layers_0_mlp_fc1.lora_down.weight
lora_te1_text_model_encoder_layers_0_mlp_fc1.lora_up.weight
lora_te1_text_model_encoder_layers_0_mlp_fc2.alpha
lora_te1_text_model_encoder_layers_0_mlp_fc2.lora_down.weight
lora_te1_text_model_encoder_layers_0_mlp_fc2.lora_up.weight

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions