Skip to content

TOML questions - validation loss and DoRA #2038

@Kahldris

Description

@Kahldris

1. Validation Dataset Setup (sd3 Branch Method):

  • To set up a single folder of validation images with the Validation loss  #1864 updates, is configuring a [[datasets]] block with validation_split = 1.0 the correct approach? (I saw mentions of an older is_validation = true flag elsewhere, so wanted to confirm the intended method for this branch/feature).
  • Should the validation dataset use the same batch_size as the main training dataset?
  • I'm using the sd3 branch because of the validation loss, does it still work fine to train SDXL?
  • Do captions have any effect or purpose for images within the validation dataset block when calculating validation loss?

Here's my current dataset.toml structure:

# --- Training Dataset Definition ---
[[datasets]]
  batch_size = 4
  resolution = [1024, 1024]
  enable_bucket = true
  min_bucket_reso = 512
  max_bucket_reso = 2048
  bucket_reso_steps = 64
  bucket_no_upscale = true
  [[datasets.subsets]]
    image_dir = "path"
    caption_extension = ".txt"
    num_repeats = 10
    shuffle_caption = true
    keep_tokens = 1
    flip_aug = true
    random_crop = false


# --- Validation Dataset Definition ---
[[datasets]]
  validation_split = 1.0
  batch_size = 4
  resolution = [1024, 1024]
  enable_bucket = true
  min_bucket_reso = 512
  max_bucket_reso = 2048
  bucket_reso_steps = 64
  bucket_no_upscale = true
  [[datasets.subsets]]
    image_dir = "val_path"
    caption_extension = ".txt"

2. Correct DoRA Implementation:

  • I've seen examples using LyCORIS (network_module="lycoris.kohya", network_args=["algo=dora", ...]) but encountered a KeyError: 'dora' when trying that approach after installing lycoris-lora.
  • Is using the standard LoRA module with the use_dora=True argument, like below, the correct and currently intended way to enable DoRA within the sd3 branch?
# From config.toml
network_module = "networks.lora"
network_args = ["use_dora=True"]
# (other network_dim, network_alpha settings...)

Thanks for any clarification!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions