Update to sd-script latest update

Vitalya-79 · Mar 9, 2023 · 2deddd5 · 2deddd5
1 parent 2eb7b3b
commit 2deddd5
Show file tree

Hide file tree

Showing 20 changed files with 1,917 additions and 2,201 deletions.
diff --git a/README-ja.md b/README-ja.md
@@ -16,9 +16,10 @@ GUIやPowerShellスクリプトなど、より使いやすくする機能が[bma
 
 当リポジトリ内およびnote.comに記事がありますのでそちらをご覧ください（将来的にはすべてこちらへ移すかもしれません）。
 
+* [学習について、共通編](./train_README-ja.md) : データ整備やオプションなど
+    * [データセット設定](./config_README-ja.md)
 * [DreamBoothの学習について](./train_db_README-ja.md)
 * [fine-tuningのガイド](./fine_tune_README_ja.md):
-BLIPによるキャプショニングと、DeepDanbooruまたはWD14 taggerによるタグ付けを含みます
 * [LoRAの学習について](./train_network_README-ja.md)
 * [Textual Inversionの学習について](./train_ti_README-ja.md)
 * note.com [画像生成スクリプト](https://note.com/kohya_ss/n/n2693183a798e)
@@ -131,6 +132,8 @@ pip install --use-pep517 --upgrade -r requirements.txt
 
 LoRAの実装は[cloneofsimo氏のリポジトリ](https://github.com/cloneofsimo/lora)を基にしたものです。感謝申し上げます。
 
+Conv2d 3x3への拡大は [cloneofsimo氏](https://github.com/cloneofsimo/lora) が最初にリリースし、KohakuBlueleaf氏が [LoCon](https://github.com/KohakuBlueleaf/LoCon) でその有効性を明らかにしたものです。KohakuBlueleaf氏に深く感謝します。
+
 ## ライセンス
 
 スクリプトのライセンスはASL 2.0ですが（Diffusersおよびcloneofsimo氏のリポジトリ由来のものも同様）、一部他のライセンスのコードを含みます。

diff --git a/README.md b/README.md
@@ -176,13 +176,25 @@ This will store your a backup file with your current locally installed pip packa
 
 ## Change History
 
-* 2023/03/05 (v21.2.0):
+* 2023/03/09 (v21.2.0):
     - Fix issue https://github.com/bmaltais/kohya_ss/issues/335
     - Add option to print LoRA trainer command without executing it
     - Add support for samples during trainin via a new `Sample images config` accordion in the `Training parameters` tab.
     - Added new `Additional parameters` under the `Advanced Configuration` section of the `Training parameters` tab to allow for the specifications of parameters not handles by the GUI.
     - Added support for sample as a new Accordion under the `Training parameters` tab. More info about the prompt options can be found here: https://github.com/kohya-ss/sd-scripts/issues/256#issuecomment-1455005709
-    - There may be problems due to major changes. If you cannot revert back to a previous version when problems occur (`git checkout <release name>`).
+    - There may be problems due to major changes. If you cannot revert back to the previous version when problems occur, please do not update for a while.
+    - Minimum metadata (module name, dim, alpha and network_args) is recorded even with `--no_metadata`, issue https://github.com/kohya-ss/sd-scripts/issues/254
+    - `train_network.py` supports LoRA for Conv2d-3x3 (extended to conv2d with a kernel size not 1x1).
+        - Same as a current version of [LoCon](https://github.com/KohakuBlueleaf/LoCon). __Thank you very much KohakuBlueleaf for your help!__
+        - LoCon will be enhanced in the future. Compatibility for future versions is not guaranteed.
+        - Specify `--network_args` option like: `--network_args "conv_dim=4" "conv_alpha=1"`
+        - [Additional Networks extension](https://github.com/kohya-ss/sd-webui-additional-networks) version 0.5.0 or later is required to use 'LoRA for Conv2d-3x3' in Stable Diffusion web UI.
+        - __Stable Diffusion web UI built-in LoRA does not support 'LoRA for Conv2d-3x3' now. Consider carefully whether or not to use it.__
+    - Merging/extracting scripts also support LoRA for Conv2d-3x3.
+    - Free CUDA memory after sample generation to reduce VRAM usage, issue https://github.com/kohya-ss/sd-scripts/issues/260 
+    - Empty caption doesn't cause error now, issue https://github.com/kohya-ss/sd-scripts/issues/258
+    - Fix sample generation is crashing in Textual Inversion training when using templates, or if height/width is not divisible by 8.
+    - Update documents (Japanese only).
     - Dependencies are updated, Please [upgrade](#upgrade) the repo.
     - Add detail dataset config feature by extra config file. Thanks to fur0ut0 for this great contribution!
         - Documentation is [here](https://github-com.translate.goog/kohya-ss/sd-scripts/blob/main/config_README-ja.md) (only in Japanese currently.)
@@ -197,6 +209,31 @@ This will store your a backup file with your current locally installed pip packa
     - Add `--tokenizer_cache_dir` to each training and generation scripts to cache Tokenizer locally from Diffusers.
         - Scripts will support offline training/generation after caching.
     - Support letents upscaling for highres. fix, and VAE batch size in `gen_img_diffusers.py` (no documentation yet.)
+
+    - Sample image generation:
+        A prompt file might look like this, for example
+
+        ```
+        # prompt 1
+        masterpiece, best quality, 1girl, in white shirts, upper body, looking at viewer, simple background --n low quality, worst quality, bad anatomy,bad composition, poor, low effort --w 768 --h 768 --d 1 --l 7.5 --s 28
+
+        # prompt 2
+        masterpiece, best quality, 1boy, in business suit, standing at street, looking back --n low quality, worst quality, bad anatomy,bad composition, poor, low effort --w 576 --h 832 --d 2 --l 5.5 --s 40
+        ```
+
+        Lines beginning with `#` are comments. You can specify options for the generated image with options like `--n` after the prompt. The following can be used.
+
+        * `--n` Negative prompt up to the next option.
+        * `--w` Specifies the width of the generated image.
+        * `--h` Specifies the height of the generated image.
+        * `--d` Specifies the seed of the generated image.
+        * `--l` Specifies the CFG scale of the generated image.
+        * `--s` Specifies the number of steps in the generation.
+
+        The prompt weighting such as `( )` and `[ ]` are not working.
+
+    Please read [Releases](https://github.com/kohya-ss/sd-scripts/releases) for recent updates.
+
 * 2023/03/05 (v21.1.5):
     - Add replace underscore with space option to WD14 captioning. Thanks @sALTaccount!
     - Improve how custom preset is set and handles.