Skip to content

Commit 8970fff

Browse files
committed
note down ControlNet related IDs, and "preprocessor" key specifically for those
this is an addition of a specialty non-altering optional key, which is considered a non-breaking change to the document (optional specialty keys are always allowed regardless of whether they are documented or not).
1 parent cfce033 commit 8970fff

File tree

1 file changed

+5
-2
lines changed

1 file changed

+5
-2
lines changed

README.md

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,9 @@ The data within should identify clearly (A) the type and function of a model, cl
66

77
## Version
88

9-
This is specification version 1.0.0.
9+
This is specification version 1.0.1.
10+
11+
Versions are (approximately) SemVer: `(backward-breaking).(forward-breaking).(non-breaking-change)`.
1012

1113
## Technical Placement
1214

@@ -85,6 +87,7 @@ This specification defines 3 categories of key: **MUST**, **SHOULD**, **CAN**
8587
| `prediction_type` | **CAN** | In Stable Diffusion, `v` or `epsilon`. Other model classes may have their own concepts that apply. | `v`, `epsilon` |
8688
| `timestep_range` | **CAN** | If a model is tuned on a sub-section of possible timesteps (Timestep-Expert Models), identify it here, in the format `<min>,<max>`. | `500,999`, `0,499` |
8789
| `encoder_layer` | **CAN** | (Specialty) for "clip skip" in Stable Diffusion models, or similar practice in other models like it, this can be applied where relevant to identify that a non-standard layer of an encoder model should be used (so for example value `2` in an SD model indicates `clip_skip=2` should be used). | `2` |
90+
| `preprocessor` | **CAN** | (Specialty) for "ControlNet" or similar model-adapter types that require preprocessing, this is an indicator of the preprocessing type, as a simple text identifier. Should not identify exact tool (eg "MiDaS"), just the broad type (eg "depth"). | `depth`, `canny` |
8891

8992
#### Text-Prediction Models
9093

@@ -101,7 +104,7 @@ The following is a list of common Architecture ID values, both to serve as a ref
101104

102105
- **Stable Diffusion:** `stable-diffusion-v1`, or change `v1` to any of: `v1`, `v1-inpainting` `v2-512`, `v2-768-v`, `v2-depth`, `v2-inpainting`, `v2-unclip-h`, `v2-unclip-l`, `xl-v1-base`, `xl-v1-refiner`
103106
- **Stable Diffusion Components:** `stable-diffusion-xl-v1/vae` (change `stable-diffusion-xl-v1` to the base model architecture)
104-
- **Stable Diffusion Adapters:** `stable-diffusion-v1/lora`, `stable-diffusion-v1/textual-inversion` (change `stable-diffusion-v1` to the base model architecture)
107+
- **Stable Diffusion Adapters:** `stable-diffusion-v1/lora`, `stable-diffusion-v1/textual-inversion`, `stable-diffusion-v1/controlnet`, `stable-diffusion-v1/control-lora` (change `stable-diffusion-v1` to the base model architecture)
105108
- **Language Models:** `gpt-neo-x`
106109

107110
(Note: relevant project leads for well-known model formats are welcomed to PR additions to this list)

0 commit comments

Comments
 (0)