Image Generation

Dwi Elfianto edited this page Dec 6, 2025 · 3 revisions

SwarmUI provides a powerful web interface for Stable Diffusion image generation backed by ComfyUI, supporting advanced workflows, ControlNet, LoRA, and batch generation.

Overview

Service: genai-swarmui (/srv/compose/genai/swarmui)

SwarmUI is a feature-rich interface for AI image generation using Stable Diffusion models, designed for both beginners and advanced users.

Features

  • Multiple Model Support - SD 1.5, SDXL, SD3, and custom models
  • ComfyUI Backend - Advanced workflow system with nodes
  • ControlNet - Precise control over composition and pose
  • LoRA Support - Fine-tuned model adaptations
  • Batch Generation - Queue multiple images
  • Custom Workflows - Save and share generation pipelines
  • Persistent Storage - Models and outputs preserved
  • GPU Acceleration - Fast generation with NVIDIA GPUs

Requirements

GPU Requirements

Minimum:

  • NVIDIA GPU with 6GB+ VRAM
  • Models: SD 1.5 (512x512)
  • Generation time: 10-30 seconds per image

Recommended:

  • NVIDIA GPU with 12GB+ VRAM (RTX 3080, 4080, 4090)
  • Models: SDXL (1024x1024)
  • Generation time: 5-15 seconds per image
  • Batch generation: Multiple images simultaneously

VRAM Usage by Model:

Model    Resolution   VRAM       Speed (RTX 4090)
SD 1.5   512x512      4-6 GB     3-5 sec
SD 1.5   768x768      6-8 GB     8-12 sec
SDXL     1024x1024    8-10 GB    10-15 sec
SD3      1024x1024    10-12 GB   12-18 sec

Storage Requirements

  • Models: 2-6 GB each (SD 1.5: 2GB, SDXL: 6GB)
  • LoRAs: 10-200 MB each
  • VAEs: 300-800 MB each
  • Outputs: Variable (save what you want)

Recommended: 100GB+ free space for model collection
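
The per-file figures above make it easy to size a collection up front. A minimal sketch; the collection counts are illustrative, only the per-model sizes come from this page (SD 1.5 ~2 GB, SDXL ~6 GB, LoRAs up to ~200 MB):

```shell
# Rough storage estimate for a model collection, using the sizes listed above.
sd15_models=5     # SD 1.5 checkpoints at ~2 GB each
sdxl_models=3     # SDXL checkpoints at ~6 GB each
loras=20          # LoRAs at ~200 MB each (worst case)

total_gb=$(( sd15_models * 2 + sdxl_models * 6 + loras * 200 / 1024 ))
echo "Estimated model storage: ${total_gb} GB"
```

With these counts the estimate is about 31 GB, well within the 100 GB recommendation, but a growing collection fills it quickly.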

Configuration

Environment Variables

File: /srv/compose/genai/swarmui/.env.local

# User/group IDs
UID=1000
GID=1000

# GPU device
GPU_ID=0

# Data directories
DATA_DIR=/srv/appdata
SWARMUI_DATA=/srv/appdata/swarmui

# Domain
TRAEFIK_ACME_DOMAIN=yourdomain.com

# Port (default: 7801)
SWARMUI_PORT=7801
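
A missing key in .env.local is a common cause of startup failures, so it can be worth checking the file before deploying. A minimal sketch that validates the keys listed above; it writes a sample file to a temp path for demonstration — on a live host, point ENV_FILE at the real /srv/compose/genai/swarmui/.env.local instead:

```shell
# Verify that .env.local defines every key SwarmUI expects.
ENV_FILE="$(mktemp)"
cat > "$ENV_FILE" <<'EOF'
UID=1000
GID=1000
GPU_ID=0
DATA_DIR=/srv/appdata
SWARMUI_DATA=/srv/appdata/swarmui
TRAEFIK_ACME_DOMAIN=yourdomain.com
SWARMUI_PORT=7801
EOF

missing=""
for key in UID GID GPU_ID DATA_DIR SWARMUI_DATA TRAEFIK_ACME_DOMAIN SWARMUI_PORT; do
    grep -q "^${key}=" "$ENV_FILE" || missing="$missing $key"
done

if [ -z "$missing" ]; then
    echo "All required keys present"
else
    echo "Missing keys:$missing"
fi
rm -f "$ENV_FILE"
```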

Directory Structure

${DATA_DIR}/
├── models/                 # Stable Diffusion models
│   ├── Stable-diffusion/  # Main models (SD 1.5, SDXL)
│   ├── Lora/              # LoRA files
│   ├── VAE/               # VAE models
│   ├── ControlNet/        # ControlNet models
│   └── embeddings/        # Textual inversions
├── output/                # Generated images
│   └── [YYYY-MM-DD]/     # Organized by date
└── swarmui/
    ├── Home/              # User settings
    ├── Data/              # App data
    ├── Back/              # Backend configs
    ├── Exts/              # Extensions
    ├── Node/              # ComfyUI custom nodes
    └── Work/              # Custom workflows
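
Pre-creating the layout above before first start keeps the container's bind mounts from landing on root-owned auto-created directories. A sketch using a temp directory for demonstration; set DATA_DIR=/srv/appdata on the real host (the swarmui/ subtree is created by the app itself on first run):

```shell
# Create the model and output directory tree shown above.
DATA_DIR="${DATA_DIR:-$(mktemp -d)}"

for d in models/Stable-diffusion models/Lora models/VAE models/ControlNet \
         models/embeddings output swarmui; do
    mkdir -p "$DATA_DIR/$d"
done

ls "$DATA_DIR/models"
```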

Network Configuration

  • Port: 7801 (internal and host)
  • Networks: genai (internal), proxy (Traefik)
  • Access: https://swarmui.${TRAEFIK_ACME_DOMAIN}

Deployment

Start Service

# Ensure GPU is available
nvidia-smi

# Start SwarmUI
sudo composectl start genai-swarmui

# Check status (may take 1-2 minutes to initialize)
composectl status genai-swarmui

# View logs
composectl logs -f genai-swarmui

# Access: https://swarmui.yourdomain.com

First-Time Setup

  1. Navigate to URL: https://swarmui.yourdomain.com
  2. Wait for initialization: Backend downloads on first start (~2-3 minutes)
  3. Configure backend: Select ComfyUI backend
  4. Download models: See Model Management section below

Model Management

Download Models

Built-in Model Browser:

  1. Click "Models" tab in SwarmUI
  2. Browse available models
  3. Click "Download" on desired model
  4. Wait for download (2-6 GB)

Manual Download:

# Download to host
cd ${DATA_DIR}/models/Stable-diffusion

# Example: Realistic Vision (SD 1.5)
wget https://civitai.com/api/download/models/[model-id] -O realistic-vision-v5.safetensors

# Example: SDXL base model
wget https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/resolve/main/sd_xl_base_1.0.safetensors
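
Multi-gigabyte downloads are worth verifying: wget -c resumes an interrupted transfer, and sha256sum -c checks integrity against the hash published on the model's Civitai or Hugging Face page. A sketch using a small local file in place of a real 6 GB download; the published-hash line is the pattern to follow, not a real checksum:

```shell
# On a real download: resume with `wget -c <url>` and then verify with
#   echo "<published-sha256>  sd_xl_base_1.0.safetensors" | sha256sum -c -

# Demonstration of the verification step with a throwaway file:
f=$(mktemp)
printf 'demo model bytes' > "$f"

sum=$(sha256sum "$f" | awk '{print $1}')   # compute the file's SHA-256
echo "$sum  $f" | sha256sum -c -           # prints "<file>: OK" on a match

rm -f "$f"
```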

Popular Starting Models:

SD 1.5 (2GB each):

  • Realistic Vision v5.0
  • DreamShaper 8
  • Deliberate v2

SDXL (6GB each):

  • SDXL Base 1.0
  • SDXL Refiner 1.0
  • Juggernaut XL

Model Sources

  • Civitai (civitai.com) - Community checkpoints, LoRAs, and embeddings
  • Hugging Face (huggingface.co) - Official base models (e.g., Stability AI releases)

LoRA and Extensions

LoRAs (style adaptations):

cd ${DATA_DIR}/models/Lora
# Download LoRA files (.safetensors)
# Apply in SwarmUI with weight 0.5-1.0

ControlNet (composition control):

cd ${DATA_DIR}/models/ControlNet
# Download ControlNet models
# Use for pose, depth, edge guidance

Image Generation Basics

Simple Generation

  1. Select Model: Choose from dropdown (e.g., "realistic-vision-v5")
  2. Write Prompt: Describe desired image
    Positive: "a beautiful sunset over mountains, detailed, 4k"
    Negative: "blurry, low quality, deformed"
    
  3. Set Parameters:
    • Resolution: 512x512 (SD 1.5) or 1024x1024 (SDXL)
    • Steps: 20-30 (more = better quality, slower)
    • CFG Scale: 7-8 (how closely to follow prompt)
    • Seed: -1 (random) or specific number (reproducible)
  4. Click Generate: Wait 5-30 seconds
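
The same parameters can drive generation over SwarmUI's HTTP API instead of the web UI. A hedged sketch: the /API/GetNewSession and /API/GenerateText2Image routes and the JSON field names below are assumptions based on SwarmUI's text-to-image API and may differ by version — check your instance's API documentation before relying on them:

```shell
# Build a text-to-image request matching the UI parameters above.
BASE="https://swarmui.yourdomain.com"   # your instance URL

payload=$(cat <<'EOF'
{
  "prompt": "a beautiful sunset over mountains, detailed, 4k",
  "negativeprompt": "blurry, low quality, deformed",
  "model": "realistic-vision-v5",
  "width": 512,
  "height": 512,
  "steps": 25,
  "cfgscale": 7,
  "seed": -1,
  "images": 1
}
EOF
)
echo "$payload"

# On a live instance (field/route names assumed as noted above):
# session=$(curl -s -X POST "$BASE/API/GetNewSession" -H 'Content-Type: application/json' -d '{}')
# curl -s -X POST "$BASE/API/GenerateText2Image" -H 'Content-Type: application/json' -d "$payload"
```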

Advanced Parameters

Sampling Methods:

  • DPM++ 2M Karras - Good quality, fast
  • Euler a - Creative, varied results
  • DDIM - Stable, predictable

Resolution Guidelines:

  • SD 1.5: 512x512, 512x768, 768x512
  • SDXL: 1024x1024, 1024x1536, 1536x1024
  • Non-standard resolutions may produce artifacts

CFG Scale:

  • 1-5: Loose interpretation, creative
  • 7-8: Balanced (recommended)
  • 10-15: Strict adherence, may over-saturate

Batch Generation

Generate multiple images:

  1. Set "Batch Size": Number of images per generation
  2. Set "Batch Count": How many batches to run
  3. Total images = Batch Size × Batch Count

Example: Batch Size 4, Batch Count 3 = 12 images
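
The arithmetic above as a one-liner, useful when scripting queue sizes:

```shell
# Total images = Batch Size x Batch Count
batch_size=4
batch_count=3
total=$((batch_size * batch_count))
echo "Total images per run: $total"   # 12
```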

Advanced Features

ComfyUI Workflows

Access ComfyUI for node-based workflows:

  1. Click "Advanced" → "ComfyUI"
  2. Build custom workflows with nodes
  3. Save workflows for reuse

Common Workflows:

  • Text-to-Image with refinement
  • Image-to-Image transformation
  • Inpainting and outpainting
  • ControlNet-guided generation

ControlNet

Control composition with reference images:

Use Cases:

  • Pose Control: Match character poses
  • Depth Map: Control 3D structure
  • Canny Edge: Line art guidance
  • Scribble: Rough sketch to image

Setup:

  1. Upload reference image
  2. Select ControlNet model (e.g., canny, depth, pose)
  3. Adjust weight (0.5-1.0)
  4. Generate with guidance

LoRA Application

Apply style modifications:

  1. Select LoRA from list
  2. Set weight (0.5-1.0 typical)
  3. Combine multiple LoRAs (keep total weight < 2.0)

Example:

  • Base: SDXL model
  • LoRA 1: "Anime style" (weight 0.7)
  • LoRA 2: "Detailed faces" (weight 0.5)
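
The "keep total weight < 2.0" rule above is easy to check when stacking LoRAs. A sketch using the example weights (0.7 + 0.5); awk handles the floating-point sum since shell arithmetic is integer-only:

```shell
# Check that combined LoRA weights stay under 2.0.
w1=0.7   # "Anime style"
w2=0.5   # "Detailed faces"

total=$(awk "BEGIN {print $w1 + $w2}")
ok=$(awk "BEGIN {print ($w1 + $w2 < 2.0) ? \"yes\" : \"no\"}")
echo "Combined LoRA weight: $total (under 2.0: $ok)"
```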

Upscaling

Increase resolution of generated images:

  1. Generate base image (512x512)
  2. Click "Upscale" or "Hi-Res Fix"
  3. Select upscale model (ESRGAN, Real-ESRGAN)
  4. Set target resolution (2x, 4x)

Prompt Engineering

Effective Prompts

Structure:

[subject], [style], [details], [quality modifiers]

Example Good Prompt:

Positive:
masterpiece, best quality, highly detailed,
photo of a serene lake at sunset,
mountains in background, reflections on water,
golden hour lighting, 8k uhd, professional photography

Negative:
low quality, blurry, deformed, ugly, bad anatomy,
watermark, signature, text
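
The [subject], [style], [details], [quality] structure above can be assembled programmatically, which is handy when generating prompt variations in a script. A minimal sketch; the component strings are taken from the example prompt:

```shell
# Assemble a prompt from the four structural components.
subject="a serene lake at sunset"
style="professional photography"
details="mountains in background, golden hour lighting"
quality="masterpiece, best quality, highly detailed, 8k uhd"

prompt="$quality, $subject, $style, $details"
negative="low quality, blurry, deformed, watermark, signature, text"

echo "Positive: $prompt"
echo "Negative: $negative"
```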

Techniques:

  • Weights: (keyword:1.2) increases importance
  • Emphasis: (keyword) or ((keyword)) increases attention
  • De-emphasis: [keyword] or (keyword:0.8) reduces attention

Quality Modifiers

Positive:

  • masterpiece, best quality, highly detailed
  • 8k, uhd, professional, award winning
  • intricate details, sharp focus
  • beautiful, stunning, gorgeous

Negative:

  • low quality, blurry, deformed
  • ugly, bad anatomy, mutation
  • watermark, text, signature
  • duplicate, cropped, worst quality

Performance Optimization

GPU Memory Management

Check VRAM usage:

nvidia-smi
watch -n 1 nvidia-smi  # Monitor during generation

Optimize VRAM:

  • Use SD 1.5 instead of SDXL (saves 4GB)
  • Lower resolution (512x512 vs 768x768)
  • Reduce batch size
  • Close other GPU services (Ollama, etc.)

VRAM-Saving Techniques:

- Model: SD 1.5 instead of SDXL (-4GB)
- Resolution: 512x512 instead of 1024x1024 (-4GB)
- Batch size: 1 instead of 4 (-3GB)
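
The savings listed above can be combined into a simple decision rule: pick model, resolution, and batch size from the VRAM you have. A sketch with rough thresholds derived from the VRAM table earlier on this page:

```shell
# Suggest generation settings for a given amount of VRAM (in GB).
pick_settings() {
    vram=$1
    if [ "$vram" -ge 12 ]; then
        echo "SDXL 1024x1024 batch=4"
    elif [ "$vram" -ge 8 ]; then
        echo "SDXL 1024x1024 batch=1"
    else
        echo "SD1.5 512x512 batch=1"
    fi
}

pick_settings 6    # SD1.5 512x512 batch=1
pick_settings 12   # SDXL 1024x1024 batch=4
```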

Generation Speed

Factors:

  • GPU compute capability (newer = faster)
  • Model size (SD 1.5 faster than SDXL)
  • Resolution (smaller = faster)
  • Steps (fewer = faster, lower quality)

Optimize Speed:

  • Use DPM++ 2M Karras sampler (fast)
  • Reduce steps to 20-25
  • Lower resolution for testing
  • Use SD 1.5 for speed, SDXL for quality

Batch Processing

Generate many variations efficiently:

  1. Lock seed for consistency
  2. Vary prompt slightly
  3. Use batch generation
  4. Review and select best results

Troubleshooting

Out of Memory

# Check VRAM
nvidia-smi

# Solutions:
# 1. Use smaller model
# Switch to SD 1.5 (uses less VRAM than SDXL)

# 2. Reduce resolution
# Try 512x512 instead of 768x768

# 3. Lower batch size
# Generate 1 image at a time

# 4. Close other GPU services
sudo composectl stop genai-ollama

# 5. Restart SwarmUI
sudo composectl restart genai-swarmui

Black/Blank Images

Causes:

  • NSFW filter triggered
  • Model incompatibility
  • Incorrect VAE

Solutions:

  • Adjust prompt (remove triggers)
  • Try different model
  • Change VAE or use "None"

Slow Generation

# Verify GPU is being used
nvidia-smi

# Check logs
composectl logs genai-swarmui | grep -i gpu

# Ensure GPU_ID is correct
cat /srv/compose/genai/swarmui/.env.local

Service Won't Start

# Check logs
composectl logs genai-swarmui

# Verify GPU access
nvidia-smi

# Check disk space
df -h ${DATA_DIR}

# Ensure port 7801 is available
sudo netstat -tlnp | grep 7801

Model Won't Load

# Check model file exists
ls -lh ${DATA_DIR}/models/Stable-diffusion/

# Verify file integrity (.safetensors format)
file ${DATA_DIR}/models/Stable-diffusion/model.safetensors

# Check logs for error
composectl logs genai-swarmui | grep -i error

# Re-download model if corrupted

Best Practices

  1. Start with SD 1.5 - Faster, less VRAM, learn basics
  2. Upgrade to SDXL - Once comfortable, for quality
  3. Use good prompts - Quality modifiers matter
  4. Experiment with samplers - Find what works for your style
  5. Save good seeds - Reproduce successful images
  6. Organize outputs - SwarmUI auto-organizes by date
  7. Backup models - Re-downloading is slow
  8. Monitor VRAM - Prevent crashes
  9. Use negative prompts - Avoid common issues

Use Cases

Photo-Realistic Images

Model: Realistic Vision v5 (SD 1.5) or Juggernaut XL
Prompt: "photo of [subject], professional photography,
         8k uhd, studio lighting, detailed"
Settings: CFG 7, Steps 25-30, DPM++ 2M Karras

Artistic Styles

Model: DreamShaper 8
Prompt: "[subject], [style] art style, digital painting,
         trending on artstation, highly detailed"
Settings: CFG 8-10, Steps 30-40

Character Design

Model: SDXL with anime LoRA
Prompt: "character design sheet, [description],
         multiple views, white background"
ControlNet: Pose guidance

Product Visualization

Model: SDXL Base
Prompt: "product photo, [product], white background,
         studio lighting, commercial photography"
Settings: High resolution, refined details

Quick Reference

Service Management

# Start service
sudo composectl start genai-swarmui

# Check status
composectl status genai-swarmui

# View logs
composectl logs -f genai-swarmui

# Restart
sudo composectl restart genai-swarmui

Access and Directories

# Web UI
https://swarmui.yourdomain.com

# Model directories
${DATA_DIR}/models/Stable-diffusion/  # Main models
${DATA_DIR}/models/Lora/              # LoRA files
${DATA_DIR}/models/VAE/               # VAE models
${DATA_DIR}/models/ControlNet/        # ControlNet

# Generated images
${DATA_DIR}/output/

Quick Settings

Resolution: 512x512 (SD 1.5), 1024x1024 (SDXL)
Steps: 20-30
CFG Scale: 7-8
Sampler: DPM++ 2M Karras
Negative: low quality, blurry, deformed

Next: Plex Media Server - Media streaming setup →
