# Epiphany AI Art Studio — Full-Stack Diffusion + Video
A production‑ready AI art studio for images and video.
## Stack
- Frontend: Next.js 14 + Tailwind (or static UI served via public/index.html)
- API Gateway: Node 20 + Express + Zod + Prisma + BullMQ + S3/MinIO
- Inference Workers (GPU): Python FastAPI + PyTorch + Diffusers (SDXL, ControlNet)
- Video: Python FastAPI + Stable Video Diffusion / ModelScope T2V (pluggable)
- Editing: Real‑ESRGAN, GFPGAN, rembg, PIL/OpenCV ops
- Explainability: attention maps, token attributions, safety scores
- Infra: Postgres, Redis, MinIO, Docker Compose (GPU via nvidia‑container‑toolkit)
Note: Root static files were moved to apps/web/public/. Use that folder for UI assets.
## Quick start
- Build and start: `make up`
- Create buckets (first time): `make buckets` (requires `mc`)
- Tail logs: `make logs`
## Features
- Text→Image (SDXL base/refiner), Img2Img, Inpainting
- ControlNet (canny, depth, pose)
- Text→Video, Image→Video (pan/zoom), Stylize a video
- Post‑gen edits: Upscale, Restore face, Remove background, Crop/Resize, Caption
- Prompt enhancement, presets, seeds, steps/CFG, aspect ratios
- Safe ↔ Research ↔ NSFW modes honored end‑to‑end
- Job queue with progress, previews, resumable history/gallery
- Explainability overlays (attention heatmaps, token scores)
- Full audit logs (prompts/params/model hash/safety/timings)
## Repository layout

```
epiphany/
  apps/
    web/          # Next.js UI or static index.html/css in public/
  services/
    api/          # Express API (Zod, Prisma, BullMQ, S3 SDK)
    infer-image/  # FastAPI + Diffusers (SDXL, img2img, inpaint, ControlNet)
    infer-video/  # FastAPI + T2V adapter (SVD/ModelScope)
    edit/         # FastAPI editing tools (Real-ESRGAN, GFPGAN, rembg, PIL)
    explain/      # FastAPI explainability (attention maps, token scores)
  packages/
    sdk/          # TypeScript SDK + Zod schemas for /v1 endpoints
  infra/
    compose/      # docker-compose.yml + .env.example + Makefile
  ops/
    prisma/       # schema.prisma + migrations
    scripts/      # smoke tests, seeding
  assets/
    logos/        # epiphany.gif, gen-ai.gif
    test/        # sample inputs
```
## API

Headers: `X-API-Key: <string>`, `Content-Type: application/json`

- POST `/v1/enhance` → `{ prompt }` → `{ promptEnhanced, seedPhrases[] }`
- POST `/v1/generate/image` → txt2img / img2img / inpaint / controlnet
- POST `/v1/generate/video` → text→video, animate, stylize
- POST `/v1/edit/*` → upscale, restore-face, remove-bg, crop, resize, caption
- GET `/v1/jobs/:id` → `{ status, progress, outputUrl, previewUrls[], explainId, caption }`
- GET `/v1/jobs/:id/stream` → Server-Sent Events for live progress. Note: SSE also accepts `?key=<API_KEY>` for environments where headers are stripped
- GET `/v1/jobs/by-generation/:id` → look up the job for a generation
- POST `/v1/jobs/by-generation/:id/cancel` → cancel the job for a generation
- DELETE `/v1/jobs/:id` → remove a job (if present)
- HEAD `/v1/jobs/:id` → 200 if the job exists
- GET `/v1/generations` → recent history (paginated; supports `?signed=1&ttl=900`)
- GET `/v1/generations/search` → search by prompt substring (`?q=`)
- GET `/v1/generations/filter` → filter by modelId/stylePreset
- GET `/v1/tags` → lists available `models[]` and `styles[]`
- GET `/v1/generations/:id/events` → events for a generation
- GET `/v1/explain/:id` → token scores + heatmap URLs
- GET `/v1/events` → recent events (filter by `generationId`)
- GET `/v1/events.csv` → CSV export (supports `?generationId` and `?type`)
- GET `/v1/event-types` → list distinct event types
- GET `/v1/errors` → failed generations
- POST `/v1/retry/:id` → retry a failed generation
- GET `/v1/assets` → recent assets (paginated; supports `?signed=1&ttl=900`)
- GET `/v1/assets/:id/signed` → returns a signed URL for the asset (optional `?ttl=`)
- GET `/v1/assets/sign?url=` → sign an arbitrary public S3 URL from this MinIO
- GET `/v1/assets/search` → filter by kind/mime
- GET `/v1/health` → `{ ok, services: { db, redis, s3, infer_image, infer_video, edit, explain } }`
- GET `/v1/version` → `{ name, version }`
- GET `/v1/config` → safe runtime config (web origin, rate limit, S3 info)
- GET `/v1/ping` → `{ pong: true }`
- GET `/v1/system` → `{ health, version, config }`
- GET `/v1/time` → `{ now: <iso> }`
- GET `/v1/uptime` → `{ startedAt, uptimeMs }`
- GET `/v1/queues` → BullMQ queue counts by state
- GET `/v1/_routes` → simple introspection of registered routes
- GET `/v1/metrics` → aggregate counts (generations/assets/events/explains)
- GET `/v1/metrics.csv` → single-row CSV of totals
- POST `/v1/queues/empty` → drain a queue by name (body: `{ name }`)
- GET `/v1/stats/daily` → daily counts of generations/assets/events (last N days)
- GET `/v1/stats/daily.csv` → CSV of daily stats
All request/response bodies are typed and validated with Zod (SDK included). SSE is supported for job progress.
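For example, a minimal browser-side sketch of the txt2img flow: the endpoints are the documented ones above, while the `generationId` response field, the request payload fields, and the terminal status strings are assumptions.

```ts
// Sketch: submit a text→image job, look up its queue job, and follow
// progress over SSE. Payload/response shapes are assumptions.
const API = "http://localhost:4000";
const KEY = "dev";

async function txt2img(prompt: string): Promise<void> {
  const res = await fetch(`${API}/v1/generate/image`, {
    method: "POST",
    headers: { "X-API-Key": KEY, "Content-Type": "application/json" },
    body: JSON.stringify({ prompt, mode: 0, steps: 30, cfg: 7, seed: 42 }),
  });
  if (!res.ok) throw new Error(`generate failed: ${res.status}`);
  const { generationId } = await res.json(); // assumed response field

  // Find the queue job for this generation, then stream its progress.
  const jobRes = await fetch(`${API}/v1/jobs/by-generation/${generationId}`, {
    headers: { "X-API-Key": KEY },
  });
  const job = await jobRes.json();

  // ?key= is handy where intermediaries strip custom headers (see above).
  const es = new EventSource(`${API}/v1/jobs/${job.id}/stream?key=${KEY}`);
  es.onmessage = (ev) => {
    const { status, progress, outputUrl } = JSON.parse(ev.data);
    console.log(status, progress);
    if (status === "succeeded" || status === "failed") {
      if (outputUrl) console.log("output:", outputUrl);
      es.close();
    }
  };
}
```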
## Data model

- Generation: kind (image|video), status, prompts, mode (0|1|2), steps/cfg/seed, modelId/hash, ControlNet params, input images, outputs, previews, safety, timings, errors (sketched below)
- Asset: url, kind, mime, dims, bytes, sha256
- Explain: generationId, tokenScores JSON, heatmapUrls[]
- Event: generationId, type, payload JSON, createdAt
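A hypothetical Zod sketch of the Generation record; the real schemas live in `packages/sdk` and `ops/prisma`, and the exact field names may differ:

```ts
import { z } from "zod";

// Illustrative shape only — mirrors the Generation bullet above.
export const Generation = z.object({
  id: z.string(),
  kind: z.enum(["image", "video"]),
  status: z.string(),
  prompt: z.string(),
  promptEnhanced: z.string().optional(),
  mode: z.union([z.literal(0), z.literal(1), z.literal(2)]),
  steps: z.number().int().positive(),
  cfg: z.number(),
  seed: z.number().int(),
  modelId: z.string(),
  modelHash: z.string().optional(),
  outputUrls: z.array(z.string().url()).default([]),
  previewUrls: z.array(z.string().url()).default([]),
  error: z.string().nullable().optional(),
});
export type Generation = z.infer<typeof Generation>;
```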
## Infrastructure

- Postgres (Prisma)
- Redis (BullMQ queues: `generate_image`, `generate_video`, `edit_image`, `explain`) (see the sketch after this list)
- MinIO (S3 buckets: `epiphany-outputs`, `epiphany-inputs`, `epiphany-explain`)
- Docker Compose with GPU services (nvidia runtime)
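As a rough illustration of the queue wiring, a minimal BullMQ producer/consumer pair might look like this; the queue name comes from the list above, while the job payload, handler body, and `REDIS_URL` fallback are assumptions:

```ts
import { Queue, Worker } from "bullmq";
import IORedis from "ioredis";

// Shared Redis connection; maxRetriesPerRequest: null is required by BullMQ workers.
const connection = new IORedis(process.env.REDIS_URL ?? "redis://localhost:6379", {
  maxRetriesPerRequest: null,
});

export const generateImageQueue = new Queue("generate_image", { connection });

// Producer (API side): enqueue a job after persisting the Generation row.
await generateImageQueue.add("txt2img", { generationId: "gen_123", prompt: "a red fox" });

// Consumer (worker side): report progress so /v1/jobs/:id/stream has data.
new Worker(
  "generate_image",
  async (job) => {
    await job.updateProgress(50);
    // ... call the infer-image FastAPI service, upload outputs to MinIO ...
    return { outputUrl: "s3://epiphany-outputs/..." }; // illustrative result
  },
  { connection },
);
```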
## Getting started

- Install prerequisites
  - Node 20, pnpm
  - Python 3.11, CUDA‑compatible GPU + NVIDIA driver
  - Docker + nvidia‑container‑toolkit
- Bootstrap
  ```sh
  pnpm install
  ```
- Env and compose
  ```sh
  cp infra/compose/.env.example .env
  # edit values for db/redis/s3/api key
  make -C infra/compose up   # or: docker compose -f infra/compose/docker-compose.yml up -d --build
  ```
- Prisma
  ```sh
  cd services/api
  pnpm prisma:generate && pnpm prisma:migrate dev
  ```
- Run services (dev)
  ```sh
  # API
  cd services/api && pnpm dev

  # Python workers (examples)
  uvicorn main:app --host 0.0.0.0 --port 8001   # infer-image
  uvicorn main:app --host 0.0.0.0 --port 8002   # infer-video
  uvicorn main:app --host 0.0.0.0 --port 8003   # edit
  uvicorn main:app --host 0.0.0.0 --port 8004   # explain
  ```
- Web: the Next.js app in `apps/web` serves the existing static UI via `public/index.html` and exposes `/gallery` and `/health` pages.
- Health
  ```sh
  curl -H "X-API-Key: dev" http://localhost:4000/v1/health
  ```
## Safety modes

- Mode 0: Safe — run the safety checker; block/blur disallowed content
- Mode 1: Research — run the safety checker, allow the output, log scores
- Mode 2: NSFW — skip the safety checker (logged as bypassed)

The mode flag propagates end‑to‑end from API → workers → outputs and logs.
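A minimal sketch of how a worker could enforce this policy, assuming hypothetical `runSafetyChecker`/`blurOrBlock` helpers and a 0.5 score threshold:

```ts
// Sketch: gate the safety checker on the mode flag. `runSafetyChecker`,
// `blurOrBlock`, and the 0.5 threshold are hypothetical stand-ins.
type Mode = 0 | 1 | 2; // 0 = Safe, 1 = Research, 2 = NSFW

declare function runSafetyChecker(img: Buffer): Promise<{ nsfw: number }>;
declare function blurOrBlock(img: Buffer): Buffer;

async function applySafetyPolicy(
  image: Buffer,
  mode: Mode,
  log: (event: Record<string, unknown>) => void,
): Promise<Buffer> {
  if (mode === 2) {
    log({ type: "safety_bypassed", mode }); // NSFW: skip checker, but record the bypass
    return image;
  }
  const scores = await runSafetyChecker(image);
  log({ type: "safety_scored", mode, scores }); // Safe and Research both log scores
  if (mode === 0 && scores.nsfw > 0.5) {
    return blurOrBlock(image); // Safe: block or blur disallowed content
  }
  return image; // Research: allow the output; scores are already logged
}
```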
## Explainability

- Capture cross‑attention maps during diffusion
- Compute simple token attributions
- Save heatmaps (PNG) and scores (JSON) to S3
- Expose via `/v1/explain/:id` and overlay in the UI (fetch sketch below)
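For instance, a client could rank the most influential prompt tokens from the explain payload; the `tokenScores` shape used here (token → score map) is an assumption:

```ts
// Sketch: rank the most influential prompt tokens for a generation.
// Assumes tokenScores maps token → score; the real shape may differ.
async function topTokens(explainId: string, k = 5): Promise<[string, number][]> {
  const res = await fetch(`http://localhost:4000/v1/explain/${explainId}`, {
    headers: { "X-API-Key": "dev" },
  });
  const { tokenScores, heatmapUrls } = await res.json();
  console.log("heatmap overlays:", heatmapUrls);
  return Object.entries<number>(tokenScores)
    .sort(([, a], [, b]) => b - a) // highest attribution first
    .slice(0, k);
}
```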
## SDK

`packages/sdk`: Zod schemas and a small TypeScript client for all `/v1` endpoints.
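Hypothetical usage of that client; the package name, constructor, and method names are illustrative, so check the SDK source for the real surface:

```ts
// Assumed client surface — not the repo's actual exports.
import { EpiphanyClient } from "@epiphany/sdk";

const client = new EpiphanyClient({ baseUrl: "http://localhost:4000", apiKey: "dev" });

const { promptEnhanced } = await client.enhance({ prompt: "a red fox at dawn" });
const gen = await client.generateImage({ prompt: promptEnhanced, mode: 0 });
console.log(await client.jobByGeneration(gen.id)); // { status, progress, ... }
```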
## Deployment notes

- Ensure NVIDIA drivers and `nvidia-container-toolkit` are installed on the host
- Set `WEB_ORIGIN=http://localhost:3000` (or your domain) for CORS
- Optional rate limiting: `RATE_LIMIT_MAX`, `RATE_LIMIT_WINDOW_MS`
- `make up` builds and starts the services; `make buckets` creates the MinIO buckets
- Run `ops/scripts/smoke.sh` or `ops/scripts/smoke.ps1` to verify end‑to‑end
## Environment variables

- API: `API_PORT`, `API_KEY`, `WEB_ORIGIN`, `ALLOW_NSFW`, `RATE_LIMIT_MAX`, `RATE_LIMIT_WINDOW_MS`
- DB: `DATABASE_URL`
- Redis: `REDIS_URL`
- S3/MinIO: `S3_ENDPOINT`, `S3_REGION`, `S3_BUCKET`, `S3_INPUTS_BUCKET`, `S3_ACCESS_KEY`, `S3_SECRET_KEY`
- Workers: `INFER_IMAGE_PORT`, `INFER_VIDEO_PORT`, `EDIT_PORT`, `EXPLAIN_PORT`
- Security: `ALLOWED_URL_PREFIXES` (comma‑separated URL prefixes allowed for remote fetch of init/mask/media)
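One way to fail fast on misconfiguration is to validate these at startup with the same Zod dependency the API already uses; which fields get defaults or are optional here is an assumption:

```ts
import { z } from "zod";

// Sketch: parse the variables above from process.env at startup so the
// API exits with a readable report instead of failing later at runtime.
const Env = z.object({
  API_PORT: z.coerce.number().default(4000),
  API_KEY: z.string().min(1),
  WEB_ORIGIN: z.string().url().default("http://localhost:3000"),
  DATABASE_URL: z.string().url(),
  REDIS_URL: z.string().url(),
  S3_ENDPOINT: z.string().url(),
  S3_ACCESS_KEY: z.string(),
  S3_SECRET_KEY: z.string(),
  RATE_LIMIT_MAX: z.coerce.number().optional(),
  RATE_LIMIT_WINDOW_MS: z.coerce.number().optional(),
});

export const env = Env.parse(process.env); // throws on invalid/missing values
```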
## GPU notes

- Install the NVIDIA driver and `nvidia-container-toolkit`
- Ensure CUDA‑compatible PyTorch wheels are used in the worker images
- Use the Compose service GPU flags (already set) or `--gpus all` when running containers
## Theme

- Background: `#000`
- Panels: `#151517` / `#101012`
- Border: `#26262a`
- Text: `#e6e6ea`; Muted: `#a4a4ad`
- Brand gradient: `#FF007A → #7A00FF → #FF6A00`
- Mode slider: Safe `#6bd17d`, Research `#f0b153`, NSFW `#FF007A`
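Since the frontend uses Tailwind, these values could be exposed as theme tokens; this `tailwind.config.ts` sketch and its token names are illustrative, not the repo's actual config:

```ts
// Sketch: wire the palette above into Tailwind. Token names (panel,
// panelAlt, brand1, ...) are made up for illustration.
import type { Config } from "tailwindcss";

export default {
  content: ["./app/**/*.{ts,tsx}", "./public/index.html"],
  theme: {
    extend: {
      colors: {
        panel: "#151517",
        panelAlt: "#101012",
        line: "#26262a",
        ink: "#e6e6ea",
        muted: "#a4a4ad",
        brand1: "#FF007A",
        brand2: "#7A00FF",
        brand3: "#FF6A00",
        safe: "#6bd17d",
        research: "#f0b153",
        nsfw: "#FF007A",
      },
    },
  },
} satisfies Config;
```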
## License

MIT — research/education/portfolio use. See the audit and safety sections before deploying.
