Cross Attention Map

Thanks to HuggingFace Diffusers team for the GPU sponsorship!

This repository is for extracting and visualizing attention maps, compatible with the latest Diffusers code (v0.29.0).

For errors reports or feature requests, please raise an issue :)

Update Log

[2024-07-04] 🎉 (Latest update) Added features for saving attention maps based on timesteps and paths, and refactored the code. 🎉

Compatible models

UNet with attn2(cross attention module) is compatible

Examples

6_kangaroo

10_hoodie

13_sunglasses

Initialize

python -m venv .venv
source .venv/bin/activate
# or
conda create -n attn python=3.9 -y
conda activate attn

pip install -r requirements.txt

Visualize

Visualize Cross Attention Map for Text-to-Image

python t2i.py

How to use

There are two methods for saving the attention map.: save_by_timesteps_and_path or save_by_timesteps(more intuitive.)

import torch
from diffusers import DiffusionPipeline
from utils import (
    attn_maps,
    cross_attn_init,
    register_cross_attention_hook,
    set_layer_with_name_and_path,
    save_by_timesteps_and_path,
    save_by_timesteps
)

##### 1. Init modules #####
cross_attn_init()
###########################

pipe = DiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    torch_dtype=torch.float16,
)
pipe = pipe.to("cuda:0")

##### 2. Replace modules and Register hook #####
pipe.unet = set_layer_with_name_and_path(pipe.unet)
pipe.unet = register_cross_attention_hook(pipe.unet)
################################################

height = 512
width = 768
prompt = "A portrait photo of a kangaroo wearing an orange hoodie and blue sunglasses standing on the grass in front of the Sydney Opera House holding a sign on the chest that says 'SDXL'!."

image = pipe(
    prompt,
    height=height,
    width=width,
    num_inference_steps=15,
).images[0]
image.save('test.png')

##### 3. Process and Save attention map #####
print('resizing and saving ...')

##### 3-1. save by timesteps and path (2~3 minutes) #####
save_by_timesteps_and_path(pipe.tokenizer, prompt, height, width)
#########################################################

##### 3-2. save by timesteps (1~2 minutes) #####
# save_by_timesteps(pipe.tokenizer, prompt, height, width)
################################################

TODO

Applications(prompt-to-prompt, ...)

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
assets		assets
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
modules.py		modules.py
requirements.txt		requirements.txt
t2i.py		t2i.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cross Attention Map

Update Log

Compatible models

Examples

Initialize

Visualize

How to use

TODO

About

Releases

Packages

Contributors 4

Languages

License

wooyeolBaek/attention-map

Folders and files

Latest commit

History

Repository files navigation

Cross Attention Map

Update Log

Compatible models

Examples

Initialize

Visualize

How to use

TODO

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages