Project for AI702: Deep Learning at MBZUAI!
Making localised edits to an image with text-prompted generative models while preserving its background remains a challenge for current state-of-the-art methods. In this project, we explore combining two promising but separate technologies: automatically defining the desired image region from a text prompt, and making highly localised edits within that region by generating new pixel-space image content, guided only by text.
To set up this repository, first install the required libraries listed in "requirements.txt". Development was done on Ubuntu 22.04 with CUDA Toolkit 11.7.
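A minimal setup sketch, assuming pip is available and your driver matches the CUDA 11.7 toolkit noted above:

```bash
# Install the pinned dependencies (tested on Ubuntu 22.04 / CUDA 11.7).
pip install -r requirements.txt
```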
Then, download the required model weights from the following link: https://mbzuaiac-my.sharepoint.com/:u:/g/personal/kane_lindsay_mbzuai_ac_ae/EcIJJlGPIPpIhPA9KH7EbooB--JFof5wLPMbbWlp_2wLAw?e=NQ8JHe
Finally, place the downloaded files in the following folders to set up the project (a directory sketch is given after the lists):
In "/checkpoints/"
- 256x256_diffusion_uncond.pt
In "/LAVT-RIS/checkpoints/":
- gref_google.pth
- gref_umd.pth
- refcoco.pth
- refcoco+.pth
- swin_base_patch4_window12_384_22k.pth
In "/LAVT_RIS/pretrained_weights/"
- lavt_one_8_cards_ImgNet22KPre_swin-base-window12_refcocogGOOG_adamw_b32lr0.00005wd1e-2_E40.pth
In "/LAVT_RIS/refer/data" (optional)
- images
- refclef
- refcoco
- refcoco+
- refcocog
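For reference, after placing the files the relevant parts of the repository should look like the sketch below (the refer/data subtree is optional, as noted above):

```
checkpoints/
└── 256x256_diffusion_uncond.pt
LAVT-RIS/
├── checkpoints/
│   ├── gref_google.pth
│   ├── gref_umd.pth
│   ├── refcoco.pth
│   ├── refcoco+.pth
│   └── swin_base_patch4_window12_384_22k.pth
├── pretrained_weights/
│   └── lavt_one_8_cards_ImgNet22KPre_swin-base-window12_refcocogGOOG_adamw_b32lr0.00005wd1e-2_E40.pth
└── refer/
    └── data/            # optional
        ├── images/
        ├── refclef/
        ├── refcoco/
        ├── refcoco+/
        └── refcocog/
```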
PDME is best run using command-line arguments. The script "run.sh" is provided for convenience with preset example arguments. You can supply paths to your own images, together with target prompts and edit prompts, to produce edits of your own images. A usage sketch follows.
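The flag names below are assumptions for illustration only, as is the "main.py" entry point; consult "run.sh" for the exact script and argument names this project expects:

```bash
# Run the preset example shipped with the repository:
bash run.sh

# Hypothetical custom invocation. Flag and script names are assumptions;
# check run.sh for the real entry point and argument names.
python main.py \
  --input_image ./examples/photo.jpg \
  --target_prompt "the dog on the left" \
  --edit_prompt "a dog wearing a red scarf" \
  --output_dir ./outputs
```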