solved issue of unknown watermark �

ATolkachev · Dec 1, 2020 · 165eeda · 165eeda
1 parent b68bdca
commit 165eeda
Show file tree

Hide file tree

Showing 25 changed files with 235 additions and 275 deletions.
diff --git a/.gitignore b/.gitignore
@@ -1,3 +1,3 @@
 __pycache__
-data
+data/watermark-unavailable/*
 model
diff --git a/README.md b/README.md
@@ -1,9 +1,145 @@
-# Watermark Removal using Deep Image Priors using Pytorch
+# Watermark Removal using Deep Image Priors with Pytorch
 
-Original Image                    |  Watermark                       | Watermarked Image
-:---------------------------------|:---------------------------------|:---------------------------------:
+## __This is the implementation of paper [Deep Image Prior](https://dmitryulyanov.github.io/deep_image_prior), all credit goes its authors.__
 
-![Watermarked Image Visualization](data/sample.png)
+CNNs are very common for image generation and restoration tasks. And it is believed that their great performance is because of their ability to learn realistic image priors from training on large datasets. This paper shows that the structure of a generator alone is sufficient to provide enough low-level image statistics without any learning. Thus most of the image restoration tasks, for example denoising, super resolution, artifacts removal, watermark removal etc can be done with highly realistic results without any training.
 
-## Generator's progress
-![Progress of the generator](progress.gif)
+In this repo I've implemented watermark removal task, and the results are just as good as claimed by the authors. 
+
+## When the watermark is available.
+
+### # So in this scenario, the requirements are:
+
+- The watermark that is applied to the watermarked image, is available to you.
+
+- The scale, position, rotation and other spatial transformations of the watermark, extactly matches to the applied watermark of the image.
+
+For example-
+
+<table style="float:center">
+ <tr>
+  <th>Original Image</th><th>Watermark</th><th>Watermarked-Image</th>
+ </tr>
+ <tr>
+  <td>
+   <img src='./outputs/watermark-available/original-image.png' >
+  </td>
+  <td>
+    <img src='./outputs/watermark-available/watermark.png' >
+  </td>
+  <td>
+    <img src='./outputs/watermark-available/watermarked-image.png' >
+  </td>
+</tr>
+</table>
+
+### # Running inference
+
+Run `$ python inference.py` with following arguments-
+
+```
+
+Removing Watermark
+
+optional arguments:
+-h, --help          show this help message and exit
+--image-path IMAGE_PATH
+                    Path to the "watermarked" image.
+--watermark-path WATERMARK_PATH
+                    Path to the "watermark" image.
+--input-depth INPUT_DEPTH
+                    Max channel dimenstion of the noise input. Set it
+                    based on gpu/device memory you have available.
+--lr LR             Learning rate.
+--training-steps TRAINING_STEPS
+                    Number of training iterations.
+--show-steps SHOW_STEPS
+                    Interval for visualizing results.
+--reg-noise REG_NOISE
+                    Hyper-parameter for regularized noise input.
+--device DEVICE
+                    Device for pytorch, either "cpu" or "cuda".
+
+```
+
+### # Outputs
+
+![Progress of the generator](outputs/watermark-available/progress.gif)
+
+## __When the watermark is not available__.
+
+In this scenario we'll have the watermaked image only. And this is the actual and highly realistic scenario because of obvious reasons. For very trivial causes, the first scenario was too easy to tackle than this one. Even the authors provided the outputs for first scenario only.
+
+### # Let us see how seriously difficult this is:
+
+- Only watermarked image is available.
+
+- We can provide absolutely __no__ info to the generator regarding:
+
+    - Where is the watermark?
+    - What exactly is the part of watermark?
+    - What exactly is not the part of watermark?
+
+- And we want to do this without any training!!! Why? Well there's no point, I mean we know for a fact that the generator is indeed capable of inpainting the watermark, __its just us who are not able to provide the answers to the generator for questions above.__
+
+### # My solution
+
+Read the last bold statement again, if we solve that issue, then its just a matter of following first scenario ain't it. I hope you can see where I'm going from this 😉. If you look at bigger picture of watermark removal, then in a nutshell its just an image inpainting task right? So, all we need to do is, roughly highlight the watermarked region from any paint software and you're good to go.
+
+Yup its just that simple.
+
+Yes, its not a holy new solution, I've seen it done before in form of image inpainting. But I never saw anyone applying this for removing watermark.
+
+Now you might be thinking that its not a fully automated task anymore, since you would have to manually sit..tahh and highlight the watermarked region. Though I'm pretttty sure that after seeing the final results, you surely would enjoy that sitting :)
+
+Moreover, think about how many problems we are solving by just simply doing this:
+
+- No need to train a watermark detection model. Its actually hard to do than typical object detections.
+
+- Even if we are able to detect watermark, its still doesn't help that much, cuz the watermark can be drawn on whole image, not on just a small region.
+
+- No need to train the generator on huge image datasets for learing image statistics.
+
+- No need to train the generator over adversarial loss, which is already very difficult for producing higher resolution images like 1024 and more...
+
+- And, all other solutions I've seen to far, which try to automate the whole procedure of detecting and removing watermark, produces very visible artificats.
+
+Okay, enought talk.
+
+First step is to create an overlay containing strokes, that hides the watermark.
+
+I'm simply using MS Paint for that. And per image it hardly takes 1 minute to draw.
+
+Here are some sample overlayes...
+
+![Overlay example](outputs/watermark-unavailable/overlays/overlay1.png)
+
+![Overlay example](outputs/watermark-unavailable/overlays/overlay2.png)
+
+## # Time for the results
+
+### Experiment 1
+
+![Experiment result](outputs/watermark-unavailable/output1.png)
+
+### Experiment 2
+
+![Experiment result](outputs/watermark-unavailable/output2.png)
+
+### Experiment 3
+
+![Experiment result](outputs/watermark-unavailable/output3.png)
+
+### Experiment 4
+
+![Experiment result](outputs/watermark-unavailable/output4.png)
+
+### Experiment 5
+
+![Experiment result](outputs/watermark-unavailable/output5.png)
+
+### Experiment 6
+
+![Experiment result](outputs/watermark-unavailable/output6.png)
+
+As I said earlier, the outputs are highly realistic with almost unnoticeable artifacts.
diff --git a/data/me.jpg b/data/me.jpg
diff --git a/data/sample.png b/data/sample.png
diff --git a/data/image1.png → data/watermark-available/image1.png b/data/image1.png → data/watermark-available/image1.png
diff --git a/data/image2.png → data/watermark-available/image2.png b/data/image2.png → data/watermark-available/image2.png
diff --git a/data/image3.png → data/watermark-available/image3.png b/data/image3.png → data/watermark-available/image3.png
diff --git a/data/watermark.png → data/watermark-available/watermark.png b/data/watermark.png → data/watermark-available/watermark.png
diff --git a/data/watermark2.png b/data/watermark2.png
diff --git a/inference.py b/inference.py
@@ -5,68 +5,79 @@
 import os
 import numpy as np
 from PIL import Image
+from tqdm.auto import tqdm
 
 from helper import *
 from model.generator import SkipEncoderDecoder, input_noise
 
-DTYPE = torch.cuda.FloatTensor
-INPUT_DEPTH = 32
-LR = 0.01 
-TRAINING_STEPS = 6001
-SHOW_STEP = 50
-REG_NOISE = 0.03
+import argparse
 
-image_path  = os.path.join('data', 'me.jpg')
-watermark_path = os.path.join('data', 'watermark2.png')
+parser = argparse.ArgumentParser(description = 'Removing Watermark')
+parser.add_argument('--image-path', type = str, default = './data/watermark-available/me.jpg', help = 'Path to the "watermarked" image.')
+parser.add_argument('--watermark-path', type = str, default = './data/watermark-available/watermark.png', help = 'Path to the "watermark" image.')
+parser.add_argument('--input-depth', type = int, default = 32, help = 'Max channel dimension of the noise input. Set it based on gpu/device memory you have available.')
+parser.add_argument('--lr', type = float, default = 0.01, help = 'Learning rate.')
+parser.add_argument('--training-steps', type = int, default = 4000, help = 'Number of training iterations.')
+parser.add_argument('--show-steps', type = int, default = 50, help = 'Interval for visualizing results.')
+parser.add_argument('--reg-noise', type = float, default = 0.03, help = 'Hyper-parameter for regularized noise input.')
+parser.add_argument('--device', type = str, default = 'cuda', help = 'Device for pytorch, either "cpu" or "cuda".')
 
-image_pil = read_image(image_path)
+args = parser.parse_args()
+if args.device == 'cuda' and not torch.cuda.is_available():
+    args.device = 'cpu'
+    print('\nSetting device to "cpu", since torch is not built with "cuda" support...')
+
+DTYPE = torch.cuda.FloatTensor if args.device == "cuda" else torch.FloatTensor
+
+image_pil = read_image(args.image_path)
 image_pil = image_pil.convert('RGB')
+image_pil = image_pil.resize((128, 128))
 
-image_mask_pil = read_image(watermark_path)
+image_mask_pil = read_image(args.watermark_path)
 image_mask_pil = image_mask_pil.convert('RGB')
 image_mask_pil = image_mask_pil.resize((image_pil.size[0], image_pil.size[1]))
 
 image_np = pil_to_np_array(image_pil)
 image_mask_np = pil_to_np_array(image_mask_pil)
 image_mask_np[image_mask_np == 0.0] = 1.0
 
-image_mask_var = np_to_torch_array(image_mask_np).type(DTYPE)
+image_var = np_to_torch_array(image_np).type(DTYPE)
+mask_var = np_to_torch_array(image_mask_np).type(DTYPE)
 
 visualize_sample(image_np, image_mask_np, image_mask_np * image_np, nrow = 3, size_factor = 12)
 
-
+print('Building model...\n')
 generator = SkipEncoderDecoder(
-    INPUT_DEPTH,
+    args.input_depth,
     num_channels_down = [128] * 5,
     num_channels_up = [128] * 5,
     num_channels_skip = [128] * 5
 ).type(DTYPE)
-generator_input = input_noise(INPUT_DEPTH, image_np.shape[1:]).type(DTYPE)
+generator_input = input_noise(args.input_depth, image_np.shape[1:]).type(DTYPE)
 summary(generator, generator_input.shape[1:])
 
-objective = torch.nn.MSELoss().type(DTYPE)
-optimizer = optim.Adam(generator.parameters(), LR)
+objective = nn.MSELoss()
+optimizer = optim.Adam(generator.parameters(), args.lr)
 
-image_var = np_to_torch_array(image_np).type(DTYPE)
-mask_var = np_to_torch_array(image_mask_np).type(DTYPE)
+generator_input_saved = generator_input.clone()
+noise = generator_input.clone()
+generator_input = generator_input_saved
 
-generator_input_saved = generator_input.detach().clone()
-noise = generator_input.detach().clone()
+print('\nStarting training...\n')
 
-for step in range(TRAINING_STEPS):
+for step in tqdm(range(args.training_steps), desc = 'Completed', ncols = 100):
     optimizer.zero_grad()
-    generator_input = generator_input_saved
 
-    if REG_NOISE > 0:
-        generator_input = generator_input_saved + (noise.normal_() * REG_NOISE)
+    if args.reg_noise > 0:
+        generator_input = generator_input_saved + (noise.normal_() * args.reg_noise)
 
     out = generator(generator_input)
 
     loss = objective(out * mask_var, image_var * mask_var)
     loss.backward()
 
-    if step % SHOW_STEP == 0:
+    if step % args.show_steps == 0:
         out_np = torch_to_np_array(out)
         visualize_sample(np.clip(out_np, 0, 1), nrow = 1, size_factor = 5)
 
-    optimizer.step()
+    optimizer.step()
diff --git a/model/__pycache__/generator.cpython-37.pyc b/model/__pycache__/generator.cpython-37.pyc
diff --git a/model/__pycache__/modules.cpython-37.pyc b/model/__pycache__/modules.cpython-37.pyc
diff --git a/notebook.ipynb b/notebook.ipynb
diff --git a/outputs/watermark-available/original-image.png b/outputs/watermark-available/original-image.png
diff --git a/outputs/watermark-available/progress.gif b/outputs/watermark-available/progress.gif
diff --git a/outputs/watermark-available/watermark.png b/outputs/watermark-available/watermark.png
diff --git a/outputs/watermark-available/watermarked-image.png b/outputs/watermark-available/watermarked-image.png
diff --git a/outputs/watermark-unavailable/output1.png b/outputs/watermark-unavailable/output1.png
diff --git a/outputs/watermark-unavailable/output2.png b/outputs/watermark-unavailable/output2.png
diff --git a/outputs/watermark-unavailable/output3.png b/outputs/watermark-unavailable/output3.png
diff --git a/outputs/watermark-unavailable/output4.png b/outputs/watermark-unavailable/output4.png
diff --git a/outputs/watermark-unavailable/output5.png b/outputs/watermark-unavailable/output5.png
diff --git a/outputs/watermark-unavailable/output6.png b/outputs/watermark-unavailable/output6.png
diff --git a/outputs/watermark-unavailable/overlays/overlay1.png b/outputs/watermark-unavailable/overlays/overlay1.png
diff --git a/outputs/watermark-unavailable/overlays/overlay2.png b/outputs/watermark-unavailable/overlays/overlay2.png