Skip to content

Commit 6682448

Browse files
Update README.md
1 parent 30a6283 commit 6682448

File tree

1 file changed

+18
-1
lines changed

1 file changed

+18
-1
lines changed

README.md

Lines changed: 18 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,15 @@
22
<h2>About Project:</h2>
33
<p>
44
<img src='https://github.com/SurajChinna/Image-Style-Transfer-Pytorch/blob/master/assets/download.png' />
5-
The project is about classifying the handwritten digits taken from MNIST data.
5+
The main goal of project is to combine two images and produce new image. The combination works in slightly different way i.e., we combine the style of one image with the content of other image. First we take the image from which we want to extract content usually called <b>content image</b> and take another image from which the style is to be extracted usually called <b>style image</b>. This is the implementation of <a href='https://arxiv.org/pdf/1508.06576.pdf'>this</a> research paper.
6+
</p>
7+
8+
<p>
9+
Convolutional Neural Networks are a type of neural networks which are used widely in Image classification and recongnition. A CNN architecture called VGG19 has been used in this project. The starting layers in this architecture extract the basic features and shapes and later layers will extract more complex image patterns. So for the output image we will take the <b>content</b> from later layers of CNN. For extracting the style of image, we take the correlations between different layers using <a href="https://en.wikipedia.org/wiki/Gramian_matrix">Gram Matrix</a>
10+
</p>
11+
12+
<p>
13+
Initially, we take any random image as target(or taking the content image would be useful) and compute the <b>Content loss</b> and <b>Style loss</b> and decreasing these losses we would reach the perfect target image that has the style of one image and content of other image. For more learning checkout the links below.
614
</p>
715

816
<h2>Languages or frameworks used</h2>
@@ -11,7 +19,16 @@
1119
<li>Python: language</li>
1220
<li>NumPy: library for numerical calculations</li>
1321
<li>Matplotlib: library for data visualisation</li>
22+
<li>PIL: Python Image Library for opening and manage different image formats</li>
1423
<li>torch: a deep learning framework by Facebook AI Research Team</li>
1524
<li>torchvision: package consists of popular datasets, model architectures, and common image transformations for computer vision</li>
1625
</ul>
1726
</p>
27+
28+
<h2>More Learning</h2>
29+
<p>
30+
<ul>
31+
<li>https://www.youtube.com/watch?v=R39tWYYKNcI&index=37&list=PLkDaE6sCZn6Gl29AoE31iwdVwSG-KnDzF</li>
32+
<li>https://towardsdatascience.com/artistic-style-transfer-b7566a216431</li>
33+
</ul>
34+
</p>

0 commit comments

Comments
 (0)