Commit 4acae68

Changes to deploy

1 parent e8e3f66 commit 4acae68

File tree

8 files changed: +96 -2 lines changed

README.md

Lines changed: 93 additions & 1 deletion
@@ -1,2 +1,94 @@
# Doodle-to-Image-Generator

This project automatically generates realistic images from doodles using GauGAN, and it has been deployed on Streamlit. You can check it out [here](https://shreyz-max-doodle-to-image-generator-streamlitapp-miq4ua.streamlitapp.com/).

It uses GauGAN to generate the images from the semantic maps, i.e. the doodles drawn here.

The model is based on a conditional GAN, where a realistic image is generated given an input image and a condition.

The model has been taken from NVIDIA Labs' SPADE, released in 2019.
## Table of contents

* <a href="#Idea">Idea</a>
* <a href="#SampleResults">Sample Results</a>
* <a href="#Dataset">Dataset</a>
* <a href="#Setup">Setup</a>
* <a href="#Components">Different Components</a>
* <a href="#FineTuning">Fine Tuning the model</a>
* <a href="#ModelLoss">Model and Loss</a>
* <a href="#OtherExamples">Other Examples</a>
* <a href="#References">References</a>
<h2 id="Idea">Idea</h2>
19+
A lot of interest was captured when GauGAN2 was released by Nvidia recently.
20+
I wanted to check it out but turns out GauGAN2 has not yet been open-sourced to the public.
21+
So, I started looking into GauGAN in general and found this implementation.
22+
I wanted to fully understand the functioning of the model and understand how even though it was adopted from pix2pix model,
23+
it still had way better results. Another factor was I wanted to make the front end easily accessible by the data science community
24+
since not everyone is well versed with html and css. Hence, I deployed it on streamlit.
25+
26+
<h2 id="SampleResults">Sample Results</h2>
27+
Here is the working of GauGAN in real life deployed on a website.
28+
29+
<p align = "center"><img align = "center" src = "images/gaugan.gif" /></p>
30+
31+
<h2 id="Dataset">Dataset</h2>
32+
Originally in the SPADE paper, the model was trained on 3 different datasets namely COCO, cityscapes and ADE20K. Although, Flcikr dataset was also used however I am not so sure about the segmentation
33+
of that dataset. The model has been trained on 8 V100 GPUs that equals 128 GB of memory. So, to avoid any of such memory problems
34+
I used a pretrained dataset. However, I tried training on custom dataset as well. You can find the details to that here.
35+
36+
<h2 id="Setup">Setup</h2>
37+
38+
You can easily setup this application. Here are the steps to replicate my outcome in your system.
39+
40+
Clone the repository. <code>git clone https://github.com/Shreyz-max/Doodle-to-Image-Generator.git</code>
41+
42+
Create a conda environment. <code>conda create -n doodle_image python=3.10</code>
43+
44+
Activate environment. <code>conda activate doodle_image</code>
45+
46+
Install requirements file. <code>pip install -r requirements.txt</code>
47+
48+
Run app.py <code>streamlit run streamlit/app.py</code>
49+
50+
<h2 id="Components">Different Components</h2>
51+
<code>app.py</code> has all of the streamlit code to run the frontend.
52+
53+
<code>label_colors.py</code> contains a list of dictionaries for each label as well as it's corresponding color that I have assigned
54+
and its corresponding id in the coco dataset.
55+
56+
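For illustration, here is a hypothetical sketch of what one such entry could look like; the key names, colors, and ids below are assumptions for this example, not the actual contents of `label_colors.py`:

```python
# Hypothetical example entries; the real label_colors.py may use a
# different structure, and the ids below are placeholders that must be
# replaced by the real COCO label ids.
label_colors = [
    {"label": "sky",   "color": (135, 206, 235), "id": 1},
    {"label": "sea",   "color": (30, 144, 255),  "id": 2},
    {"label": "grass", "color": (124, 252, 0),   "id": 3},
]
```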
<h2 id="FineTuning">Fine Tuning the model</h2>
57+
Here are a few things that I did.
58+
So basically, GauGAN is trained to take a black and white semantic map and convert it into a realisitc image.
59+
So, once we have a painted image, it is converted into black and white using its labels. I have selected a few labels from COCO
60+
dataset. You have 182 labels. So, you can choose any of the labels. Just select a few labels from your choice from COCO dataset.
61+
Change the color based on what you like in `label_colors.py`. Make sure that the ids of those labels match those of the COCO dataset.
62+
Also make the changes in the select-box of `app.py`.
63+
In case you want to use a different model with different datasets. Download the model from here. Use `latest_net_G.pth` for this.
64+
65+
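As a rough illustration of that conversion, the sketch below maps each painted color back to a label id. It assumes entries shaped like the hypothetical `label_colors` example above; the actual code in this repository may do this differently.

```python
import numpy as np

def doodle_to_label_map(doodle_rgb: np.ndarray, labels: list) -> np.ndarray:
    """Map each painted pixel color to its (assumed) COCO label id."""
    label_map = np.zeros(doodle_rgb.shape[:2], dtype=np.uint8)
    for entry in labels:
        # Pixels painted with this label's color get that label's id.
        mask = np.all(doodle_rgb == np.array(entry["color"]), axis=-1)
        label_map[mask] = entry["id"]
    return label_map
```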
<h2 id="ModelLoss">Model and Loss</h2>
66+
To understand the model and the different types of losses, I would suggest reading the paper here.
67+
To train on your dataset, you can follow my repository here. This follows you through how to train in google colab. You can then download the model and load it in this project.
68+
Make a few changes as mentioned above, and you will have a working frontend as well.
69+
70+
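As a quick reference before reading the paper: the core building block in SPADE is the spatially-adaptive normalization layer, which normalizes the activations with batch statistics and then modulates them with scale and bias values predicted from the segmentation map:

$$
\gamma_{c,y,x}(\mathbf{m}) \cdot \frac{h_{n,c,y,x} - \mu_c}{\sigma_c} + \beta_{c,y,x}(\mathbf{m})
$$

Here $h$ are the layer activations, $\mu_c$ and $\sigma_c$ are the per-channel mean and standard deviation, and $\gamma$ and $\beta$ are produced by small convolutions applied to the segmentation map $\mathbf{m}$.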
<h2 id="OtherExamples">Other Examples</h2>
71+
Some other results to enjoy:
72+
<h3 id="Performance">Performance of both algorithms on testing data</h3>
73+
<table>
  <tr>
    <th>Doodle Input</th>
    <th>Realistic Image</th>
  </tr>
  <tr>
    <td><img src="images/try1.png" width="420px"/></td>
    <td><img src="images/try1.jpeg" width="420px"/></td>
  </tr>
  <tr>
    <td><img src="images/try2.png" width="420px"/></td>
    <td><img src="images/try2.jpeg" width="420px"/></td>
  </tr>
</table>
<h2 id="References">References</h2>
89+
90+
[Spade Paper](https://arxiv.org/pdf/1903.07291.pdf)
91+
92+
[Spade Implementation](https://github.com/NVlabs/SPADE/tree/master)
93+
94+
[Flask implementation](https://github.com/mcheng89/gaugan)

images/gaugan.gif (1.76 MB)

images/try1.jpeg (165 KB)

images/try1.png (10.1 KB)

images/try2.jpeg (209 KB)

images/try2.png (19.2 KB)

requirements.txt

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 Pillow==9.2.0
-streamlit==1.11.1
+streamlit>=1.10.0
 streamlit-drawable-canvas==0.9.1
 torch==1.12.0
 torchvision==0.13.0

streamlit/app.py

Lines changed: 2 additions & 0 deletions
@@ -8,6 +8,7 @@


 st.set_page_config(layout="wide")
+
 # Specify canvas parameters in application
 drawing_object = st.sidebar.selectbox(
     "Object:", ("sea", "cloud", "bush", "grass", "mountain", "sky", "snow",
@@ -24,6 +25,7 @@

 stroke_color = drawing_object_dict[drawing_object]

+
 col1, col2 = st.columns(2)
 with col1:
     # Create a canvas component with different parameters
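For context, the canvas component from `streamlit-drawable-canvas` (pinned in `requirements.txt`) is typically wired up roughly as sketched below; the parameter values are illustrative assumptions, not the exact configuration used in `streamlit/app.py`.

```python
import streamlit as st
from streamlit_drawable_canvas import st_canvas

# Illustrative value only; in app.py the color comes from
# drawing_object_dict[drawing_object] selected in the sidebar.
stroke_color = "#1E90FF"

canvas_result = st_canvas(
    fill_color=stroke_color,
    stroke_color=stroke_color,
    stroke_width=50,
    background_color="#000000",
    height=512,
    width=512,
    drawing_mode="freedraw",
    key="doodle_canvas",
)

# canvas_result.image_data holds the painted RGBA array, which can then be
# mapped to label ids and fed to the GauGAN generator.
```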
