brucechou1983
diff --git a/‎.gitignore
Lines changed: 3 additions & 1 deletion b/‎.gitignore
Lines changed: 3 additions & 1 deletion
diff --git a/‎README.md
Lines changed: 16 additions & 33 deletions b/‎README.md
Lines changed: 16 additions & 33 deletions
diff --git a/‎augmenter.py
Lines changed: 8 additions & 0 deletions b/‎augmenter.py
Lines changed: 8 additions & 0 deletions
diff --git a/‎callback.py
Lines changed: 6 additions & 24 deletions b/‎callback.py
Lines changed: 6 additions & 24 deletions
diff --git a/‎cam.py
Lines changed: 147 additions & 0 deletions b/‎cam.py
Lines changed: 147 additions & 0 deletions
diff --git a/‎cam_example.png
770 KB b/‎cam_example.png
770 KB
@@ -5,5 +5,7 @@ densenet121_weights_tf.h5
 /.idea
 /experiments
 /data/Data_Entry_2017.csv
+/data/BBox_List_2017.csv
 venv
-config.ini
+config.ini
+*.pdf
@@ -1,50 +1,33 @@
 # ChexNet-Keras
 This project is a tool to build CheXNet-like models, written in Keras.
 
-<img width="450" height="450" src="https://stanfordmlgroup.github.io/projects/chexnet/img/chest-cam.png" alt="CheXNet from Stanford ML Group"/>
+<img width="450" height="450" src="cam_example.png" alt="CAM example image"/>
 
-## System Requirements
-1. Tensorflow_GPU == 1.4 (CUDA 8)
-2. Keras == 2.1.4
-3. numpy
-4. opencv-python (i.e. cv2) ==3.3
-5. At least one Nvidia 1080Ti GPU to enable batch_size = 32
-
-### Important notice on CUDA users
-If you use >= CUDA 9.1, please modify requirements.txt, such that tensorflow_gpu == 1.5
- 
 ## What is [CheXNet](https://arxiv.org/pdf/1711.05225.pdf)?
-ChexNet is a deep learning algorithm that can detect and localize 14 kinds of diseases from chest X-ray images. As described in the paper, a 121-layer densely connected convolutional neural network is trained on ChestX-ray14 dataset, which contains 112,120 frontal view X-ray images from 30,805 unique patients. The result is so good that it surpasses the performance of practicing radiologists.
+ChexNet is a deep learning algorithm that can detect and localize 14 kinds of diseases from chest X-ray images. As described in the paper, a 121-layer densely connected convolutional neural network is trained on ChestX-ray14 dataset, which contains 112,120 frontal view X-ray images from 30,805 unique patients. The result is so good that it surpasses the performance of practicing radiologists. If you are new to this project, [Luke Oakden-Rayner's post](https://lukeoakdenrayner.wordpress.com/2017/12/18/the-chestxray14-dataset-problems/) is highly recommended.
 
 ## In this project, you can
 1. Train/test a **baseline model** by following the quickstart. You can get a model with performance close to the paper.
-2. Modify `multiply` and `use_class_balancing` parameters in `config.ini` to see if you can get better performance.
-3. Modify `weights.py` to customize your weights in loss function.
-4. Every time you do a new experiment, make sure you modify `output_dir` in `config.ini` otherwise previous training results might be overwritten. For more options check the parameter description in `config.ini`.
+2. Run class activation mapping to see the localization of your model.
+3. Modify `multiply` parameter in `config.ini` or design your own class weighting to see if you can get better performance.
+4. Modify `weights.py` to customize your weights in loss function. If you find something useful, feel free to make that an option and fire a PR.
+5. Every time you do a new experiment, make sure you modify `output_dir` in `config.ini` otherwise previous training results might be overwritten. For more options check the parameter description in `config.ini`.
 
 ## Quickstart
 **Note that currently this project can only be executed in Linux and macOS. You might run into some issues in Windows.**
-1. Download **all tar files** and **Data_Entry_2017.csv** of ChestX-ray14 dataset from [NIH dropbox](https://nihcc.app.box.com/v/ChestXray-NIHCC). Put them under `./data` folder and untar all tar files.
-2. Download DenseNet-121 ImageNet tensorflow pretrained weights from [DenseNet-Keras](https://drive.google.com/open?id=0Byy2AcGyEVxfSTA4SHJVOHNuTXc). Specify the file path in `config.ini` (field: `base_model_weights_file`)
-3. Create & source a new virtualenv. Python >= **3.6** is required.
-4. Install dependencies by running `pip3 install -r requirements.txt`.
-5. Copy sample_config.ini to config.ini, you may customize `batch_size` and training parameters here. Try to set `patience_reduce_lr` to 2 or 3 in the early training phase. Please note config.ini must exist before training and testing 
-6. Run `python train.py` to train a new model. If you want to run the training using multiple GPUs, just prepend `CUDA_VISIBLE_DEVICES=0,1,...` to restrict the GPU devices. `nvidia-smi` command will be helpful if you don't know which device are available.
-7. Run `python test.py` to test the model.
-
-## CAM
-Reference: [Grad-CAM](https://arxiv.org/pdf/1610.02391). CAM image is generated as accumumlated weighted activation before last global average pooling (GAP) layer. It is scaled up to 224\*224 to match original image.
-```
-python test.py
-```
-CAM images will be generated into $pwd/imgdir, please make sure you've created the target directory before running test.py
+1. Download **all tar files**, **Data_Entry_2017.csv** and **BBox_List_2017.csv** of ChestX-ray14 dataset from [NIH dropbox](https://nihcc.app.box.com/v/ChestXray-NIHCC). Put them under `./data` folder and untar all tar files.
+2. Create & source a new virtualenv. Python >= **3.6** is required.
+3. Install dependencies by running `pip3 install -r requirements.txt`.
+4. Copy sample_config.ini to config.ini, you may customize `batch_size` and training parameters here. Make sure config.ini is configured before you run training or testing
+5. Run `python train.py` to train a new model. If you want to run the training using multiple GPUs, just prepend `CUDA_VISIBLE_DEVICES=0,1,...` to restrict the GPU devices. `nvidia-smi` command will be helpful if you don't know which device are available.
+6. Run `python test.py` to evaluate your model on the test set.
+7. Run `python cam.py` to generate images with class activation mapping overlay and the ground bbox. The ground truth comes from the **BBox_List_2017.csv** file so make sure you have that file in `./data` folder. CAM images will be placed under the output folder.
 
-Guided back-prop is still an enhancement item.
-
-The function is merged into test.py so you wouldn't need test_cam.py anymore. The script will use argmax to plot CAM of the most probable diagnosis only. This version does not support multi-labeled instance at this point.
+### Important notice on CUDA users
+If you use >= CUDA 9, make sure you set tensorflow_gpu >= 1.5.
 
 ## TODO
-1. More baseline models
+1. Frontend
 
 ## Acknowledgement
 I would like to thank Pranav Rajpurkar (Stanford ML group) and Xinyu Weng (北京大學) for sharing their experiences on this task. Also I would like to thank Felix Yu for providing DenseNet-Keras source code.
 
@@ -0,0 +1,8 @@
+from imgaug import augmenters as iaa
+
+augmenter = iaa.Sequential(
+    [
+        iaa.Fliplr(0.5),
+    ],
+    random_order=True,
+)
@@ -8,32 +8,14 @@
 from sklearn.metrics import roc_auc_score
 
 
-def load_generator_data(generator, steps, class_num):
-    """
-    Return some data collected from a generator, use this to ensure all images
-    are processed by exactly the same steps in the customized ImageDataGenerator.
-
-    """
-    batches_x = []
-    batches_y_classes = []
-    for i in range(class_num):
-        batches_y_classes.append([])
-    for i in range(steps):
-        batch_x, batch_y = next(generator)
-        batches_x.append(batch_x)
-        for c, batch_y_class in enumerate(batch_y):
-            batches_y_classes[c].append(batch_y_class)
-    return np.concatenate(batches_x, axis=0), [np.concatenate(c, axis=0) for c in batches_y_classes]
-
-
 class MultipleClassAUROC(Callback):
     """
     Monitor mean AUROC and update model
     """
-    def __init__(self, generator, steps, class_names, weights_path, stats=None):
+    def __init__(self, sequence, class_names, weights_path, stats=None, workers=1):
         super(Callback, self).__init__()
-        self.generator = generator
-        self.steps = steps
+        self.sequence = sequence
+        self.workers = workers
         self.class_names = class_names
         self.weights_path = weights_path
         self.best_weights_path = os.path.join(
@@ -73,14 +55,14 @@ def on_epoch_end(self, epoch, logs={}):
         y_hat shape: (#samples, len(class_names))
         y: [(#samples, 1), (#samples, 1) ... (#samples, 1)]
         """
-        x, y = load_generator_data(self.generator, self.steps, len(self.class_names))
-        y_hat = self.model.predict(x)
+        y_hat = self.model.predict_generator(self.sequence, workers=self.workers)
+        y = self.sequence.get_y_true()
 
         print(f"*** epoch#{epoch + 1} dev auroc ***")
         current_auroc = []
         for i in range(len(self.class_names)):
             try:
-                score = roc_auc_score(y[i], y_hat[i])
+                score = roc_auc_score(y[:, i], y_hat[:, i])
             except ValueError:
                 score = 0
             self.aurocs[self.class_names[i]].append(score)
 
@@ -0,0 +1,147 @@
+import cv2
+import numpy as np
+import os
+import pandas as pd
+from configparser import ConfigParser
+from generator import AugmentedImageSequence
+from models.keras import ModelFactory
+from keras import backend as kb
+
+
+def get_output_layer(model, layer_name):
+    # get the symbolic outputs of each "key" layer (we gave them unique names).
+    layer_dict = dict([(layer.name, layer) for layer in model.layers])
+    layer = layer_dict[layer_name]
+    return layer
+
+
+def create_cam(df_g, output_dir, image_source_dir, model, generator, class_names):
+    """
+    Create a CAM overlay image for the input image
+
+    :param df_g: pandas.DataFrame, bboxes on the same image
+    :param output_dir: str
+    :param image_source_dir: str
+    :param model: keras model
+    :param generator: generator.AugmentedImageSequence
+    :param class_names: list of str
+    """
+    file_name = df_g["file_name"]
+    print(f"process image: {file_name}")
+
+    # draw bbox with labels
+    img_ori = cv2.imread(filename=os.path.join(image_source_dir, file_name))
+
+    label = df_g["label"]
+    if label == "Infiltrate":
+        label = "Infiltration"
+    index = class_names.index(label)
+
+    output_path = os.path.join(output_dir, f"{label}.{file_name}")
+
+    img_transformed = generator.load_image(file_name)
+
+    # CAM overlay
+    # Get the 512 input weights to the softmax.
+    class_weights = model.layers[-1].get_weights()[0]
+    final_conv_layer = get_output_layer(model, "bn")
+    get_output = kb.function([model.layers[0].input], [final_conv_layer.output, model.layers[-1].output])
+    [conv_outputs, predictions] = get_output([np.array([img_transformed])])
+    conv_outputs = conv_outputs[0, :, :, :]
+
+    # Create the class activation map.
+    cam = np.zeros(dtype=np.float32, shape=(conv_outputs.shape[:2]))
+    for i, w in enumerate(class_weights[index]):
+        cam += w * conv_outputs[:, :, i]
+    # print(f"predictions: {predictions}")
+    cam /= np.max(cam)
+    cam = cv2.resize(cam, img_ori.shape[:2])
+    heatmap = cv2.applyColorMap(np.uint8(255 * cam), cv2.COLORMAP_JET)
+    heatmap[np.where(cam < 0.2)] = 0
+    img = heatmap * 0.5 + img_ori
+
+    # add label & rectangle
+    # ratio = output dimension / 1024
+    ratio = 1
+    x1 = int(df_g["x"] * ratio)
+    y1 = int(df_g["y"] * ratio)
+    x2 = int((df_g["x"] + df_g["w"]) * ratio)
+    y2 = int((df_g["y"] + df_g["h"]) * ratio)
+    cv2.rectangle(img, (x1, y1), (x2, y2), (255, 0, 0), 2)
+    cv2.putText(img, text=label, org=(5, 20), fontFace=cv2.FONT_HERSHEY_SIMPLEX,
+                fontScale=0.8, color=(0, 0, 255), thickness=1)
+    cv2.imwrite(output_path, img)
+
+
+def main():
+    # parser config
+    config_file = "./config.ini"
+    cp = ConfigParser()
+    cp.read(config_file)
+
+    # default config
+    output_dir = cp["DEFAULT"].get("output_dir")
+    base_model_name = cp["DEFAULT"].get("base_model_name")
+    class_names = cp["DEFAULT"].get("class_names").split(",")
+    image_source_dir = cp["DEFAULT"].get("image_source_dir")
+    image_dimension = cp["TRAIN"].getint("image_dimension")
+
+    # parse weights file path
+    output_weights_name = cp["TRAIN"].get("output_weights_name")
+    weights_path = os.path.join(output_dir, output_weights_name)
+    best_weights_path = os.path.join(output_dir, f"best_{output_weights_name}")
+
+    # CAM config
+    bbox_list_file = cp["CAM"].get("bbox_list_file")
+    use_best_weights = cp["CAM"].getboolean("use_best_weights")
+
+    print("** load model **")
+    if use_best_weights:
+        print("** use best weights **")
+        model_weights_path = best_weights_path
+    else:
+        print("** use last weights **")
+        model_weights_path = weights_path
+    model_factory = ModelFactory()
+    model = model_factory.get_model(
+        class_names,
+        model_name=base_model_name,
+        use_base_weights=False,
+        weights_path=model_weights_path)
+
+    print("read bbox list file")
+    df_images = pd.read_csv(bbox_list_file, header=None, skiprows=1)
+    df_images.columns = ["file_name", "label", "x", "y", "w", "h"]
+
+    print("create a generator for loading transformed images")
+    cam_sequence = AugmentedImageSequence(
+        dataset_csv_file=os.path.join(output_dir, "test.csv"),
+        class_names=class_names,
+        source_image_dir=image_source_dir,
+        batch_size=1,
+        target_size=(image_dimension, image_dimension),
+        augmenter=None,
+        steps=1,
+        shuffle_on_epoch_end=False,
+    )
+
+    image_output_dir = os.path.join(output_dir, "cam")
+    if not os.path.isdir(image_output_dir):
+        os.makedirs(image_output_dir)
+
+    print("create CAM")
+    df_images.apply(
+        lambda g: create_cam(
+            df_g=g,
+            output_dir=image_output_dir,
+            image_source_dir=image_source_dir,
+            model=model,
+            generator=cam_sequence,
+            class_names=class_names,
+        ),
+        axis=1,
+    )
+
+
+if __name__ == "__main__":
+    main()