'''
Deep Dynamic-Routing Convolutional Capsules (DR-ConvCaps), which includes
- Dense DR-ConvCaps: it uses dense connections within the capsule section.
- Residual DR-ConvCaps: it uses residual connections within the capsule
section.
- Reduced DR-ConvCaps: it reduces the number of CNNs prior to capsules.
Original Paper by Juan P. Vigueras-Guillen (https://arxiv.org/abs/...)
Code written by: Juan P. Vigueras-Guillen
Part of the code for DR-ConvCaps is a conversion (from PyTorch to
Tensorflow 2.x) of the code made by Lei Yang (yl-1993) and Moein Hasani
(Moeinh77): https://github.com/yl-1993/Matrix-Capsules-EM-PyTorch
Yang and Hasani's code (EM routing) was transformed to DR routing and
some changes were made.
If you use significant portions of this code or the ideas from this paper,
please cite it. If you have any questions, please email me at:
J.P.ViguerasGuillen@gmail.com
This is the main file of the project for the Myeloid dataset (another
main script, for the Leukemia dataset, is also available).
From here you can train and test the network.
Certain manipulations of the DR-ConvCaps network must be done manually, both
here and in the networks file (instructions in the code). These manipulations are:
- The depth of the network (how many capsule layers to include). For this,
  change the "capsule section" below and add more layers in the network
  function (file networks.py).
- Whether to add dense or residual connections (several functions, each one
  dealing with a specific type, are already provided in networks.py).
Please see the README for detailed instructions on this project.
# DESCRIPTION OF THE DATA:
(1) We used the dataset from the paper "Human-level recognition of blast cells
in acute myeloid leukaemia with convolutional neural networks", Nature Machine
Intelligence 1, 538-544 (2019).
Link: https://www.nature.com/articles/s42256-019-0101-9
Dataset available at The Cancer Imaging Archive (TCIA):
https://doi.org/10.7937/tcia.2019.36f5o9ld
(2) This dataset contains 18365 images of leukocytes, of size 400x400 pixels.
There are 15 unbalanced classes: the largest class has 8484 images, whereas the
smallest one has only 11 images. We selected this dataset because:
(a) we can show that large cell images can be handled by DR-ConvCaps;
(b) we can evaluate how a highly unbalanced dataset is handled by DR-ConvCaps.
# DESCRIPTION OF THE TRAINING:
We tested several ways of balancing the batches. For this, we introduced the
concept of 'groups', where classes are assigned to a group and the batches are
built by randomly selecting one image from each group. For example:
(a) Each class has its own group, so a batch always contains an example of
each class (small classes will be overrepresented during training).
(b) The classes with fewer examples are grouped together.
To perform the training, we wrote our own functions to build the batches and
used the Keras functions 'train_on_batch' and 'test_on_batch'. Functions to
keep track of the loss and metrics were also created. Thus, an epoch here is
simply a specific number of iterations. (A minimal sketch of the group-based
batching is given after the imports below.)
'''
import os
import argparse
import numpy as np
from split_data import split_dataset_myeloid
from networks_myeloid import CapsNet_DR, Residual_CapsNet_DR, Dense_CapsNet_DR
from train import train_model_myeloid
from test import test_model_myeloid
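

# ---------------------------------------------------------------------------
# Minimal sketch (not used by the pipeline) of the group-based batching
# described in the docstring above. The real batch builder lives in train.py;
# 'images_per_class' is a hypothetical structure used only for illustration.
def _sketch_group_batch(images_per_class, batch_classes, rng=np.random):
    """Build one batch unit by sampling a single image from each group.

    images_per_class: list where entry c holds the images of class c.
    batch_classes:    array mapping each class index to its group index
                      (e.g. np.arange(15) gives one group per class).
    """
    batch = []
    for group in np.unique(batch_classes):
        # All classes belonging to this group.
        classes_in_group = np.flatnonzero(batch_classes == group)
        cls = rng.choice(classes_in_group)             # pick a class of the group
        idx = rng.randint(len(images_per_class[cls]))  # pick an image of that class
        batch.append(images_per_class[cls][idx])
    return batch
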
def launch_experiment(args):
    """ Given the input arguments of the experiment (args), it ...
    (1) loads the data info and splits it into train/valid/test;
    (2) builds the network, and loads weights if indicated;
    (3) trains the model (if indicated);
    (4) tests the model (if indicated).
    """
    trn_img, val_img, tst_img = split_dataset_myeloid(args)
    # Select the network type here: CapsNet_DR, Residual_CapsNet_DR, or
    # Dense_CapsNet_DR (all imported from networks_myeloid above).
    model = CapsNet_DR(args.input_shape, args=args)
    model.summary()
    if args.weights:
        model.load_weights(args.save_folder + args.weights)  # , by_name=True
    if args.train == 1:
        train_model_myeloid(model, trn_img, val_img, args)
    if args.test == 1:
        test_model_myeloid(model, tst_img, args)
    print('Experiment finished!')
if __name__ == '__main__':
    # Only the arguments that need to be changed regularly are added to the
    # parser. The rest are defined below.
    parser = argparse.ArgumentParser(description='Tensorflow-Keras DR-ConvCaps')
    parser.add_argument('--data_folder', type=str, required=True,
                        help='The directory of the data.')
    parser.add_argument('--save_folder', type=str, required=True,
                        help='The directory where to save the data.')
    parser.add_argument('--weights', type=str, default='',
                        help='The name of trained_model.h5; set to "" for '
                             'none. Uses the path from args.save_folder.')
    parser.add_argument('--epochs', type=int, default=700,
                        help='The number of epochs for training.')
    parser.add_argument('--current_epoch', type=int, default=0,
                        help='If weights are loaded, set the current epoch.')
    parser.add_argument('--iterations', type=int, default=500,
                        help='The number of iterations in one epoch.')
    parser.add_argument('--iter2eval', type=int, default=500,
                        help='The number of iterations between evaluations '
                             'on the validation set.')
    parser.add_argument('--num_folds', type=int, default=5,
                        help='The number of folds to separate the dataset.')
    parser.add_argument('--test_fold', type=int, default=1,
                        help='The index of the fold to use as test set.')
    parser.add_argument('--vald_fold', type=int, default=1,
                        help='The index of the fold to use as validation set, '
                             'but once the images from the test set have been '
                             'removed; i.e. a second K-fold split is performed.')
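    # Example: with num_folds=5 and test_fold=1, fold 1 of the first 5-fold
    # split becomes the test set; the remaining images are split again into
    # num_folds folds, and vald_fold selects the validation fold among those.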
    parser.add_argument('--train', type=int, default=1, choices=[0, 1],
                        help='Set to 1 to enable training.')
    parser.add_argument('--test', type=int, default=1, choices=[0, 1],
                        help='Set to 1 to enable testing.')
    args = parser.parse_args()
    # General model
    args.image_shape = (400, 400, 3)  # The original image size
    args.input_shape = (400, 400, 3)  # The input size to the model
    args.scale_input = None   # Float to scale the image to the input size;
                              # None or 1 for no scaling.
    args.crop_input = False   # Whether to crop the image to input_shape (centered)
    # Learning rate
    args.lr_init = 0.001
    args.lr_decay = 0.99
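    # If lr_decay is applied once per epoch (the exact schedule is defined in
    # train.py), the rate at epoch e is lr_init * lr_decay**e; e.g. ~9.0e-4
    # at epoch 10 and ~3.7e-4 at epoch 100.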
    # Capsule section
    args.cap_numbCap = [32, 8, 8, 8, 8, 15]      # Number of capsules per layer
    args.cap_sizeCap = [16, 16, 16, 16, 16, 16]  # Size of the capsules
    args.cap_Kvalues = [3, 3, 3, 3, 3, 1]        # Kernel size of the ConvCaps
    args.cap_strides = [1, 1, 1, 1, 1, 1]        # Strides in the ConvCaps
    args.cap_routing = 3                         # Number of iterations in DR
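    # Each list entry configures one capsule layer; the last entry (15
    # capsules of size 16, kernel 1) forms the class capsules, one per class.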
    # CNN section
    args.drop_rate = None
    # Batches and classes
    # * In our first design, each class has its own group (batch_classes).
    # * Alternatively, several classes can be grouped into one group.
    # * The final batch size can be increased by stacking several batch units.
    args.batch_classes = np.arange(15)  # Representation of the groups-classes
    # args.batch_classes = np.array((0, 1, 2, 1, 3, 4, 1, 5, 1, 1, 1, 1, 1, 1, 1))
    args.batch_unit = np.max(args.batch_classes) + 1
    args.batch_sets = 1
    args.batch_size = int(args.batch_unit * args.batch_sets)
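    # Example: with the default np.arange(15) every class forms its own group,
    # so batch_unit = 15; with the commented grouping above the groups are
    # {0..5}, so batch_unit = 6. A batch holds batch_sets images per group.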
    args.classes = np.arange(15)  # The labels of the classes
    args.weighting_flag = False   # Whether to apply sample weighting based ...
    args.weighting_class = {0: 1,  # ... on the classes, using this dictionary.
                            1: 1,
                            2: 1,
                            3: 1,
                            4: 1,
                            5: 1,
                            6: 1,
                            7: 1,
                            8: 1,
                            9: 1,
                            10: 1,
                            11: 1,
                            12: 1,
                            13: 1,
                            14: 1}
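    # Illustrative (hypothetical) example: with weighting_flag = True, setting
    # entry 7 to 5 would weight the Myeloblast samples five times more (how
    # the weights are applied is defined in train.py).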
    args.class_folders = ['NGS/', 'NGB/', 'LYT/', 'LYA/', 'MON/', 'EOS/',
                          'BAS/', 'MYO/', 'PMO/', 'PMB/', 'MYB/', 'MMZ/',
                          'MOB/', 'EBO/', 'KSC/']
    args.class_descrip = ['Neutrophil (segmented)', 'Neutrophil (band)',
                          'Lymphocyte (typical)', 'Lymphocyte (atypical)',
                          'Monocyte', 'Eosinophil',
                          'Basophil', 'Myeloblast',
                          'Promyelocyte', 'Promyelocyte (bilobed)',
                          'Myelocyte', 'Metamyelocyte',
                          'Monoblast', 'Erythroblast',
                          'Smudge cell']
    args.class_folders = [args.data_folder + ii for ii in args.class_folders]
    # Augmentation
    args.flag_augmentation = True
    args.batch_normalization = False
    args.batch_standardization = False
    # Saving the model and variables (we saved the model at each epoch; this
    # can be changed in train_model.py).
    args.save_model = True
    args.save_variables = True
    args.save_folder = (args.save_folder + 'Fold_'
                        + str(args.test_fold).zfill(2) + '/')
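    # Example: with test_fold=1, results are stored in '<save_folder>/Fold_01/'.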
    if not os.path.exists(args.save_folder):
        print('Creating the folder... ')
        os.makedirs(args.save_folder)
    else:
        print('Folder already exists! ')
    # -------------------------------------------------------------------------
    # Run the experiment
    launch_experiment(args)
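
# Example invocation (the paths below are placeholders):
#   python main_myeloid.py --data_folder /path/to/dataset/ \
#       --save_folder /path/to/results/ --test_fold 1 --vald_fold 1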