Custom training routines #3

Open: wants to merge 18 commits into base: master
Conversation

@dribnet (Contributor) commented Feb 17, 2016

This series of commits adds two new experiments that act as general-purpose tools, as discussed in #2 (which this pull request is meant to replace). It adds more generic versions of train_classifier and train_vae, intended to be compatible with the celeba versions but with many additional options (see the sketch after the list below).

  • Command line options for things that were hard coded like
    • --oldmodel to start training from a previously saved state
    • --classifier to use a different classifier filename
    • --model to use a different model filename
    • --batch-size to use different batch-sizes (generally smaller for memory reasons)
    • --z-dim to change network architecture allowing different sized latent space
    • --monitor-every and --checkpoint-every to change frequency of those events
  • Split out discriminative_term in the cost function so it could be monitored separately
  • Ability to scale relative cost of reconstruction, kl, and discriminative
  • Classifier can train on a subset of labels given at runtime
  • Can train against compatible fuel datasets other than CelebA
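For reference, a minimal sketch of how these options could be declared with argparse. This is not the PR's actual parser; the defaults shown come from the commit notes below where stated, and are otherwise illustrative assumptions.

  import argparse

  parser = argparse.ArgumentParser(description="generic classifier/VAE training")
  parser.add_argument("--oldmodel", default=None,
                      help="resume training from a previously saved state")
  parser.add_argument("--classifier", default="celeba_classifier.zip",
                      help="classifier filename")
  parser.add_argument("--model", default="celeba_vae_regularization.zip",
                      help="model filename")
  parser.add_argument("--batch-size", type=int, default=100,
                      help="size of all mini-batches")
  parser.add_argument("--z-dim", type=int, default=1000,
                      help="dimensionality of the latent space")
  parser.add_argument("--monitor-every", type=int, default=1,
                      help="epochs between monitoring passes (assumed default)")
  parser.add_argument("--checkpoint-every", type=int, default=1,
                      help="epochs between checkpoints (assumed default)")
  args = parser.parse_args()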

Dropped the checkpointing and monitoring interval in
train_celeba_classifier.py from every 5 epochs to every epoch,
because the model converges so quickly that even 5 seems like overkill.
The training can now be done on a subset of the 40 available
CelebA labels. For example, to train only on "lipstick" and
"big lips", just provide the command line option with numbers
that match those columns:

  --allow 6,36

And all other labels will be zeroed out by a Transformer.
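A hedged sketch of that idea, assuming a fuel AgnosticSourcewiseTransformer applied to the targets source; the class and argument names here are illustrative, not necessarily the PR's.

  import numpy
  from fuel.transformers import AgnosticSourcewiseTransformer

  class ZeroDisallowedLabels(AgnosticSourcewiseTransformer):
      """Zero out every label column except those listed in `allowed`."""
      def __init__(self, data_stream, allowed, **kwargs):
          self.allowed = allowed
          super(ZeroDisallowedLabels, self).__init__(
              data_stream, data_stream.produces_examples, **kwargs)

      def transform_any_source(self, source_data, source_name):
          # only touch the labels; pass features and anything else through
          if source_name != 'targets':
              return source_data
          mask = numpy.zeros(source_data.shape[-1], dtype=source_data.dtype)
          mask[list(self.allowed)] = 1
          return source_data * mask

  # --allow 6,36 would then translate to something like:
  # stream = ZeroDisallowedLabels(stream, allowed=[6, 36])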

Also added the ability to specify other options on the command line:

  --classifier for output file, defaults to "celeba_classifier.zip"
  --batch-size for size of all mini-batches, defaults to 100
Added cli options:

  --classifier for input classifier, defaults to "celeba_classifier.zip"
  --model for output model, defaults to "celeba_vae_regularization.zip"
  --batch-size for all mini-batches, defaults to 100
  --z-dim to control latent dimensionality, defaults to 1000
Added three scaling factors with command line options to
change the relative weighting of the loss function:
reconstruction_factor, kl_factor, and discriminative_factor.
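In pseudocode terms, the combined cost presumably looks something like the sketch below; the variable names are illustrative, and in the actual scripts these would be Theano expressions.

  def weighted_cost(reconstruction_term, kl_term, discriminative_term,
                    reconstruction_factor=1.0, kl_factor=1.0,
                    discriminative_factor=1.0):
      """Combine the three cost terms using the command-line weights."""
      return (reconstruction_factor * reconstruction_term +
              kl_factor * kl_term +
              discriminative_factor * discriminative_term)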
Added cli options --monitor-every and --checkpoint-every which
change how often monitoring and checkpointing occur during training.
Added options to allow swapping out the celeba dataset with
any other fuel compatible dataset with 64x64 features. If
the dataset is grayscale instead of color, the color-convert
option can also be used to dynamically transform the
data into color as it comes in.
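A rough sketch of what such a color conversion has to do per batch, assuming channel-first (batch, 1, 64, 64) arrays; the layout and function name are assumptions, not the PR's code.

  import numpy

  def grayscale_to_color(batch):
      """Replicate a single grayscale channel into three identical channels."""
      # (batch, 1, 64, 64) -> (batch, 3, 64, 64)
      return numpy.repeat(batch, 3, axis=1)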

Note that for train_celeba_classifier the custom fuel dataset
must also have compatible targets, which is unlikely to already
be the case for common datasets but is certainly possible with
some customization. However, for train_celeba_vae any fuel
dataset with 64x64 features can be used, either with
regularize left off or by using an existing celeba classifier
or any other classifier trained on data close enough to
the target dataset.
Added an option to start training from a previous model
checkpoint. Note that this also involved a change to MainLoop
to add the model, which sadly means that previously saved
checkpoints are not usable.

Also updated train_monitoring options to set
before_first_epoch=True as this is useful to verify
that the checkpoint was loaded successfully.
Added --oldmodel to start training from a previously saved
model state.
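A hedged sketch of how such a resume step could look with blocks; the helper name and the "old.zip" path are placeholders, and the serialization details may differ from the PR.

  from blocks.serialization import load

  def initialize_from_checkpoint(model, path="old.zip"):
      """Copy parameter values from a previously saved main loop into `model`."""
      with open(path, "rb") as src:
          old_main_loop = load(src)
      # transplant the old parameters into the freshly built model
      model.set_parameter_values(old_main_loop.model.get_parameter_values())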

Increased the width of the labels from 40 to 64 to support
different datasets. Will need to write a transformer to
deal with padding out shorter label vectors.
Moved the allowed option used to filter labels from
create_celeba_streams to create_custom_streams.
@@ -117,6 +124,79 @@ def create_svhn_streams(training_batch_size, monitoring_batch_size):
monitoring_batch_size)


class Colorize(AgnosticSourcewiseTransformer):
    def __init__(self, data_stream, **kwargs):
@vdumoulin (Owner) commented on this diff:
Could you add a docstring explaining what this transformer does?

@vdumoulin (Owner)

@dribnet I did a first review pass. Could you also flake8 the files and fix issues that might have been introduced by your changes?

@dribnet (Contributor, Author) commented Feb 18, 2016

Thanks @vdumoulin for the constructive feedback, glad to hear you think overall this would be a welcome addition. I'll be reviewing and updating this PR over the next week.

Replaced block of custom code with fuel.utils.find_in_data_path
and added comments to several functions.
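For reference, a typical use of that fuel helper; the filename below is only an illustration.

  from fuel.utils import find_in_data_path

  # looks the file up in the directories listed in Fuel's data_path setting
  dataset_path = find_in_data_path("celeba_64.hdf5")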
Formatting updates after running flake8 for better legibility.
If the incoming dataset doesn't provide enough labels for standard
training, they can be zero-padded with the --stretch option.
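A small sketch of the padding --stretch implies, assuming the labels arrive as a (batch, n_labels) array and the target width is 64; the names are illustrative.

  import numpy

  def stretch_labels(labels, width=64):
      """Zero-pad a (batch, n_labels) label array out to (batch, width)."""
      pad = width - labels.shape[-1]
      return numpy.pad(labels, [(0, 0), (0, pad)], mode="constant")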
@dribnet (Contributor, Author) commented Feb 24, 2016

@vdumoulin - I've addressed most issues in the previous review including a general flake8 code cleanup.
