densenet not training when using tf.contrib.layers.recompute_grad #21

Sirius083 · 2019-04-15T07:44:23Z

I want to implement memory efficient densenet, following the code in
https://github.com/joeyearsley/efficient_densenet_tensorflow/blob/master/models/densenet_creator.py, the traing process is stuck at first epoch
I have just changed the add_layer part

        def add_layer(l):

            def _add_layer(l):
                shape = l.get_shape().as_list()
                in_channel = shape[3]
                with tf.variable_scope(name) as scope:
                    c = BatchNorm('bn1', l)
                    c = tf.nn.relu(c)
                    c = conv('conv1', c, self.growthRate, 1)
                    l = tf.concat([c, l], 3)
                return l
            
            if self.efficient:
                _add_layer = tf.contrib.layers.recompute_grad(_add_layer)
            
            return _add_layer(l)

also add the key word argument "efficient" to specify whether use the memory efficient version.
However the training process stucked.
Using tensorflow 1.9
tensorpack 0.9.1
Do I need to change other parts in the tensorpack?
Thanks in advance

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

densenet not training when using tf.contrib.layers.recompute_grad #21

densenet not training when using tf.contrib.layers.recompute_grad #21

Sirius083 commented Apr 15, 2019

densenet not training when using tf.contrib.layers.recompute_grad #21

densenet not training when using tf.contrib.layers.recompute_grad #21

Comments

Sirius083 commented Apr 15, 2019