Jax gradients of stateful flax operations (batchnorm) #339
Unanswered · asked by virajmehta in General
Replies: 2 comments
-
Never mind, I figured it out, but I think this may be an area where the documentation could be improved. Thanks!
-
Hi Viraj, I converted this issue to a conversation -- do you mind sharing how you ended up solving your issue so that others can also benefit?
-
Hi,
I’d like to use batch norm while training normalizing flows, along with other things. It seems that your implementation of batch norm (and mine) needs state that is held outside of the main training loop (the batch statistics). However, I’m running into trouble: when I take JAX gradients, Flax throws
ValueError: Stateful operations are not allowed when the Collection is created outside of the current Jax transformation
The docs only show batch norm in an inference setting; I don’t see an example of it being used inside a training loop. Can you please advise on the correct workaround?
This also doubles as a documentation suggestion: if it wasn’t clear to me, it may be unclear to others. Thanks!