TL;DR:
`nn/lib/THNN/generic/BatchNormalization.c`, line 53 (commit `8726825`):
The variable `eps` is a double, so `THTensor_(get1d)(running_var, f)` is promoted to double, the whole `1 / sqrt(...)` expression is evaluated in double, and only the final result is converted back to `real`, because `invstd` is `real`. If `eps` is converted to `float` first, the output should match the expected behavior.
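A minimal numpy sketch of the mixed-precision path vs. the all-float32 path (illustrative values only; numpy's float64 promotion stands in for the implicit float-to-double promotion in the C code):

```python
import numpy as np

rng = np.random.default_rng(0)
var = rng.random(1_000_000).astype(np.float32)  # stand-in running_var values
eps = 1e-5  # a Python float, i.e. a C double, as in THNN

# Mixed path (current THNN behavior): the double eps promotes the variance
# to float64, 1 / sqrt(...) is evaluated in float64, and only the final
# result is truncated back to float32 (the cast to `real`).
invstd_promoted = (1.0 / np.sqrt(var.astype(np.float64) + eps)).astype(np.float32)

# All-float32 path (what Caffe / a plain float32 numpy port computes):
# eps is cast to float32 first, so every intermediate stays float32.
invstd_f32 = np.float32(1.0) / np.sqrt(var + np.float32(eps))

# The two disagree in the last bits for a fraction of inputs.
print("differing invstd values:", np.count_nonzero(invstd_promoted != invstd_f32))
```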
I was trying to convert a PyTorch model to Caffe on CPU. All conv layers worked flawlessly and produced equal output tensors, but on the BatchNorm layers something broke and the output was very different. After some investigation I found that the Caffe implementation produced the same output as a direct numpy equivalent in float32, but PyTorch didn't. After setting the types of the values described above to float64, I was able to reproduce the PyTorch batchnorm output with numpy.
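For reference, a sketch of the two numpy variants being compared, assuming the standard inference-mode formula `y = (x - mean) / sqrt(var + eps) * weight + bias` (the helper names and test data are made up for illustration):

```python
import numpy as np

def batchnorm_f32(x, mean, var, weight, bias, eps=1e-5):
    # All-float32 invstd: matches Caffe and a plain float32 numpy port.
    invstd = np.float32(1.0) / np.sqrt(var + np.float32(eps))
    return (x - mean) * invstd * weight + bias

def batchnorm_f64_invstd(x, mean, var, weight, bias, eps=1e-5):
    # invstd computed in float64 and cast back, mimicking the THNN path
    # described above; this variant reproduces the PyTorch output.
    invstd = (1.0 / np.sqrt(var.astype(np.float64) + eps)).astype(np.float32)
    return (x - mean) * invstd * weight + bias

rng = np.random.default_rng(0)
x = rng.standard_normal((1, 256, 8, 8)).astype(np.float32)  # NCHW input
mean = x.mean(axis=(0, 2, 3)).reshape(1, -1, 1, 1)  # per-channel stats
var = x.var(axis=(0, 2, 3)).reshape(1, -1, 1, 1)
weight = np.ones((1, 256, 1, 1), dtype=np.float32)
bias = np.zeros((1, 256, 1, 1), dtype=np.float32)

out_f32 = batchnorm_f32(x, mean, var, weight, bias)
out_f64 = batchnorm_f64_invstd(x, mean, var, weight, bias)
print("max abs diff:", np.abs(out_f32 - out_f64).max())
```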
This issue breaks direct conversion of PyTorch models to Caffe, and also seems like genuinely unexpected behavior, since everything is expected to be float32 during both training and inference.