Vanishing gradient issue in APN #13

pjj4288 · 2018-05-01T10:56:13Z

I am trying to re-implement this experiment in pytorch.
However, weights of APN(Attention Proposal Network) aren't updated because of extremely low gradients.
I think this issue is from logistic function of eq(5). It looks like a flat region of logistic function makes gradients almost zero.

In the paper, authors pretrained APN using last cnn features. Did you record the performance without this initialization?

Thank you.

Ostnie · 2018-05-23T08:35:21Z

@pjj4288 Could you please tell me how to create APN? I don't know what loss and clipping should be .

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vanishing gradient issue in APN #13

Vanishing gradient issue in APN #13

pjj4288 commented May 1, 2018 •

edited

Loading

Ostnie commented May 23, 2018

Vanishing gradient issue in APN #13

Vanishing gradient issue in APN #13

Comments

pjj4288 commented May 1, 2018 • edited Loading

Ostnie commented May 23, 2018

pjj4288 commented May 1, 2018 •

edited

Loading