
Implementation in pytorch #15

Open
jeong-tae opened this issue May 23, 2018 · 22 comments

Comments

@jeong-tae
Hi,

I am working on an implementation to reproduce this paper in PyTorch, but I am stuck on step 2, pre-training the APN network.

The original code gives no details about how the APN is learned in step 2, nor about the convergence condition: if the loss fluctuates forever, when should I stop training?

Has anyone made progress reproducing this? The released test code alone is not enough to reproduce the results. And how can we try RA-CNN on other public datasets?

If anyone is interested in reproducing this, please contact me and we can discuss the training details further.

@Ostnie
Ostnie commented May 23, 2018

@jeong-tae Hi, I'm also working on an implementation to reproduce this paper, in TensorFlow, and I also have some trouble with the APN. For your question, I think we should use early stopping during training.

Besides this, I have a doubt about the APN. As I understand it, the input is a batch of images and we get a set of points (tx, ty, tl) for the attended area, so should we use these three values to crop the current batch of images for training? If so, when can we move on to the next batch of data?
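The cropping step in question could be sketched like this. This is only a hedged sketch of my reading of the thread, not the paper's code: `crop_batch`, the tensor shapes, and treating `tl` as the half side length are all my assumptions.

```python
import torch
import torch.nn.functional as F

def crop_batch(images, boxes, out_size=224):
    """Hard-crop each image around its predicted (tx, ty, tl) square
    (tl assumed to be the half side length) and resize to a fixed size."""
    crops = []
    for img, box in zip(images, boxes):
        tx, ty, tl = [float(v) for v in box]
        _, h, w = img.shape
        # clamp the square to the image bounds
        x0 = int(max(tx - tl, 0.0)); x1 = int(min(tx + tl, w))
        y0 = int(max(ty - tl, 0.0)); y1 = int(min(ty + tl, h))
        patch = img[:, y0:y1, x0:x1].unsqueeze(0)
        crops.append(F.interpolate(patch, size=(out_size, out_size),
                                   mode='bilinear', align_corners=False))
    return torch.cat(crops, dim=0)

# each scale consumes a crop of the *same* batch, so the next batch is
# only loaded after all scales have processed the current one
imgs = torch.rand(2, 3, 448, 448)
boxes = torch.tensor([[224.0, 224.0, 112.0], [200.0, 240.0, 96.0]])
print(crop_batch(imgs, boxes).shape)  # torch.Size([2, 3, 224, 224])
```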

@jeong-tae
Author

@Ostnie I think we use the points to crop the current batch. The points describe the current image, so it must be that way. I am not sure what is confusing you.

I actually did use early stopping for the APN pre-training, but when should it trigger? The loss does not converge well.

@Ostnie
Ostnie commented May 23, 2018

@jeong-tae As you said, we crop the current image and send it to the VGG19, then we use its loss to update the APN parameters. Then we get three new points; should we still repeat the steps above?

I'm really confused about the loss of the APN; I'm not sure how to calculate it. I guess it depends on the classification output of VGG19. As in formula (8), loss = rank loss + cross-entropy loss, is that right?

@jeong-tae
Author

Following the paper, we should repeat this two times. The losses are not backpropagated together: the rank loss is for the APN, and the cross-entropy loss is for the conv layers/classifier.

As the authors said, they should be calculated in an alternating way.
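The alternating scheme described above might look like this toy sketch. All module names and sizes here are hypothetical stand-ins, not the paper's architecture; the point is only the split of the two losses across two optimizers.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# toy stand-ins (hypothetical sizes, not the paper's networks):
cls1 = nn.Linear(8, 4)   # scale-1 conv/classifier
cls2 = nn.Linear(8, 4)   # scale-2 conv/classifier
apn = nn.Linear(8, 1)    # APN head; its sigmoid output reweights the
                         # scale-2 input as a crude stand-in for the crop

opt_cls = torch.optim.SGD(
    list(cls1.parameters()) + list(cls2.parameters()), lr=0.1)
opt_apn = torch.optim.SGD(apn.parameters(), lr=0.01)
ce = nn.CrossEntropyLoss()

def rank_loss(p1, p2, labels, margin=0.05):
    # hinge: the true-class probability should grow from scale 1 to scale 2
    pt1 = p1.softmax(1).gather(1, labels[:, None]).squeeze(1)
    pt2 = p2.softmax(1).gather(1, labels[:, None]).squeeze(1)
    return (pt1 - pt2 + margin).clamp(min=0).mean()

x = torch.randn(16, 8)
y = torch.randint(0, 4, (16,))

for step in range(3):
    # step A: cross-entropy updates only the classifiers (APN detached)
    att = apn(x).sigmoid().detach()
    loss_cls = ce(cls1(x), y) + ce(cls2(x * att), y)
    opt_cls.zero_grad(); loss_cls.backward(); opt_cls.step()

    # step B: rank loss updates only the APN; scale-1 output is detached
    # and only opt_apn steps, so the classifier weights stay fixed
    att = apn(x).sigmoid()
    loss_rank = rank_loss(cls1(x).detach(), cls2(x * att), y)
    opt_apn.zero_grad(); opt_cls.zero_grad()
    loss_rank.backward(); opt_apn.step()
```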

@Ostnie
Ostnie commented May 24, 2018

@jeong-tae Yes, you are right. Then I have some doubts about the rank loss: is it calculated from the output of the softmax layers in VGG19? I find that strange, because a loss also carries information about its own network's parameters. Can we use VGG's loss to modify the APN? I don't know how to do this; could you please show me some code for it?

@jeong-tae
Author

Yes, it is. You can use the output of the softmax layer. I calculated the loss like this:

rank_loss = (pred[i] - pred[i + 1] + 0.05).clamp(min=0)  # hinge with margin 0.05

where pred[i] is the softmax probability of the true class at scale i. Why can't we use a loss that contains the network parameters?

I think the purpose of the rank loss is to close the gap between the scales' performances. By doing this, the APN will propose a more precise region to increase the performance at each scale.

@Ostnie
Ostnie commented May 24, 2018

When I learned the backpropagation algorithm, I learned that a loss is not just a number showing the difference between the prediction and the truth; through its computation graph it also carries the impact of each parameter on the final loss. If we use the loss value of VGG, then that loss does not contain APN information: although they share most layers, the last few fully connected layers are independent of each other. In other words, if you give me a VGG loss value and ask me to backpropagate it to optimize the parameters of the APN, I don't think it can be done.

I may be wrong, but based on the backpropagation algorithm as I have derived it, I really cannot understand this method.

@jeong-tae
Author

The rank loss is the gap between VGG1 and VGG2. You can think of it as meta-learning that teaches the difference between two networks (in this case VGG1 and VGG2). The gap arises from attending at different scales, so the APN learns where we should focus. If the gap is large enough, the APN will try to reduce it by proposing an attention region.

@Ostnie
Ostnie commented May 24, 2018

@jeong-tae This makes me confused. It seems to be right, but how can I backpropagate VGG's loss to the APN? I can't understand it, and it really upsets me.

In TensorFlow, I don't know how to set the APN's loss to VGG's loss. Could you please show me how PyTorch accomplishes this step?

@jeong-tae
Author

Oh, you mean backpropagation for the APN? I actually implemented the backward pass following the Caffe code, which is in the attention crop layer.

I will finish the code soon and make it public. Then you can see the whole process as well!
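The reason gradients can reach the APN at all is that the attention crop in RA-CNN is a soft mask built from shifted sigmoids, so the crop is differentiable with respect to (tx, ty, tl). A minimal sketch of that idea (the function name and the sharpness constant k are my choices, not the paper's):

```python
import torch

def attention_mask(h, w, tx, ty, tl, k=10.0):
    """Soft boxcar mask: a product of differences of shifted sigmoids,
    approximately 1 inside the (tx, ty, tl) square and 0 outside, and
    differentiable w.r.t. tx, ty, tl."""
    ys = torch.arange(h, dtype=torch.float32)[:, None]
    xs = torch.arange(w, dtype=torch.float32)[None, :]
    mx = torch.sigmoid(k * (xs - (tx - tl))) - torch.sigmoid(k * (xs - (tx + tl)))
    my = torch.sigmoid(k * (ys - (ty - tl))) - torch.sigmoid(k * (ys - (ty + tl)))
    return my * mx

tx = torch.tensor(100.0, requires_grad=True)
ty = torch.tensor(120.0, requires_grad=True)
tl = torch.tensor(40.0, requires_grad=True)

img = torch.rand(3, 224, 224)
mask = attention_mask(224, 224, tx, ty, tl)
attended = img * mask            # gradients flow to tx, ty, tl via the mask
attended.sum().backward()
print(tl.grad is not None)       # True
```

Because the classification loss is computed on the masked (then upsampled) region, its gradient passes through the mask into the APN outputs, which is what the custom backward in the attention crop layer implements.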

@Ostnie
Ostnie commented May 24, 2018

@jeong-tae This library may help you: https://github.com/Charleo85/DeepCar (written in PyTorch).

@jeong-tae
Author

@Ostnie Oh, very nice, thanks!

@jeong-tae
Author

@Ostnie I published the code and need some help. If you are still interested in an implementation in another framework, come to https://github.com/jeong-tae/RACNN-pytorch and let's work together.

@Ostnie
Ostnie commented May 29, 2018

@jeong-tae Oh, great, I will study it soon. I'm not familiar with PyTorch, but let's have a try first!

@jackshaw
Hi @jeong-tae, I'm trying to reproduce RA-CNN too. I have a doubt about the data preprocessing: in PyTorch, image pixels are rescaled to the range 0 to 1, which is different from Caffe's 0 to 255. Do you think this difference will influence the performance?

@jeong-tae
Author

@jackshaw Hello, I am not sure what you mean. Do you mean normalization, or mean subtraction? Whichever you do, it will probably not matter too much, but it does influence the performance somewhat.

https://stackoverflow.com/questions/4674623/why-do-we-have-to-normalize-the-input-for-an-artificial-neural-network
This reply will help you understand data preprocessing.

@jackshaw
@jeong-tae Thanks very much for your reply. Did you ever try the available Caffe pre-trained model? I can only get 74% accuracy, far from 85%. I think I must be missing some important details when preparing my test data, but I cannot figure out which. I just resized the shortest side of each image and then converted the resized images to LMDB format.

@jeong-tae
Author

Nope, I didn't. In PyTorch there is image-resize preprocessing like that used in the paper. You can easily find it in the PyTorch docs.

@bluemandora
@jeong-tae I think step 2 is something like:

  1. Initialize the network with VGG pre-trained on ImageNet.
  2. Forward-propagate the images and get the feature maps after conv5_4.
  3. Find a square (x, y, l) with half the side length of the original image that maximizes the sum of values in the corresponding area of the feature map.
  4. Train the APN network (only the APN part) with the ground truth (x, y, l) and a loss such as MSE.
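The search in step 3 could be sketched as below. This is a hedged sketch: the function name, operating on a channel-summed response map, and using an integral image for the window sums are my choices; the returned (x, y, l) would then be the MSE target for the APN in step 4.

```python
import torch

def proposal_ground_truth(fmap, half_len):
    """Scan all squares of side 2*half_len over a 2-D response map
    (e.g. conv5_4 summed over channels) and return the centre (x, y)
    and half length l of the square with the largest summed activation."""
    H, W = fmap.shape
    side = 2 * half_len
    # integral image so each window sum is O(1)
    ii = torch.zeros(H + 1, W + 1)
    ii[1:, 1:] = fmap.cumsum(0).cumsum(1)
    best, best_xy = None, None
    for y0 in range(H - side + 1):
        for x0 in range(W - side + 1):
            s = (ii[y0 + side, x0 + side] - ii[y0, x0 + side]
                 - ii[y0 + side, x0] + ii[y0, x0])
            if best is None or s > best:
                best, best_xy = s, (x0 + half_len, y0 + half_len)
    return best_xy[0], best_xy[1], half_len

# a bright blob in a toy 14x14 response map
fm = torch.zeros(14, 14)
fm[8:12, 2:6] = 1.0
print(proposal_ground_truth(fm, 3))  # (3, 9, 3)
```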

@jeong-tae
Author

I think so too, exactly the same! I tried that way, but I couldn't reproduce the result. I will try again soon.

@lmy418lmy
Could you send me the Caffe source code?

@flash1803
@jeong-tae I think step 2 is something like:

  1. Initialize the network with VGG pre-trained on ImageNet.
  2. Forward-propagate the images and get the feature maps after conv5_4.
  3. Find a square (x, y, l) with half the side length of the original image that maximizes the sum of values in the corresponding area of the feature map.
  4. Train the APN network (only the APN part) with the ground truth (x, y, l) and a loss such as MSE.

How can I get the ground truth (x, y, l)?
