Question on multiplying constant to sphereR_H, sphereR_N and sphereR_N #18

lizhenstat · 2023-04-14T11:47:31Z

Hi, thanks for sharing the code of your great work on SphereFace Revived. I have a related question:
when using the three proposed normalization methods, why do you scaling a constant on the cross entropy loss
in sphereR_N,sphereR_H and sphereR_S.

Any help would be appreciated, thanks!

ydwen · 2023-04-14T12:46:08Z

lw is loss weight, controlling the loss scale.

lizhenstat · 2023-04-16T06:40:30Z

thanks a lot, I found an interesting answer why scaling loss does affect the training result.
It seems that scaling the loss under SGD and no regularization equals scaling the learning rate.

lizhenstat mentioned this issue Apr 16, 2023

About lw in SphereFace2 #9

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question on multiplying constant to sphereR_H, sphereR_N and sphereR_N #18

Question on multiplying constant to sphereR_H, sphereR_N and sphereR_N #18

lizhenstat commented Apr 14, 2023

ydwen commented Apr 14, 2023

lizhenstat commented Apr 16, 2023

Question on multiplying constant to sphereR_H, sphereR_N and sphereR_N #18

Question on multiplying constant to sphereR_H, sphereR_N and sphereR_N #18

Comments

lizhenstat commented Apr 14, 2023

ydwen commented Apr 14, 2023

lizhenstat commented Apr 16, 2023