This repository has been archived by the owner on Apr 4, 2024. It is now read-only.

Solution to loss explosion #13

Open
fkeufss opened this issue Jul 30, 2022 · 3 comments
Comments

@fkeufss

fkeufss commented Jul 30, 2022

Thank you for sharing your code. While running it, I also encountered the loss explosion problem. Do you know the underlying reason for it? Is there a better solution than manually restarting training with a lower learning rate each time?

@TomTomTommi
Owner

Hi, thanks for your interest. This problem does occur frequently and deserves further study, but I have not yet analyzed its root cause.
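The repository does not appear to automate this workaround, but the restart-with-lower-learning-rate procedure described above can be wrapped in a loop. Below is a framework-agnostic sketch, not the author's method; `step_fn`, `explode_factor`, and `lr_decay` are hypothetical names, and in a real PyTorch run the rollback would reload a saved checkpoint rather than deep-copy parameters in memory.

```python
import copy

def train_with_rollback(step_fn, params, lr, max_steps,
                        explode_factor=10.0, lr_decay=0.5):
    """Rerun from the last known-good state with a smaller lr on loss spikes.

    step_fn(params, lr) performs one training step in place and returns
    the loss (hypothetical interface, for illustration only).
    """
    best = copy.deepcopy(params)   # last known-good parameters
    prev_loss = None
    step = 0
    while step < max_steps:
        loss = step_fn(params, lr)
        # Treat a sudden jump in the loss as an explosion:
        # roll back to the last good state and retry with a smaller lr.
        if prev_loss is not None and loss > explode_factor * prev_loss:
            params.clear()
            params.update(copy.deepcopy(best))
            lr *= lr_decay
            continue
        best = copy.deepcopy(params)
        prev_loss = loss
        step += 1
    return params, lr
```

Gradient clipping (e.g. PyTorch's `torch.nn.utils.clip_grad_norm_`) is another common mitigation worth trying before resorting to restarts.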

@Hatermelon

> Thank you for sharing your code. I am trying your code and I do find the loss explosion problem. Do you know the inherent reason of it? Is there any better solution instead of restarting training with lower learning rate every time manually?

Hello, were you able to continue training normally after modifying the parameters manually? This is my first time using the manual method to work around the loss explosion problem. After I changed the learning rate and other parameters as described, training restarted from the first epoch instead of continuing toward 500 epochs, and the learning rate did not reflect my changes. Is there something I have overlooked? Thank you.
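I have not seen your resume script, but a common cause of both symptoms is the order of operations when resuming: loading the optimizer's saved state overwrites a learning rate that was set beforehand, and if the saved epoch counter is not restored, the loop restarts from epoch 1. A minimal sketch of the fix, using a hypothetical `TinyOptimizer` that mimics PyTorch's `param_groups` convention:

```python
class TinyOptimizer:
    """Minimal stand-in for a PyTorch-style optimizer (illustrative only)."""
    def __init__(self, lr):
        self.param_groups = [{"lr": lr}]

    def state_dict(self):
        return {"param_groups": [dict(g) for g in self.param_groups]}

    def load_state_dict(self, state):
        self.param_groups = [dict(g) for g in state["param_groups"]]

# A checkpoint saved before the explosion, at the old learning rate.
opt = TinyOptimizer(lr=1e-3)
ckpt = {"optimizer": opt.state_dict(), "epoch": 120}

# Resume: constructing the optimizer with the new lr is NOT enough,
# because load_state_dict restores the old lr from the checkpoint...
new_lr = 5e-4
opt = TinyOptimizer(lr=new_lr)
opt.load_state_dict(ckpt["optimizer"])

# ...so the new lr must be re-applied AFTER loading, and the saved
# epoch restored so training does not restart from epoch 1.
for group in opt.param_groups:
    group["lr"] = new_lr
start_epoch = ckpt["epoch"] + 1
```

The same pattern applies to a real `torch.optim` optimizer: call `load_state_dict` first, then write the new value into each entry of `optimizer.param_groups`, and begin the epoch loop at the restored count.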

@lyq2335458686

Hello, when I run your code the GPU is not used, even though I have definitely installed CUDA. Why might that be?
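Installing the CUDA toolkit alone is not sufficient: the installed PyTorch wheel must itself be built with CUDA support, and `torch.cuda.is_available()` reports whether the runtime can actually see a GPU. A small diagnostic helper (`cuda_status` is a hypothetical name) that degrades gracefully when `torch` is absent:

```python
import importlib.util

def cuda_status():
    """Report why the GPU may not be usable (illustrative helper)."""
    if importlib.util.find_spec("torch") is None:
        return "torch is not installed in this environment"
    import torch
    if not torch.cuda.is_available():
        # Common cause: a CPU-only torch wheel was installed even though
        # the CUDA toolkit is present on the system. Reinstalling torch
        # with a CUDA-enabled build usually fixes this.
        return "torch cannot see a GPU (CPU-only build or driver issue)"
    return "CUDA ok: " + torch.cuda.get_device_name(0)
```

Checking `torch.version.cuda` (it is `None` for CPU-only builds) and running `nvidia-smi` are quick ways to tell which of these cases applies.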
