Replies: 2 comments 1 reply
-
Are you trying to find the best learning rate for each iteration of the training loop? Not sure if there's a better way to do it than the code example you provided. I wonder if there's a way to write a custom optax scheduler that factors in the resulting loss depending on which learning rate you use? |
Beta Was this translation helpful? Give feedback.
1 reply
-
You can probably |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm working on some code where I'm trying to see if it helps to incorporate some sort of manual line search for the choice of step-size used. I'm checking across a small list of step-sizes to check the best step-size to take. This is the way I've got it currently implemented.
Is there a better way to do this? I thought I could maybe vmap across the step-sizes but that does blow up the memory a fair bit
Beta Was this translation helpful? Give feedback.
All reactions