When training RecurrentPPO over multiple epochs, we do not update the stored LSTM states even though the LSTM weights get updated. Is there a reason for this, or is it just to save compute and does it not affect the optimization process much?
Yes. The stored states are mostly used to get a better initialization of the LSTM hidden state when the collected sequences are replayed during the update.
(Also, the updated LSTM should not be too far in parameter space from the old LSTM used to collect the data, so the stored states remain approximately valid.)
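To make the reasoning concrete, here is a minimal toy sketch (not SB3 code; a plain tanh cell stands in for the LSTM). It shows the pattern described above: only the initial hidden state from rollout collection is stored, per-step states are recomputed on each replay with the current weights, and because a PPO-style update keeps the new weights close to the old ones, the recomputed states stay close to the originals.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy recurrent "cell": h' = tanh(W @ [x; h]).  A stand-in for the LSTM.
def step(W, x, h):
    return np.tanh(W @ np.concatenate([x, h]))

obs_dim, hid_dim, T = 3, 4, 5
W = rng.normal(size=(hid_dim, obs_dim + hid_dim))  # weights at collection time

# --- Rollout collection: store observations and ONLY the initial hidden state.
h0 = np.zeros(hid_dim)
obs = rng.normal(size=(T, obs_dim))
h = h0.copy()
for x in obs:
    h = step(W, x, h)

# --- Training epochs: weights change, but each replay restarts from the SAME
# stored h0; per-step hidden states are recomputed with the current weights
# rather than the stored states themselves being updated.
def replay(W, obs, h0):
    h = h0.copy()
    states = []
    for x in obs:
        h = step(W, x, h)
        states.append(h)
    return np.stack(states)

W_new = W + 0.01 * rng.normal(size=W.shape)  # small PPO-style weight update
states_old = replay(W, obs, h0)
states_new = replay(W_new, obs, h0)

# Because PPO's clipping keeps the updated policy close to the one that
# collected the data, the recomputed states drift only slightly.
print(np.max(np.abs(states_new - states_old)))
```

This is why refreshing the stored states every epoch buys little: the recomputation from `h0` with the current weights already happens in the forward pass, and the small policy update keeps any remaining staleness negligible.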
Relevant code: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/blob/master/sb3_contrib/ppo_recurrent/ppo_recurrent.py#L345-L349