IPPO actions almost always the same #240
Unanswered
ginesmoratalla
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello, I am new to this framework but I was trying to train IPPO on a custom environment. I do not know wether it is a problem of training or execution, when using the trained actor network in the environment, each agent always picks the same action (e.g., agent_0 always picks action 4). Since this is just the execution part, I just wanted to know if anyone can give me tips on how to debug this. Could it be a problem when training the network, or could it be found here in the execution script?
It is the first train that I do in the env, and it run for 240k steps with 4 agents. (execution script below)
Beta Was this translation helpful? Give feedback.
All reactions