Could you please add a way to handle rewards that share the same step in multi-process training? #3

Open
qiuruiyu opened this issue Oct 9, 2023 · 2 comments

Comments

@qiuruiyu

qiuruiyu commented Oct 9, 2023

I train with a multi-process PPO.
When I draw the reward curve with rl-plotter, I get this:
image
As the image shows, several rewards are logged at the same step, but the curve seems to show only one point for them?

@Xiong5Heng

Hi, I've run into the same problem. Have you solved it?

@qiuruiyu
Author

> Hi, I've run into the same problem. Have you solved it?

No, I didn't end up solving it, because the curve itself looks fine.
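
For anyone landing here with the same question, one possible workaround is to merge the duplicate steps yourself before plotting. The sketch below is only an illustration under assumptions: it does not use any rl-plotter API, the function name `aggregate_rewards_by_step` and the sample data are hypothetical, and it assumes the log can be read as `(step, reward)` pairs. How rl-plotter itself treats repeated steps is not established in this thread.

```python
from collections import defaultdict
from statistics import mean


def aggregate_rewards_by_step(records):
    """Collapse (step, reward) pairs that share a step into one row per step.

    Returns a list of (step, mean_reward, count) tuples sorted by step.
    Hypothetical helper, not part of rl-plotter.
    """
    grouped = defaultdict(list)
    for step, reward in records:
        grouped[step].append(reward)
    return [
        (step, mean(rewards), len(rewards))
        for step, rewards in sorted(grouped.items())
    ]


# Made-up log: three workers report at step 2048, two at step 4096.
records = [
    (2048, 10.0),
    (2048, 14.0),
    (2048, 12.0),
    (4096, 20.0),
    (4096, 22.0),
]

aggregated = aggregate_rewards_by_step(records)
for step, avg_reward, n_workers in aggregated:
    print(step, avg_reward, n_workers)
```

Averaging is just one choice. Keeping the count next to the mean makes it easy to switch to a min/max band or a per-worker curve if the spread between processes matters.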
