Log rewards statistics in SAC agents. #232

Open: wookayin wants to merge 1 commit into master from wookayin:sac-log-reward

@wookayin (Contributor) commented May 23, 2022

PPO agents log the reward_mean and rewards_std metrics, but SAC agents do not.

Note that the SAC implementation is not flexible enough for custom metrics to be configured or extended (because update_step is not a method), so it seems reasonable to add them directly to the update_step function.
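As a rough illustration of the point above, here is a minimal JAX sketch of an update step built as a closure that returns reward statistics alongside its other outputs. It is not the actual Acme SAC code: `make_update_step`, the `step_count` argument, and the toy reward batch are hypothetical, the real actor/critic updates are omitted, and the metric keys simply follow the names used in this description.

```python
import jax
import jax.numpy as jnp


def make_update_step():
    """Builds a jitted update step as a closure, not a method (hypothetical helper)."""

    def update_step(step_count, rewards):
        # Placeholder for the real SAC actor/critic updates, omitted here.
        new_step_count = step_count + 1
        metrics = {
            # Reward statistics over the sampled batch. The key names are
            # taken from the PR description and may differ in the real code.
            'reward_mean': jnp.mean(rewards),
            'rewards_std': jnp.std(rewards),
        }
        return new_step_count, metrics

    return jax.jit(update_step)


update_step = make_update_step()
rewards = jnp.array([1.0, 2.0, 3.0, 4.0])  # made-up batch of rewards
new_count, metrics = update_step(jnp.asarray(0), rewards)
```

Because the step function is created inside another function, a subclass has no hook for overriding it, which is why the statistics are computed inline in the returned metrics dictionary.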

@wookayin (Contributor, Author)

I'm submitting this as a PR separate from #231 to ease the review process, but it would be great if each of the PRs could be rebased and merged without creating a merge commit. Thanks!

@wookayin (Contributor, Author) commented Jul 5, 2022

@qstanczyk, thank you for the update on the PR after a long time! I understand DM may not have enough resources available, but as a general request, it would be greatly appreciated if the turnaround time for community contributions could be reduced.

copybara-service bot pushed a commit that referenced this pull request Jul 5, 2022
--
5e38355 by Jongwook Choi <wookayin@gmail.com>:

Log rewards statistics in SAC agents

(similar to PPO agents)

COPYBARA_INTEGRATE_REVIEW=#232 from wookayin:sac-log-reward 5e38355
PiperOrigin-RevId: 459035815
Change-Id: I47181e44776cac8f76ebbbf95fbcf46ccb40c6d7
cop4587 pushed a commit to cop4587/acme-molax that referenced this pull request Jul 5, 2022
alex-gran23 pushed a commit to Next-Step-Fusion/acme that referenced this pull request Feb 23, 2024