Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Should predicted observations be used for computing intrinsic value term in the likelihood AIF? #1

Open
sai-prasanna opened this issue May 23, 2024 · 1 comment

Comments

@sai-prasanna
Copy link

I noticed that you use preferred observations for computing the intrinsic value term in the likelihood AIF. But from what I understand the preferred observations should be used only for the extrinsic value term.

_, posterior_states = self.wm.posterior(obs_embed=self.wm.obs_encoder(preferred_obs).expand(batch_b*batch_t, self.wm.obs_encoder.embed_size), prev_action=None, prev_state=init_states, is_init=True)

image

@mazpie
Copy link
Owner

mazpie commented Jun 6, 2024

Hi @sai-prasanna,

thanks for noting this bug in the public version of the code! I checked the original repo, and the code looks like this:

# compute intrinsic value
embed = self.obs_encoder(predicted_obs)
_, posterior_states = self.posterior(embed, actions, prior_states, is_init=True)

I currently have no time to re-test this version with the change myself, so I may do it later in the future.
If you are currently working with the repo, it would be great if you could test it and contribute the fix yourself!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants