Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

多臂老虎机ε - 贪心算法 解释部分有问题 #84

Open
gymdarius opened this issue Aug 15, 2024 · 0 comments
Open

多臂老虎机ε - 贪心算法 解释部分有问题 #84

gymdarius opened this issue Aug 15, 2024 · 0 comments

Comments

@gymdarius
Copy link

在第一幅 ε =0.01部分的图像中,为了说明算法的累计懊悔几乎是线性增长的,书中提到一句 因为一旦做出了随机拉杆的探索,那么产生的懊悔值是固定的。 。 我尝试打印出regrets数组的值
image
可以明显看出 每次产生的懊悔值是不固定的,不知道是编者编写时候的疏忽还是我理解有问题,请关注一下这个问题,感谢!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant