We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hello,
After 700 k iterations agent just learns to go all in all the time. Maybe the reward architecture should be different?
What is your experience?
Best regards, Roberts
The text was updated successfully, but these errors were encountered:
Me too. After blind all in, the agent starts to play seriously and even wins dozens of times...... Weird
Sorry, something went wrong.
Doesn't that mean the agent is really good?
No branches or pull requests
Hello,
After 700 k iterations agent just learns to go all in all the time. Maybe the reward architecture should be different?
What is your experience?
Best regards,
Roberts
The text was updated successfully, but these errors were encountered: