Skip to content

Latest commit

 

History

History
23 lines (20 loc) · 1.5 KB

File metadata and controls

23 lines (20 loc) · 1.5 KB

Double Deep Q-Learning (DDQN)

Deep Q-Learning since Q-learning involves learning estimates from estimates, such overestimation can be problematic. The solution involves using two separate Q-value estimators, each of which is used to update the other. Using these independent estimators, we can unbiased Q-value estimates of the actions selected using the opposite estimator. We can thus avoid maximization bias by disentangling our updates from biased estimates.

Therefore Double Deep Q-Learning is been invented to fix this issue.

Code

python sample_keras.py

Usefull Resources: