An implementation of Q algorithm and its deep variant for Reinforcement Learning purposes.
This was a part of Charles Isbell, Michael Littman and Pushkar Kolhe Udacity https://www.udacity.com/course/machine-learning-reinforcement-learning--ud820 Reinforcement Learning course.
Other resources
-
Richard S. Sutton and Andrew G. Barto Book