Skip to content

Solutions to RL exercises proposed in the git repo by Denny Britz

Notifications You must be signed in to change notification settings

rahulptel/RL-Solutions

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reinforcement Learning Exercises

Solutions to RL exercises proposed in the git repo by Denny Britz.

Dynamic Programming

  • Policy Evaluation
  • Policy Iteration
  • Value Iteration

Monte Carlo methods

  • Monte Carlo Prediction
  • On-policy Monte Carlo Control with e-greedy policy
  • Off-policy Monte Carlo Control with Weighted Importance Sampling

Temporal Difference Learning

  • SARSA (On-policy TD control)
  • Q-learning (Off-policy TD control)

Function approximation

  • Q-learning with linear function approximation

Deep Q Learning

  • Deep Q learning
  • Double Deep Q learning

Policy Gradient

  • REINFORCE with baseline
  • Actor-Critic with baseline
  • Actor-Critic with baseline for continuous action space
  • Asynchornous Advantage Actor-Critic (A3C)

About

Solutions to RL exercises proposed in the git repo by Denny Britz

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published