Concepts covered
1. Model Based approach
2. Dynamic Programming
3. Policy Iteration (Evaluation and Improvement)
4. Value Iteration
Concepts covered
1. Model Free approach using Monte-Carlo methods
2. Episodic Learning
3. Monte-Carlo Prediction
4. Monte-Carlo Control
5. Eplison Greedy method
3.Cliff Walking Implementation
Concepts covered
1. Model Free approach using Temporal Difference
2. Sarsa for TD-Control
3. Q-learning for TD-Control
Concepts covered
1. Deep Q Learning