This repository contains my solutions to the assignments for the Reinforcement Learning course (Spring 2022), taught by Dr. P.B. Sujit at IISER Bhopal. The assignments cover the following topics:
- Assignment 1: Multi-armed bandit problem and dynamic programming
- Assignment 2: Grid-world design for Sokoban
- Assignment 3: Monte Carlo method and temporal-difference learning for solving Sokoban