ShreeshaN / ReinforcementLearningTutorials Star 3 Code Issues Pull requests This repo contains implementations of algorithms such a Q-learning, SARSA, TD, Policy gradient q-learning pytorch dqn epsilon-greedy breakout sarsa policy-iteration value-iteration monte-carlo-methods deep-q-learning model-based-rl model-free-rl td-methods model-free-control Updated Dec 8, 2019 Python
kanji95 / Topics-in-Machine-Learning-CS7.502 Star 1 Code Issues Pull requests Topics in Machine Learning @ IIIT Hyderabad (Fall 2021) policy-iteration value-iteration inverse-reinforcement-learning monte-carlo-methods actor-critic-methods td-methods vanilla-policy-gradient Updated Apr 25, 2022 Jupyter Notebook
katnoria / td-methods Star 1 Code Issues Pull requests Notebooks covering temporal difference methods using OpenAI Gym reinforcement-learning gym temporal-difference-algorithms reinforcem td-methods Updated Apr 17, 2019 Jupyter Notebook
antonio-f / TD-methods-SARSA Star 0 Code Issues Pull requests Temporal Difference methods - A simple implementation of SARSA algorithm applied to OpenAI gym's "CliffWalking" environment. machine-learning algorithm reinforcement-learning simple openai-gym gym sarsa 101 gym-environment temporal-difference cliffwalking td-methods sarsa-algorithm Updated Jul 10, 2019 Jupyter Notebook