#

td-methods

Here are 4 public repositories matching this topic...

ShreeshaN / ReinforcementLearningTutorials

This repo contains implementations of algorithms such a Q-learning, SARSA, TD, Policy gradient

q-learning pytorch dqn epsilon-greedy breakout sarsa policy-iteration value-iteration monte-carlo-methods deep-q-learning model-based-rl model-free-rl td-methods model-free-control

Updated Dec 8, 2019
Python

katnoria / td-methods

Notebooks covering temporal difference methods using OpenAI Gym

reinforcement-learning gym temporal-difference-algorithms reinforcem td-methods

Updated Apr 17, 2019
Jupyter Notebook

kanji95 / Topics-in-Machine-Learning-CS7.502

Topics in Machine Learning @ IIIT Hyderabad (Fall 2021)

policy-iteration value-iteration inverse-reinforcement-learning monte-carlo-methods actor-critic-methods td-methods vanilla-policy-gradient

Updated Apr 25, 2022
Jupyter Notebook

antonio-f / TD-methods-SARSA

Temporal Difference methods - A simple implementation of SARSA algorithm applied to OpenAI gym's "CliffWalking" environment.

machine-learning algorithm reinforcement-learning simple openai-gym gym sarsa 101 gym-environment temporal-difference cliffwalking td-methods sarsa-algorithm

Updated Jul 10, 2019
Jupyter Notebook

Improve this page

Add a description, image, and links to the td-methods topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the td-methods topic, visit your repo's landing page and select "manage topics."