Skip to content

Latest commit

 

History

History
5 lines (3 loc) · 247 Bytes

README.md

File metadata and controls

5 lines (3 loc) · 247 Bytes

HRL_taxi

MAXQ_0:

Hierarchical reinforcement learning algorithm from "Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition" by T. G. Dietterich for solving Taxi-v3 environment from https://gym.openai.com/envs/Taxi-v3/