This is a code implementation of the algorithm Anchor-based HRL (AHRL) presented in the following paper.
Anchor: The achieved goal to replace the subgoal for hierarchical reinforcement learning (Ruijia Li, Zhiling Cai, Tianyi Huang, William Zhu) https://www.sciencedirect.com/science/article/pii/S0950705121003919
AHRL is tested by MuJoCo and OpenAI gym. Networks are trained using PyTorch.
To train a policy on the Point Maze task by running: python main.py --env PointMaze
To see the performance of a policy trained on the Point Maze task by running: python test.py --env PointMaze
We would like to thank: