Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
-
Updated
Oct 23, 2020 - Jupyter Notebook
Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
A collection of Reinforcement Learning implementations with PyTorch
Phasic-Policy-Gradient
Recurrent Policies for Handling Partially Observable Environments
An implementation from the state-of-the-art family of reinforcement learning algorithms Proximal Policy Optimization using normalized Generalized Advantage Estimation and optional batch mode training. The loss function incorporates an entropy bonus.
Example TRPO implementation with ReLAx
Example PPO implementation with ReLAx
Add a description, image, and links to the generalized-advantage-estimation topic page so that developers can more easily learn about it.
To associate your repository with the generalized-advantage-estimation topic, visit your repo's landing page and select "manage topics."