RecurrentMaskablePPO

RecurrentMaskablePPO is a custom implementation of the Proximal Policy Optimization (PPO) algorithm, designed specifically for environments with recurrent states and maskable actions. This implementation is based on the stable-baselines3-contrib repository, which extends the popular reinforcement learning library, stable-baselines3.

Features

Compatible with environments that have recurrent states and require masking of certain actions.
Built on top of the stable-baselines3 library, inheriting its modularity and ease of use.
Efficient and scalable implementation for complex tasks.

Installation

To install RecurrentMaskablePPO, follow the steps below:

Make sure you have Python 3.7 or later installed on your system. You can download the latest version from the official Python website.
Install stable-baselines3-contrib using requirements.txt:

pip install -r requirements.txt

Clone this repository:

git clone https://github.com/yourusername/RecurrentMaskablePPO.git

Navigate to the cloned repository and install the package:

cd recurrent_msakable
pip install -e .

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
common		common
test		test
README.md		README.md
__init__.py		__init__.py
policies.py		policies.py
ppo_mask_recurrent.py		ppo_mask_recurrent.py
requirement.txt		requirement.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RecurrentMaskablePPO

Features

Installation

About

Releases

Packages

Languages

wdlctc/recurrent_maskable

Folders and files

Latest commit

History

Repository files navigation

RecurrentMaskablePPO

Features

Installation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages