GitHub - tw-ilson/tw-policies: policy gradient optimization library for reinforcement learning.

Introduction

tw-policies is a simple library/tool for Reinforcement Learning, with a focus in the policy gradient class of algorithms. Notably, it contains numpy-only implementations of the policy gradient for discrete and guassian linear function approximations. It contains several premade algorithm implementations using this framework, such as the REINFORCE algorithm and several Actor-Critic algorithms like A2C and PPO.

Usage

You can find the networks and inference code for the policies and state-value evaluation code in the two main modules models.policy which contains the most essential component: the function approximations for different kinds of policies. and models.value. You can find premade algorithm implementations in the agents. Some feed-forward neural networks are available in networks.

Todo:

test PPO implementation
Support offline training
Multi-Headed network outputs
Goal-conditioned training
Shared weights feature extraction for Actor-Critic
automated tuning?

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
example		example
plot		plot
policies		policies
weights		weights
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
cs4100_report.zip		cs4100_report.zip
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

Usage

Todo:

About

Releases

Packages

Languages

tw-ilson/tw-policies

Folders and files

Latest commit

History

Repository files navigation

Introduction

Usage

Todo:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages