NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
-
Updated
Mar 20, 2024 - Python
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
Safe Pontryagin Differentiable Programming (Safe PDP) is a new theoretical and algorithmic safe differentiable framework to solve a broad class of safety-critical learning and control tasks.
Implementations of SAILR, PDO, and CSC
[AAAI 2022] The official implementation of CPQ in "Constraints Penalized Q-learning for Safe Offline Reinforcement Learning"
Constrained Hierarchical Deep Reinforcement Learning with Differentiable Formal Specifications
Add a description, image, and links to the constrained-reinforcement-learning topic page so that developers can more easily learn about it.
To associate your repository with the constrained-reinforcement-learning topic, visit your repo's landing page and select "manage topics."