MDPSolver v0.9.7 Release Notes

Highlights

Users can now select the average reward optimality criteria.

Coming up

We are currently testing and developing the implementation of the built-in models.

Assets 3

17 May 15:21

areenberg

v0.9.6

2d5c397

Parallel computing for Markov Decision Processes

MDPSolver is a Python package for Markov Decision Processes (MDPs) with discounted rewards and infinite-horizon.

Features

Fast solver: Our C++-based solver is substantially faster than other MDP packages available for Python. See details in the documentation.
Three optimization algorithms: Value iteration, Policy iteration, and Modified policy iteration.
Three value-update methods: Standard, Gauss–Seidel, Successive over-relaxation.
Supports sparse matrices.
Employs parallel computing.

Assets 3

20 Oct 12:25

areenberg

v0.9-alpha

b35f7a2

A C++based solver for MDP problems Pre-release

Pre-release

Initial pre-release of our C++based solver for Markov Decision Process optimization problems. The solver is based on a Modified Policy Iteration (MPI) algorithm, which derives an epsilon-optimal policy that maximizes the expected total discounted reward, where epsilon is a tolerance parameter given to the algorithm. We further provide the user with the option to choose between three different value update methods as well as switching to an epsilon-optimal Value Iteration or Policy Iteration algorithm. See the Readme-file for further information.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MDPSolver v0.9.7 Release Notes

Highlights

Coming up

Features

Releases: areenberg/MDPSolver

MDPSolver v0.9.7

MDPSolver v0.9.7 Release Notes

Highlights

Coming up

Parallel computing for Markov Decision Processes

Features

A C++based solver for MDP problems