Release A C++based solver for MDP problems · areenberg/MDPSolver

Initial pre-release of our C++based solver for Markov Decision Process optimization problems. The solver is based on a Modified Policy Iteration (MPI) algorithm, which derives an epsilon-optimal policy that maximizes the expected total discounted reward, where epsilon is a tolerance parameter given to the algorithm. We further provide the user with the option to choose between three different value update methods as well as switching to an epsilon-optimal Value Iteration or Policy Iteration algorithm. See the Readme-file for further information.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A C++based solver for MDP problems