In this study, we perform an empirical analysis of several POMDP solution algorithms using the **POMDPs.jl** package and compare the performance of each algorithm.
This study is part of the DS8010 - Interactive Learning in Decision Processes course project, Winter 2023 term. Details about the course can be found here.
We evaluate the following POMDP problems:
- Crying baby
- Tiger problem
- Paint problem
- Query problem
- Mini hallway problem
- Rock sample problem
- T-Maze problem
We solve the above POMDP problems using the algorithms listed below (a minimal usage sketch follows the list):
- QMDP
- Successive Approximations of the Reachable Space under Optimal Policies (SARSOP)
- Partially Observable Monte Carlo Planning (POMCP)
- POMCP with observation widening (POMCPOW)
- Fast Informed Bound (FIB)
- Point-Based Value Iteration (PBVI)
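All of these solvers plug into the common `solve(solver, pomdp)` interface of POMDPs.jl. As a minimal illustrative sketch (not the project's **main.jl**), the snippet below solves the tiger problem from POMDPModels with three of the offline solvers, assuming their default options:

```julia
using POMDPs, POMDPModels
using QMDP, FIB, PointBasedValueIteration

pomdp = TigerPOMDP()                        # example problem from POMDPModels

qmdp_policy = solve(QMDPSolver(), pomdp)    # QMDP approximation
fib_policy  = solve(FIBSolver(), pomdp)     # Fast Informed Bound
pbvi_policy = solve(PBVISolver(), pomdp)    # Point-Based Value Iteration
```

The remaining solvers follow the same pattern with `SARSOPSolver()`, `POMCPSolver()`, and `POMCPOWSolver()`.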
Requires Julia 1.8.1 or newer. To download Julia, go to julialang.org/downloads
Once Julia is installed, open the Julia REPL and install the POMDPs.jl package (version 0.9.5 or newer):
using Pkg
Pkg.add("POMDPs")
Then install the other required packages:
Pkg.add("DataFrames")
Pkg.add("CSV")
Pkg.add("ElectronDisplay")
Pkg.add("POMDPModels")
Pkg.add("ParticleFilters")
Pkg.add("POMDPLinter")
Pkg.add("Random")
Pkg.add("QMDP")
Pkg.add("FIB")
Pkg.add("PointBasedValueIteration")
Pkg.add("SARSOP")
Pkg.add("BasicPOMCP")
Pkg.add("POMCPOW")
To run the full experiment, open **main.jl** and run the entire file in Julia.
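For example, assuming **main.jl** sits at the repository root and that is the current working directory, the file can be run from the Julia REPL (or with `julia main.jl` from a shell):

```julia
include("main.jl")   # runs the full experiment
```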
A simple example of running the tiger problem on its own: include tigerProblem_solvers.jl from the problems_simulation folder and call the solver function as shown below.
include("problems_simulation/tigerProblem_solvers.jl")
# do not print beliefs; run 10 rounds of 1000 trials each
run_tiger_solvers(false, 1000, 10)
The code above records the experiment results, such as runtime, reward, and discounted reward, for each problem in the results folder.
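The sketch below illustrates how runtime and discounted reward might be measured for a single solver. It is a minimal example under our own assumptions, not the project's recording code, and it relies on the POMDPTools package, which is not part of the install list above:

```julia
using POMDPs, POMDPModels, QMDP
using POMDPTools: RolloutSimulator   # assumed helper package for simulation

pomdp = TigerPOMDP()

# wall-clock time to compute the policy
t0 = time()
policy = solve(QMDPSolver(), pomdp)
runtime = time() - t0

# discounted return of a single simulated rollout (capped at 100 steps)
discounted_reward = simulate(RolloutSimulator(max_steps=100), pomdp, policy)
```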
After completing the experiments for all problems, open the Jupyter notebook **results/summary_results.ipynb** to organize the results into a performance matrix and generate plots.
Results of the experiments can be found in the Jupyter notebook: results/summary_results.ipynb
An organized performance matrix CSV file can be found at: results/summary.csv
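For further analysis outside the notebook, the CSV can also be loaded directly in Julia with the CSV and DataFrames packages installed above (column names depend on the generated file):

```julia
using CSV, DataFrames

# load the organized performance matrix from the repository root
summary = CSV.read("results/summary.csv", DataFrame)
first(summary, 5)   # preview the first few rows
```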
- Project Proposal can be found here: DS8010_Yang_Research_Proposal.pdf
- Intermediate report can be found here: DS8010_Project_Report_Yang_Part1.pdf
- Final complete report can be found here: DS8010_Project_Report_Yang.pdf
- Timeline
- Week 1-2 (March 13-26)
- Review relevant literature on POMDPs and their algorithms
- Prepare the POMDP problems (crying baby, paint problem, query problem) for evaluation and evaluate the performance of each algorithm on the selected problems
- Create diagrams to explain each problem and the architecture of each algorithm
- Week 3-4 (March 27-April 9)
- Review relevant literature on POMDPs and their algorithms, and complete the intermediate report (introduction, literature review, methods section), which is due April 10
- Prepare the POMDP problems (simple grid world problem / maze problem) and evaluate the performance of each algorithm on the selected problems
- Week 5-6 (April 10-23)
- Prepare the remaining POMDP problems (simple grid world problem / maze problem) and evaluate the performance of each algorithm on the selected problems
- Draft the research report (experimental setup, results, conclusions)
- Week 7 (April 24-26)
- Review and finalize the research report, and submit it by the due date of April 26
- Maxim Egorov, Zachary N. Sunberg, Edward Balaban, Tim A. Wheeler, Jayesh K. Gupta, and Mykel J. Kochenderfer. POMDPs.jl: A Framework for Sequential Decision Making under Uncertainty. Journal of Machine Learning Research, 18, April 2017.