GitHub - preneond/Markov-Decision-Process-Value-Iteration-Wumpus: MDP Value Iteration algorithm applied in Wumpus world with uncertainty

MDP-VI-Wumpus-World

MDP Value Iteration applied in Wumpus World to help the robot find its way to the pot(s) of gold in a world filled with deadly pits.

Environment

Robot moves in grid world with 4 possible actions that have stochastic effects. Gold, pits and wumpus are terminal states. Reaching gold gives agent large positive utility while falling into a pit or encountering Wumpus gives agent large negative utility. Agent also gets small penalty for each move.

Robot can execute following actions with stochastic effects (class Action):

NORTH – Actual effect: 80% NORTH, 10% EAST, 10% WEST
SOUTH – Actual effect: 80% SOUTH, 10% EAST, 10% WEST
EAST – Actual effect: 80% EAST, 10% NORTH, 10% SOUTH
WEST – Actual effect: 80% WEST, 10% NORTH, 10% SOUTH

If the robot executes a move action that would end up in an obstacle, it bounces back to its current-position. The environment is a matrix MxN, where the first index represents columns (x-coordinate) and the second index represents rows (y-coordinate). The columns (rows) are indexed starting from 0, i.e. we have columns (rows) 0,1,…,M-1 (N-1). Each cell can contain (class CellContent):

EMPTY
OBSTACLE
GOLD
PIT

The simulation finishes if the agent reaches the gold, falls into a pit, or after h steps.

Rewards

-1 at each move action
-100 + (-1) for action that results in a pit
-100 + (-1) when a Wumpus moves to the same cell as the agent
100 + (-1) for action that reaches the gold

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
lib		lib
src		src
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MDP-VI-Wumpus-World

Environment

Rewards

About

Releases

Packages

Languages

preneond/Markov-Decision-Process-Value-Iteration-Wumpus

Folders and files

Latest commit

History

Repository files navigation

MDP-VI-Wumpus-World

Environment

Rewards

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages