Machine_ToM

An implementation of "Machine Theory of Mind" (ICML 2018). The paper is available at http://proceedings.mlr.press/v80/rabinowitz18a/rabinowitz18a.pdf

Install

Installation instructions will be added soon.

Structure

└─Machine_ToM
    ├─agent : Agent directory used in Experiments
    ├─environment : Environment
    ├─experiment :  Experiment files
    ├─model : Machine ToM models
    └─utils : dataloader, storage and visualization

Run the code

python main.py --num_exp 2 --sub_exp 1 --num_epoch 1000

Check the environment, agent, etc

Environment

python environment/env.py

Agent

python agent/reward_seeking_agent.py

Experiment Description

Experiment 1: We predict the next action in the current state for random agents whose policies are drawn from a Dirichlet distribution. You can adjust the number of past trajectories with --sub_exp.
Experiment 2: We predict the future action, consumption, and successor representation of value iteration agents. The number of walls is sampled between 0 and 4.
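
To make the random agents of Experiment 1 concrete, here is a minimal sketch of drawing a policy from a Dirichlet distribution and sampling actions from it. It is not the repository's agent code: the names sample_policy and sample_actions, the five-action set, the alpha values, and the assumption that the action probabilities are the same in every state are all illustrative.

```python
import numpy as np

NUM_ACTIONS = 5  # assumption: e.g. up, down, left, right, stay


def sample_policy(alpha, rng, num_actions=NUM_ACTIONS):
    """Draw one vector of action probabilities from a symmetric Dirichlet(alpha)."""
    return rng.dirichlet(np.full(num_actions, alpha))


def sample_actions(policy, num_steps, rng):
    """Sample a sequence of action indices from a fixed, state-independent policy."""
    return rng.choice(len(policy), size=num_steps, p=policy)


rng = np.random.default_rng(0)
peaked = sample_policy(alpha=0.5, rng=rng)   # small alpha: mass tends to sit on few actions
diffuse = sample_policy(alpha=3.0, rng=rng)  # large alpha: closer to uniform
past_actions = sample_actions(peaked, num_steps=10, rng=rng)
```

A smaller alpha tends to give agents that strongly prefer a few actions, so a short past trajectory already says a lot about them; a larger alpha gives agents that are harder to tell apart.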

There are three sub-experiments.

First sub-experiment: MToM with the full trajectory of an agent on a single past MDP. The agent gets a penalty (-0.01) for every move.
Second sub-experiment: MToM with a partial trajectory (one step) of an agent on a single past MDP. The agent gets a penalty (-0.01) for every move.
Third sub-experiment: Same as the first sub-experiment, but the agent gets a higher penalty (-0.05) for every move.
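
The per-move penalty in the sub-experiments above changes how strongly a value iteration agent prefers short paths. The sketch below runs value iteration on a small grid with a single goal to show the effect. It is an illustration under stated assumptions, not the repository's agent: the function name value_iteration, the 5x5 grid, the single goal with reward 1.0, the discount gamma=0.9, and deterministic moves are all hypothetical choices.

```python
import numpy as np

ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def value_iteration(size, goal, walls, step_penalty,
                    goal_reward=1.0, gamma=0.9, tol=1e-8):
    """Return the optimal state values of a deterministic grid world."""
    values = np.zeros((size, size))
    while True:
        new_values = np.zeros_like(values)
        for r in range(size):
            for c in range(size):
                if (r, c) == goal or (r, c) in walls:
                    continue  # terminal or blocked cell keeps value 0
                best = -np.inf
                for dr, dc in ACTIONS:
                    nr, nc = r + dr, c + dc
                    # Bumping into the border or a wall leaves the agent in place.
                    if not (0 <= nr < size and 0 <= nc < size) or (nr, nc) in walls:
                        nr, nc = r, c
                    reward = step_penalty + (goal_reward if (nr, nc) == goal else 0.0)
                    best = max(best, reward + gamma * values[nr, nc])
                new_values[r, c] = best
        if np.max(np.abs(new_values - values)) < tol:
            return new_values
        values = new_values


# Same layout, two penalties: -0.01 (first and second sub-experiment)
# and -0.05 (third sub-experiment).
low = value_iteration(size=5, goal=(4, 4), walls={(2, 2)}, step_penalty=-0.01)
high = value_iteration(size=5, goal=(4, 4), walls={(2, 2)}, step_penalty=-0.05)
```

With the higher penalty every non-terminal cell has a lower value, and the drop grows with distance from the goal, so detours become less attractive.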
