Skip to content

Simple implementation of the Target-UCB algorithm.

Notifications You must be signed in to change notification settings

lupuandr/Target-UCB

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Target-UCB

targetUCB.py is a simple class implementing of the Target-UCB bandit algorithm, as introduced in the AAAI-19 technical paper Leveraging Observations in Bandits: Between Risks and Benefits.

To import the Target-UCB class, simply use the command "from targetUCB import TUCB".

An example of how to construct and test a clique of 4 Target-UCB agents is also provided. Executing the targetUCB.py file as a script will run this clique for 100 episodes on a two-armed bandit problem and display the cumulative regret of all 4 agents.

Supplemental

The supplemental.pdf file is the supplemental material for the AAAI-19 technical paper. It provides proofs, additional experiments and methodological details about the human bandit experiments.

About

Simple implementation of the Target-UCB algorithm.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages