Target-UCB

targetUCB.py contains a simple class implementing the Target-UCB bandit algorithm, introduced in the AAAI-19 technical paper Leveraging Observations in Bandits: Between Risks and Benefits.

To import the Target-UCB class, use the statement "from targetUCB import TUCB".

An example of how to construct and test a clique of 4 Target-UCB agents is also provided. Executing the targetUCB.py file as a script will run this clique for 100 episodes on a two-armed bandit problem and display the cumulative regret of all 4 agents.

Supplemental

The supplemental.pdf file is the supplemental material for the AAAI-19 technical paper. It provides proofs, additional experiments and methodological details about the human bandit experiments.

