trustful-bandits

We tackle the two-armed bandit problem from a stochastic-algorithm standpoint. The goal is to optimize, online, the allocation of capital between two trading agents: we want to maximize our profits and thus, asymptotically, allocate all our resources to the best agent.
We also provide a comprehensive mathematical summary of the two main references, covering the proofs of convergence and of the rate of convergence of the algorithm.

Here, we try to reproduce the theoretical results obtained in the paper, apply them to real data, and finally extend the approach to a multi-agent setup.
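
To make the allocation problem described above concrete, here is a minimal sketch of a two-armed bandit update of the linear reward-inaction type. It is an illustration under stated assumptions, not the code in `bandits.py`: the function names, the Bernoulli model of a "good" outcome (success probabilities `p_a` and `p_b`), and the step schedule `c / (c + n)` are all hypothetical choices made for this example. `x` is the fraction of capital allocated to agent A; at each step one agent is evaluated at random, A with probability `x`, and the allocation moves toward that agent only if it performed well.

```python
import random


def step_size(n, c=1.0):
    """Hypothetical step schedule gamma_n = c / (c + n), decreasing to 0."""
    return c / (c + n)


def run_two_armed_bandit(p_a, p_b, n_steps, x0=0.5, c=1.0, seed=0):
    """Simulate the allocation x_n given to agent A (assumed Bernoulli outcomes).

    p_a, p_b: assumed probabilities that agent A / agent B performs well.
    Returns the final fraction of capital allocated to agent A.
    """
    rng = random.Random(seed)
    x = x0
    for n in range(1, n_steps + 1):
        gamma = step_size(n, c)
        if rng.random() < x:
            # Agent A is evaluated; reward it only if it performed well.
            if rng.random() < p_a:
                x += gamma * (1.0 - x)
        else:
            # Agent B is evaluated; reward it only if it performed well.
            if rng.random() < p_b:
                x -= gamma * x
    return x


if __name__ == "__main__":
    final_x = run_two_armed_bandit(p_a=0.7, p_b=0.3, n_steps=10_000)
    print(f"final allocation to agent A: {final_x:.4f}")
```

Because the step size stays strictly between 0 and 1, the allocation remains in [0, 1] after every update. Whether it actually converges to the best agent, and how fast, depends on the step schedule, which is the question studied in [1] and [2].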

References:

[1] Can you trust the bandit? Damien Lamberton, Gilles Pagès and Pierre Tarrès
[2] How fast is the bandit? Damien Lamberton and Gilles Pagès

Nicolivain/trustful-bandits

Folders and files

Latest commit

History

Repository files navigation

trustful-bandits

References:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages