Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
-
Updated
May 2, 2023 - Python
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on
Papers about recommendation systems that I am interested in
A simple, extensible library for developing AutoML systems
Simple A/B testing library for Clojure
👤 Multi-Armed Bandit Algorithms Library (MAB) 👮
Demo project using multi-armed bandit algorithm
Python application to setup and run streaming (contextual) bandit experiments.
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.
Simple implementation of the CGP-UCB algorithm.
More about the exploration-exploitation tradeoff with harder bandits
Offline evaluation of multi-armed bandit algorithms
Contextual Multi-Armed Bandit Platform for Scoring, Ranking & Decisions
COLEMAN (Combinatorial VOlatiLE Multi-Armed BANdit) - and strategies for HCS context
A curated list on papers about combinatorial multi-armed bandit problems.
A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.
Software for the experiments reported in the RecSys 2019 paper "Multi-Armed Recommender System Bandit Ensembles"
Add a description, image, and links to the multi-armed-bandit topic page so that developers can more easily learn about it.
To associate your repository with the multi-armed-bandit topic, visit your repo's landing page and select "manage topics."