# multi-armed-bandit

Here are 120 public repositories matching this topic...

mpatacchiola / dissecting-reinforcement-learning

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

reinforcement-learning genetic-algorithm markov-chain deep-reinforcement-learning q-learning neural-networks mountain-car sarsa multi-armed-bandit inverted-pendulum actor-critic temporal-differencing-learning drone-landing dissecting-reinforcement-learning

Updated May 2, 2023
Python

SMPyBandits / SMPyBandits

🔬 Research Framework for Single and Multi-Players 🎰 Multi-Armed Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM, etc.). Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on

python open-source research internet-of-things simulations multi-arm-bandits multi-armed-bandit learning-theory bandit-algorithms cognitive-radio

Updated Apr 30, 2024
Jupyter Notebook

OnYuKang / Recommendation-systems-paperlist

Papers about recommendation systems that I am interested in

deep-learning social-network survey collaborative-filtering recommender-system recommendation multi-armed-bandit explainable-recommendations session-based-recommendation-system

Updated Mar 17, 2020

MLBazaar / BTB

A simple, extensible library for developing AutoML systems

hyperparameter-optimization gaussian-processes multi-armed-bandit automl

Updated Jul 28, 2023
Python

taoensso / touchstone

Simple A/B testing library for Clojure

clojure epl multi-armed-bandit split-testing taoensso engagement-testing

Updated Mar 19, 2024
Clojure

alison-carrera / mabalgs

👤 Multi-Armed Bandit Algorithms Library (MAB) 👮

arm algorithm reinforcement-learning simulation monte-carlo rank thompson-sampling reinforcement-learning-algorithms ucb reward multi-armed-bandit montecarlo-simulation contextual-bandits ranking-algorithm mab ranked-mab

Updated Sep 6, 2022
Python

Unity-Technologies / BanditDungeon

Demo project using multi-armed bandit algorithm

unity unity3d multi-armed-bandit

Updated Feb 10, 2020
C#

roycoding / slots

A multi-armed bandit library for Python

python multi-armed-bandit

Updated Jan 13, 2020
Python

Nth-iteration-labs / streamingbandit

Python application to set up and run streaming (contextual) bandit experiments.

streaming online sequential multi-armed-bandit bandit mab contextual cmab multi-armed

Updated Mar 31, 2023
Python

Nth-iteration-labs / contextual

Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies

Updated Jul 25, 2020
R

stitchfix / mab

Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.

go golang data-science reinforcement-learning thompson-sampling experimentation multi-armed-bandits multi-armed-bandit thompson multiarmed-bandits

Updated Nov 12, 2024
Go

google / MAB

R package for Multi-Armed Bandit Simulation Study

multi-armed-bandit

Updated Aug 18, 2017
R

ardaegeunlu / Contextual-Gaussian-Process-Bandit-Optimization

Simple implementation of the CGP-UCB algorithm.

machine-learning reinforcement-learning machine-learning-algorithms reinforcement-learning-algorithms gaussian-processes multi-armed-bandit

Updated Nov 30, 2019
Python

gdmarmerola / advanced-bandit-problems

More about the exploration-exploitation tradeoff with harder bandits

machine-learning multi-armed-bandit bandit-algorithms

Updated May 12, 2019
Jupyter Notebook

antoine-hochart / bandit_algo_evaluation

Offline evaluation of multi-armed bandit algorithms

thompson-sampling epsilon-greedy policy-evaluation multi-armed-bandit upper-confidence-bound

Updated Dec 1, 2020
Python

improve-ai / python-ranker

Contextual Multi-Armed Bandit Platform for Scoring, Ranking & Decisions

python machine-learning reinforcement-learning ai personalization xgboost ab-testing recommender-system multi-armed-bandit multivariate-testing contextual-bandits improve-ai

Updated Jun 9, 2023
Python

jacksonpradolima / coleman4hcs

COLEMAN (Combinatorial VOlatiLE Multi-Armed BANdit) and strategies for the HCS context

tcp continuous-integration ci multi-armed-bandit hcs coleman mab test-case-prioritization tcpci highly-configurable-system

Updated Nov 12, 2024
Jupyter Notebook

ZIYU-DEEP / Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems

A curated list of papers on combinatorial multi-armed bandit problems.

thompson-sampling multi-armed-bandit combinatorial-optimization bandit-algorithms combinatorial-bandit

Updated May 10, 2021

nathanwispinski / meta-rl

A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.

python reinforcement-learning deep-learning neural-network rnn a3c multi-armed-bandit haiku a2c jax

Updated Feb 27, 2023
Jupyter Notebook

ir-uam / EnsembleBandits

Software for the experiments reported in the RecSys 2019 paper "Multi-Armed Recommender System Bandit Ensembles"

ensemble recommender-system multi-armed-bandit

Updated Aug 16, 2019
Java
