e-greedy

Here are 7 public repositories matching this topic...

tatiana-boura / MSc-in-AI-Demokritos-Reinforcement-Learning-Course

Implementation of an Q-learning, ϵ-greedy agent that learns how to play the game with the other agents he is connected to.

reinforcement-learning q-learning multiagent-systems e-greedy

Updated Sep 11, 2023
Python

JoelJa835 / MAB_Algorithms

Star

Implementation of Multi-Armed Bandit (MAB) algorithms UCB and Epsilon-Greedy. MAB is a class of problems in reinforcement learning where an agent learns to choose actions from a set of arms, each associated with an unknown reward distribution. UCB and Epsilon-Greedy are popular algorithms for solving MAB problems.

reinforcement-learning-algorithms ucb bandits mab e-greedy

Updated Mar 26, 2023
Python

Anca-Mt / CartPole-DeepQLearning

Star

DQN agent with e-greedy / softmax policy, experience replay and target network.

policy dqn softmax open-ai-gym cartpole-environment reinfrocement-learning e-greedy

Updated Aug 20, 2024
Python

Murtazali05 / Multi-armed-bandit

Star

Multi Armed Bandits implementation using the Jester Dataset

thompson-sampling ucb multi-armed-bandits e-greedy

Updated Apr 5, 2021
Python

n4i9kita / ExploratoryProject

Star

Analysis of various multi armed bandit algorithms over normal and heavy-tailed distributions.

reinforcement-learning multi-armed-bandits multiarmed-bandits e-greedy normaldistr

Updated May 12, 2021
Jupyter Notebook

OrestisMk / RF-Q_learning-taxi_driver--Lunanlander-Policy-gradient-

Star

This is a project of reinforcement learning which contains two different environments. The first environment is the taxi driver problem in 4x4 space with the simple Q-learning update rule. In this task, we compared the performance of the e-greedy policy and Boltzmann policy. As a second environment, we chose the LunarLander from the open gym. Fo…

reinforcement-learning q-learning boltzmann-exploration policy-gradient lunarlander-v2 taxi-driver e-greedy

Updated Jan 15, 2021

Stepan-Makarenko / Multi-armed-bandit-research

Star

multi-armed-bandits ucb1 e-greedy

Updated Dec 17, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the e-greedy topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the e-greedy topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

e-greedy

Here are 7 public repositories matching this topic...

tatiana-boura / MSc-in-AI-Demokritos-Reinforcement-Learning-Course

JoelJa835 / MAB_Algorithms

Anca-Mt / CartPole-DeepQLearning

Murtazali05 / Multi-armed-bandit

n4i9kita / ExploratoryProject

OrestisMk / RF-Q_learning-taxi_driver--Lunanlander-Policy-gradient-

Stepan-Makarenko / Multi-armed-bandit-research

Improve this page

Add this topic to your repo