The goal of this project is to build an RL-based algorithm that helps cab drivers maximize their profits by improving their decision-making in the field. With long-term profit as the objective, the project proposes a reinforcement learning method for optimizing taxi driving strategies. This optimization problem is fo…
actions
deep-reinforcement-learning
prediction
data-visualization
convergence
dqn
epsilon-greedy
states
rl
rewards
hyperparameter-tuning
model-evaluation
model-building
optimal-policy
markov-decision-process
epsilon-decay
mdp-framework
training-dqn-agent
q-values-tracking
minibatch-gradient-descent
Updated Jul 9, 2021 - Jupyter Notebook