RL-car

Training a reinforcement learning agent for OpenAI's Car Racing environment.

Algorithms used

Deep Q-Learning[1]

We implement a Deep Q-Network and its forward pass in the DQN class in model.py. Our network takes a single frame as input.

The training loop for the DeepQ network is defined in deepq.py file. The target network updations and the deepQ step are defined in the learning.py file.

The action space is defined in the action.py file. We experimented with various action sets and eventually decided to stick with the 7 actions as defined in the file.

schedule.py is the script that defines the exploration-exploitation tradeoff. We begin with a p_initial value of 1 which means we would like to focus on exploration early on during the training.

Double Deep Q-Learning[2]

We implement a Double Deep Q-Network and its forward pass in the DQN class in model.py. Our network takes a single frame as input similar to the Deep Q learning experiment.

The traning loop for the Double Deep Q network is defined in the file deepq_double.py. The target network updation and double deepQ step is defined in the learning_double.py file.

For this experiment, we use the same action spaces as the DeepQ experiment.

We use the same scheme for exploration-exploitation tradeoff as in the Deep Q leanring experiment.

Noticable techniques

Replay buffer for storing agent's memories
Target q-network to make q-learning stable

To install the gym environment

extract sdc_gym.zip
cd sdc_gym
pip install -e .["box2d"]

To run the evaluation code

python evaluate_racing.py score

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
other-models		other-models
plots-1		plots-1
plots-local/plots-1		plots-local/plots-1
.gitignore		.gitignore
README.md		README.md
Report.pdf		Report.pdf
action.py		action.py
agent-baseline-1.pt		agent-baseline-1.pt
agent.pt		agent.pt
deepq.py		deepq.py
deepq_double.py		deepq_double.py
evaluate_racing.py		evaluate_racing.py
evaluate_racing_cluster.py		evaluate_racing_cluster.py
learning.py		learning.py
learning_double.py		learning_double.py
model.py		model.py
replay_buffer.py		replay_buffer.py
report.md		report.md
schedule.py		schedule.py
sdc_gym.zip		sdc_gym.zip
train_racing.py		train_racing.py
train_racing_cluster.py		train_racing_cluster.py
train_racing_cluster_double.py		train_racing_cluster_double.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RL-car

Algorithms used

Noticable techniques

To install the gym environment

To run the evaluation code

References

About

Releases

Packages

Contributors 2

Languages

arorashu/rl-car

Folders and files

Latest commit

History

Repository files navigation

RL-car

Algorithms used

Noticable techniques

To install the gym environment

To run the evaluation code

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages