GitHub - IgorKolodziej/DoomRL: Training a mighty reinforcement learning agent to play Doom.

Doom - Reinforcement Learning

This Project is created as part of research workshops course included in Data Science Studies. This Project is going to be done by a team of 5 people.

Project Team:

Objectives

This project aims to implement and evaluate reinforcement learning models to complete scenarios in the VizDoom environment. The training process utilized two machine learning methods: Proximal Policy Optimization (PPO) and Advantage Actor-Critic (A2C). The models were trained to maximize performance by interacting with the environment and receiving rewards for actions taken.

Repository Structure

Branches

master-cnn-defend: Contains models and code for the Basic and Defend Center scenarios.
master: Contains models and code for the Death Corridor scenario.

File Structure

src/game: VizDoom and Gymnasium integrations
src/metrics/: Metrics implementations for TensorBoard
src/models/: Custom models implementations
src/rewards/: Custom environment rewards
src/training/: Training scripts
src/utils: Utilities such as image preprocessing
reports/: Full report from the project
run.py: Model testing script

Description

Reinforcement Learning Concept

Reinforcement learning (RL) is a type of machine learning where an agent learns by interacting with its environment and receiving rewards for actions taken. Key elements include:

Agent: The program or algorithm making decisions.
Environment: Everything the agent interacts with.
Actions: Possible decisions or movements by the agent.
State: Current situation or configuration of the environment.
Rewards: Feedback from the environment, indicating the quality of actions.
Policy: Strategy defining actions based on states.

VizDoom

VizDoom is a platform for training and testing AI algorithms, particularly in reinforcement learning, within the Doom game environment. It provides a 3D environment and a Python API for integration with machine learning tools.

Scenarios

Basic: The agent aims to shoot a target directly in front of it.
Defend Center: The agent must defend itself by shooting approaching enemies.
Death Corridor: The agent navigates a narrow corridor filled with enemies, aiming to reach the end.

Training Process

The training involved:

Implementing and testing models using PPO and A2C algorithms.
Preprocessing game state data, including image processing with CNNs.
Evaluating the performance using various metrics such as ammo usage, episode length, kill count, and reward.

Results and Conclusions

PPO outperformed A2C, showing better stability and efficiency in training.
Training solely on images using CNNs was more effective than including scalar game state data.
Metrics indicated significant improvement in agent performance over time.

Future Development

Future improvements could focus on exploring more scenarios, optimizing reward function parameters, and potentially automating the tuning process using methods like grid search. A comparison of agent performance against human players could also provide valuable insights.

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
.idea		.idea
reports		reports
src		src
.gitignore		.gitignore
README.md		README.md
_vizdoom.ini		_vizdoom.ini
environment.yml		environment.yml
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Doom - Reinforcement Learning

Project Team:

Objectives

Repository Structure

Branches

File Structure

Description

Reinforcement Learning Concept

VizDoom

Scenarios

Training Process

Results and Conclusions

Future Development

About

Releases

Packages

Languages

IgorKolodziej/DoomRL

Folders and files

Latest commit

History

Repository files navigation

Doom - Reinforcement Learning

Project Team:

Objectives

Repository Structure

Branches

File Structure

Description

Reinforcement Learning Concept

VizDoom

Scenarios

Training Process

Results and Conclusions

Future Development

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages