Minecraft, a globally popular sandbox video game, provides a dynamic, block-based universe where players build, explore, and interact with a variety of entities and landscapes. This project trains a tree-chopping agent in the Minecraft environment using Deep Q-learning from Demonstrations (DQfD).
The code depends on the following libraries:
- Python 3.7
- PyTorch 1.8.1
- CUDA 11.2
- MineRL
- PFRL
The Minecraft environment is wrapped by MineRL, so please follow the MineRL documentation to install that library first. Make sure you can successfully run the following test code from the documentation:
import minerl
import gym

env = gym.make('MineRLNavigateDense-v0')

obs = env.reset()
done = False
net_reward = 0

while not done:
    action = env.action_space.noop()
    # Steer the camera towards the compass target while running,
    # jumping, and attacking.
    action['camera'] = [0, 0.03 * obs["compassAngle"]]
    action['back'] = 0
    action['forward'] = 1
    action['jump'] = 1
    action['attack'] = 1

    obs, reward, done, info = env.step(action)
    net_reward += reward

print("Total reward: ", net_reward)
PFRL is a library implementing several state-of-the-art deep reinforcement learning algorithms. Our project uses it for the prioritized replay buffer. You can find more details about their work here.
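As a minimal sketch of how the buffer is used (the capacity and annealing values below are illustrative, not the settings hard-coded in train.py):

```python
import numpy as np
import pfrl

# Prioritized experience replay (Schaul et al., 2016) from PFRL.
# capacity/alpha/beta values are illustrative, not the ones used by train.py.
replay_buffer = pfrl.replay_buffers.PrioritizedReplayBuffer(
    capacity=100_000,   # maximum number of stored transitions
    alpha=0.6,          # how strongly TD error shapes sampling priority
    beta0=0.4,          # initial importance-sampling correction
    betasteps=200_000,  # anneal beta towards 1 over this many updates
)

# Store a dummy transition; PFRL manages the priorities internally.
obs = np.zeros((3, 64, 64), dtype=np.float32)
replay_buffer.append(
    state=obs,
    action=0,
    reward=1.0,
    next_state=obs,
    is_state_terminal=False,
)

# Sampled transitions carry importance-sampling weights for the Q-learning update.
batch = replay_buffer.sample(1)
```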
To download the human demonstration data, follow the guidance in the MineRL documentation and point the data root at <YOUR LOCAL REPO PATH>/data/rawdata. More specifically:

sudo gedit ~/.bashrc

Add the line

export MINERL_DATA_ROOT=<YOUR LOCAL REPO PATH>/data/rawdata

at the end of the file and save it, then:

source ~/.bashrc
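Alternatively, you can set the variable for the current process only from Python (replace the placeholder with your actual repository path):

```python
import os

# Point MineRL at the demonstration directory for this process only.
os.environ['MINERL_DATA_ROOT'] = '<YOUR LOCAL REPO PATH>/data/rawdata'
```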
Then download the MineRLTreechopVectorObf-v0 dataset:
python3 -m minerl.data.download "MineRLTreechopVectorObf-v0"
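Once the download finishes, you can sanity-check the demonstrations with MineRL's data API; this minimal snippet assumes MINERL_DATA_ROOT is set as above:

```python
import minerl

# Load the Treechop demonstrations from MINERL_DATA_ROOT.
data = minerl.data.make('MineRLTreechopVectorObf-v0')

# Iterate over a few demonstration transitions. In the *VectorObf*
# environments, observations contain a 'pov' image and actions are
# obfuscated 64-dimensional vectors.
for obs, action, reward, next_obs, done in data.batch_iter(
        batch_size=1, seq_len=32, num_epochs=1):
    print(obs['pov'].shape, action['vector'].shape, reward.sum())
    break
```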
Then preprocess the dataset to extract the frames and compute the action space:
python3 -u preprocess.py \
    --ROOT "./" \
    --DATASET_LOC "./data/rawdata/MineRLTreechopVectorObf-v0" \
    --actionNum 32 \
    --PREPARE_DATASET True \
    --n 25
It generates the output frames in <YOUR LOCAL REPO PATH>/data/processdata and the action space in <YOUR LOCAL REPO PATH>/actionspace.
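preprocess.py is the authoritative implementation; as a rough sketch, a common way to turn the obfuscated 64-dimensional action vectors into a discrete action space of --actionNum actions is k-means clustering over the demonstration actions. This illustration assumes scikit-learn, which is not in the dependency list above:

```python
import numpy as np
from sklearn.cluster import KMeans

def build_action_space(demo_action_vectors: np.ndarray, n_actions: int = 32):
    """Cluster the 64-d obfuscated demonstration actions into a discrete set.

    demo_action_vectors: array of shape (N, 64) collected from the dataset.
    Returns the (n_actions, 64) cluster centers, each of which becomes one
    discrete action, plus the nearest-center label of every demo action
    (used as the imitation target for that transition).
    """
    kmeans = KMeans(n_clusters=n_actions, random_state=0).fit(demo_action_vectors)
    return kmeans.cluster_centers_, kmeans.labels_
```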
If you would like to train your own agent, make sure your machine has at least 32 GB of RAM:
python3 -u train.py \
    --ROOT "./" \
    --DATASET_LOC "./data/rawdata/MineRLTreechopVectorObf-v0" \
    --MODEL_SAVE "./saved_network" \
    --actionNum 32 \
    --n 25
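train.py contains the full training loop; the component that distinguishes DQfD from plain deep Q-learning is the supervised large-margin loss on demonstration transitions (Hester et al., 2018). A self-contained sketch of that term, with illustrative names, in PyTorch:

```python
import torch

def margin_loss(q_values: torch.Tensor,
                expert_actions: torch.Tensor,
                margin: float = 0.8) -> torch.Tensor:
    """Large-margin classification loss from DQfD (Hester et al., 2018).

    q_values:       (batch, n_actions) Q(s, a) for every discrete action.
    expert_actions: (batch,) long tensor of the demonstrator's action indices.

    Computes J_E = max_a [Q(s, a) + l(a_E, a)] - Q(s, a_E), where the
    penalty l(a_E, a) is `margin` for a != a_E and 0 otherwise; it pushes
    the expert action's Q-value at least `margin` above every other action.
    """
    penalties = torch.full_like(q_values, margin)
    penalties.scatter_(1, expert_actions.unsqueeze(1), 0.0)  # no penalty on a_E
    q_expert = q_values.gather(1, expert_actions.unsqueeze(1)).squeeze(1)
    return ((q_values + penalties).max(dim=1).values - q_expert).mean()
```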
We also provide our best trained agent at <YOUR LOCAL REPO PATH>/saved_network/best_model.pt; you can run it with:
python3 -u evaluate.py \
    --ROOT "./" \
    --DATASET_LOC "./data/rawdata/MineRLTreechopVectorObf-v0" \
    --MODEL_SAVE "./saved_network" \
    --agentname best_model.pt \
    --actionNum 32 \
    --n 25
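Under the hood, evaluation amounts to rolling out the greedy policy: load the network, and at each step pick the discrete action with the highest Q-value and map it back to the 64-dimensional action vector via the saved action space. A conceptual sketch (the model-loading details and the action_centers.npy file name are illustrative assumptions, not the exact artifacts produced by this repo):

```python
import gym
import minerl  # noqa: F401 -- registers the MineRL environments
import numpy as np
import torch

# Illustrative artifacts: assumes best_model.pt stores the whole module and
# that preprocessing saved the cluster centers as action_centers.npy.
model = torch.load('./saved_network/best_model.pt', map_location='cpu')
model.eval()
action_centers = np.load('./actionspace/action_centers.npy')  # hypothetical name

env = gym.make('MineRLTreechopVectorObf-v0')
obs, done, total_reward = env.reset(), False, 0.0
while not done:
    # Network input: normalized CHW image from the agent's point of view.
    pov = torch.as_tensor(obs['pov'].copy()).permute(2, 0, 1).float() / 255.0
    with torch.no_grad():
        action_index = model(pov.unsqueeze(0)).argmax(dim=1).item()
    # Map the discrete index back to the obfuscated action vector.
    obs, reward, done, _ = env.step({'vector': action_centers[action_index]})
    total_reward += reward
print('Episode reward:', total_reward)
```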
The results of the different architectures are shown in the table below: