SimpleTOD: A Simple Language Model for Task-Oriented Dialogue

Authors: Ehsan Hosseini-Asl, Bryan McCann, Chien-Sheng Wu, Semih Yavuz, and Richard Socher

Introduction

Task-oriented dialogue (TOD) systems accomplish a goal described by a user in natural language. They often use a pipeline approach. Such approach requires natural language understanding (NLU) for belief state tracking, dialogue management (DM) for deciding which actions to take based on those beliefs, and natural language generation (NLG) for generating responses.

We propose recasting task-oriented dialogue as a simple, causal (unidirectional) language modeling task. We show that such an approach can solve all the sub-tasks in a unified way using multi-task maximum likelihood training. The proposed Simple Task-Oriented Dialogue (SimpleTOD) approach enables modeling of the inherent dependencies between the sub-tasks of task-oriented dialogue, by optimizing for all tasks in an end-to-end manner.

Paper link: https://arxiv.org/abs/2005.00796

Blog link: https://blog.einstein.ai/simpletod

Installation (Modified)

The package general requirements are

Python >= 3.6
Pytorch >= 1.2 (installation instructions here)
Transformers >= 2.5.1 (installation instructions here)

1- The package can be installed by running the following command.

pip install -r requirements.txt

And install required Spacy en_core_web_sm

python -m spacy download en_core_web_sm

2- Running inside docker container

docker build -t <image_name>:<tag> -f Dockerfile

3- Conda Environment The environment.yml is available if you are using Anaconda

Usage

This section explains steps to preprocess MultiWOZ dataset and training the model.

Preprocessing:

It includes downloading MultiWOZ dataset, performing delexicaliztion, and creating dataset for language model

bash create_dataset.sh

Each dialogue turn will be represented as a sequence, which contains previous user/system turns, belief, action, and delexicalized response

<|endoftext|> <|context|> <|user|> i am looking for a college type attraction . <|system|> there are 18 colleges i have found , would you prefer 1 in town centre or in the west ? <|user|> i would like to visit on in town centre please . <|system|> sure , we have thirteen options , 10 of which are free . may i suggest king s college , or hughes hall ? <|user|> okay , may i have their postcode , entrance fee , and phone number ?<|endofcontext|> 
<|belief|> attraction type college , attraction name kings college|hughes hall , attraction area centre <|endofbelief|> 
<|action|> attraction inform name , attraction inform fee , attraction inform post , attraction inform phone <|endofaction|> 
<|response|> sure , the post code to [attraction_name] is [attraction_postcode] , the entrance fee is free , and phone number [attraction_phone] <|endofresponse|> <|endoftext|>

DST training:

training the model for predicting belief states.

train_dst.sh $GPU gpt2 $GPT2_TYPE $BATCH

If you want to enable FP16 Training, please install NVIDIA apex

git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" --global-option="--fast_multihead_attn" --no-build-isolation .

In my case, I would like to leverage RTX 4090, so my training will be For instance

train_dst.sh 0 gpt2 gpt2 6

For this task, we include none slot values in the sequence. We observed that this will improve SimpleTOD performance on DST by reducing false positive rates.

<|endoftext|> <|context|> <|user|> am looking for a place to to stay that has cheap price range it should be in a type of hotel <|endofcontext|> 
<|belief|> hotel name not mentioned , hotel area not mentioned , hotel parking not mentioned , hotel pricerange cheap , hotel stars not mentioned , hotel internet not mentioned , hotel type hotel <|endofbelief|> <|endoftext|>

End-to-End training:

In this step, we train SimpleTOD on the sequence of context+belief+action+delex response. Compared to DST task, we do not include none slot values, because of the sequence length limitaiton od GPT2.

train_end2end.sh $GPU gpt2 $GPT2_TYPE $BATCH

In my case, I would like to leverage RTX 4090 with FP16, so my training will be For instance

bash train_end2end.sh 1 gpt2 gpt2 6

Generation:

This script will generate SimpeTOD belief/action/responses. Generation is based on each dialogue, where it create context for each turn and save the generated belief, action, and responses for the dialogue.

CUDA_VISIBLE_DEVICES=$GPU python generate_dialogue.py $CHECKPOINT $DECODING

Example

CUDA_VISIBLE_DEVICES=1 python generate_dialogue.py --checkpoint output/gpt2/checkpoint-470000 --decoding greedy

It will save the model output in a json file MODEL_OUTPUT which contains all dialogues with groundtruth user and system responses as well.

In order to use DB search during generation, set --use_db_search (this will use oracle DB search results)
In order to use DB search dynamically, set --use_db_search and --use_dynamic_db
To use oracle belief and actions, simple set --use_oracle_belief and --use_oracle_action

Evaluation

MultiWOZ evaluation contains two part, Dialogue State Tracking (DST) and End-to-End.

DST evaluation

In order to compute joint accuracy, simply run the following script using the generated MODEL_OUTPUT file. it will use the generated belief states to compute the metric. It will compute joint accuracy without any label cleaning.

python compute_joint_acc.py $MODEL_OUTPUT

There are two types of label cleaning that can be used to compute joint accuracy.

To use default lable cleaning suggested by MultiWOZ author, please set --default_cleaning (for more details, please refer to MultiWOZ FAQ.5)
We found other type of noisy annotation. Please refer to the paper for more details different types of noisy annotations. Here, we provide an option to compute joint accuracy by fixing Type 2 noisy annotation (where one or more slots are not labeled in some turns.) by setting --type2_cleaning
The complete list of Type 2 noisy annotations is here. For more details on noisy annotation on MultiWOZ dataset, please refer to the paper

End-to-End evaluation

In order to compute inform/success/BLEU, simply run the following script. It will load generated belief states and responses, and computes the metrics.

python evaluate_multiwoz.py $MODEL_OUTPUT

Demo

In order to test the model in real conversation with human, we have provided a simple script where user can input text in a multi turn setting, and see the responses from SimpleTOD. It will generate lexicalized responses and belief states at each turn. For more information, please read the blog.

python demo.py $CHECKPOINT $DECODING

eg:

python demo.py output/gpt2/checkpoint-470000 nucleus 0.5

Citation

@article{hosseini2020simple,
  title={A simple language model for task-oriented dialogue},
  author={Hosseini-Asl, Ehsan and McCann, Bryan and Wu, Chien-Sheng and Yavuz, Semih and Socher, Richard},
  journal={arXiv preprint arXiv:2005.00796},
  year={2020}
}

License

The code is released under the BSD-3 License - see LICENSE for details

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
data		data
images		images
models		models
noisy_annotations		noisy_annotations
utils		utils
.gitignore		.gitignore
CODEOWNERS		CODEOWNERS
Dockerfile		Dockerfile
LICENSE.txt		LICENSE.txt
README.md		README.md
compute_joint_acc.py		compute_joint_acc.py
create_dataset.sh		create_dataset.sh
demo.py		demo.py
environment.yml		environment.yml
evaluate_multiwoz.py		evaluate_multiwoz.py
generate_dialogue.py		generate_dialogue.py
main.py		main.py
prepare_simpletod_data.py		prepare_simpletod_data.py
preprocess_multiwoz.py		preprocess_multiwoz.py
requirements.txt		requirements.txt
train_dst.sh		train_dst.sh
train_end2end.sh		train_end2end.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SimpleTOD: A Simple Language Model for Task-Oriented Dialogue

Introduction

Table of Contents

Installation (Modified)

Usage

Preprocessing:

DST training:

End-to-End training:

Generation:

Evaluation

DST evaluation

End-to-End evaluation

Demo

Citation

License

About

Releases

Packages

Languages

License

yodiaditya/simpletod

Folders and files

Latest commit

History

Repository files navigation

SimpleTOD: A Simple Language Model for Task-Oriented Dialogue

Introduction

Table of Contents

Installation (Modified)

Usage

Preprocessing:

DST training:

End-to-End training:

Generation:

Evaluation

DST evaluation

End-to-End evaluation

Demo

Citation

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages