(aka Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers)
Following the major success of neural language models (LMs) such as BERT or GPT-2 on a variety of language understanding tasks, recent work focused on injecting (structured) knowledge from external resources into these models. While on the one hand, joint pretraining (i.e., training from scratch, adding objectives based on external knowledge to the primary LM objective) may be prohibitively computationally expensive, post-hoc fine-tuning on external knowledge, on the other hand, may lead to the catastrophic forgetting of distributional knowledge. In this work, we investigate models for complementing the distributional knowledge of BERT with conceptual knowledge from ConceptNet and its corresponding Open Mind Common Sense (OMCS) corpus, respectively, using adapter training. While overall results on the GLUE benchmark paint an inconclusive picture, a deeper analysis reveals that our adapter-based models substantially outperform BERT (up to 15-20 performance points) on inference tasks that require the type of conceptual knowledge explicitly present in ConceptNet and OMCS.
Retrograph is the official repository behind the Commonsense Adapter paper from the University of Mannheim, TU Darmstadt, and Wluper.
The key idea is that one can inject knowledge into pretrained language models using Adapters.
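For intuition, here is a minimal sketch of a bottleneck adapter layer in the spirit of Houlsby et al. (2019). The framework (PyTorch), module name, and dimensions are illustrative assumptions, not the repo's actual TensorFlow implementation.

```python
# Minimal sketch of a bottleneck adapter (illustrative, not the repo's code).
import torch
import torch.nn as nn

class Adapter(nn.Module):
    def __init__(self, hidden_size=768, bottleneck_size=64):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck_size)  # down-projection
        self.up = nn.Linear(bottleneck_size, hidden_size)    # up-projection

    def forward(self, hidden_states):
        # Bottleneck transformation plus a residual connection, so the adapter
        # starts out close to the identity and gradually injects new knowledge.
        return hidden_states + self.up(torch.relu(self.down(hidden_states)))
```

Such adapters are inserted into the transformer layers; during adapter training the pretrained BERT weights stay frozen and only the small adapter parameters are updated, which avoids both expensive retraining from scratch and catastrophic forgetting of distributional knowledge.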
We try two methods to generate training data for the adapters:
- OMCS (sentences from the Open Mind Common Sense corpus)
- Random walks over ConceptNet
We evaluate on:
- GLUE
- CSQA (CommonsenseQA)
- COPA (Choice of Plausible Alternatives)
- SIQA (Social IQa)
The key results can be found in the paper: Link To Paper
Environment: Python 3.6
Please follow the instructions below to run the experiments.
Step 0: Download BERT
bash ./0_download_bert.sh
It creates:
- models/BERT_BASE_UNCASED
From here, you can either:
- Generate random walks and pretrain an adapter -> go to B - Random Walks and Pretraining
- Finetune on existing adapters -> go to C - Finetuning on Pretrained Adapters
B - Random Walks and Pretraining
Follow these steps to pretrain an adapter.
Step 1: Download Relations
bash ./1_download_relations.sh
It creates:
- relations/cn_relationType*.txt
Step 2: Create the sequences of tokens using random walks generated by node2vec:
bash ./2_create_random_walks.sh
It creates the main file: randomwalks/random_walk_1.0_1.0_2_15.p
(and also: randomwalks/cn_assertions_filtered.tsv)
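The sketch below illustrates the idea of this step under simplifying assumptions: it performs plain uniform random walks over ConceptNet assertions, whereas the actual script uses node2vec-style biased walks (the numbers in the file name plausibly encode walk hyperparameters such as p, q, number of walks, and walk length). The function names and toy triples are made up for the example.

```python
# Illustrative sketch: token sequences from random walks over ConceptNet triples.
import random
from collections import defaultdict

def build_graph(triples):
    """triples: iterable of (head, relation, tail) ConceptNet assertions."""
    graph = defaultdict(list)
    for head, relation, tail in triples:
        graph[head].append((relation, tail))
    return graph

def random_walk(graph, start, num_hops=15):
    """Uniform random walk; the real script uses node2vec's biased transitions."""
    walk, node = [start], start
    for _ in range(num_hops):
        if not graph[node]:
            break
        relation, node = random.choice(graph[node])
        walk.extend([relation, node])  # keep relations as part of the token sequence
    return walk

triples = [("puppy", "IsA", "dog"), ("dog", "CapableOf", "bark")]
print(random_walk(build_graph(triples), "puppy", num_hops=2))
# e.g. ['puppy', 'IsA', 'dog', 'CapableOf', 'bark']
```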
Step 3: Create natural language text from the random walks:
bash ./3_generate_corpus.sh
The generated corpus will be used as input for BERT + Adapters. It creates a file in TF format: randomwalks/rw_corpus_1.0_1.0_2_15_nl.tf
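As a rough illustration of this step, the sketch below verbalizes a walk into text with simple per-relation templates. The template wordings and the fallback phrase are assumptions; the repo defines its own verbalizations of ConceptNet relations.

```python
# Illustrative sketch: turn a random walk into natural-language sentences.
TEMPLATES = {
    "IsA": "{} is a {}.",
    "CapableOf": "{} can {}.",
    "UsedFor": "{} is used for {}.",
}

def verbalize(walk):
    """walk: alternating [node, relation, node, relation, node, ...]."""
    sentences = []
    for i in range(0, len(walk) - 2, 2):
        head, relation, tail = walk[i], walk[i + 1], walk[i + 2]
        template = TEMPLATES.get(relation, "{} is related to {}.")
        sentences.append(template.format(head, tail))
    return " ".join(sentences)

print(verbalize(["puppy", "IsA", "dog", "CapableOf", "bark"]))
# -> "puppy is a dog. dog can bark."
```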
Step 4: Pretrain the adapter using the RW corpus:
bash ./4_pretrain_adapter.sh
It creates a model in: models/output_pretrain_adapter
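The adapter is trained with a BERT-style masked-language-modelling objective on the random-walk corpus while the original BERT weights stay frozen. The sketch below only shows a simplified version of the token masking (always substituting [MASK], ignoring BERT's 80/10/10 split); the helper name is made up rather than taken from the repo's TensorFlow pipeline.

```python
# Illustrative sketch: simplified masked-language-modelling data preparation.
import random

def mask_tokens(tokens, mask_prob=0.15, mask_token="[MASK]"):
    """Return (masked_tokens, labels); label is None where nothing is masked."""
    masked, labels = [], []
    for token in tokens:
        if random.random() < mask_prob:
            masked.append(mask_token)
            labels.append(token)   # the adapter-augmented BERT must predict this
        else:
            masked.append(token)
            labels.append(None)    # ignored by the MLM loss
    return masked, labels

print(mask_tokens("a puppy is a dog .".split()))
```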
C - Finetuning on Pretrained Adapters
Download the pretrained adapters:
bash ./9_download_pretrained_adapters_rw30.sh
bash ./9_download_pretrained_adapters_omcs.sh
All fine-tuned models will be saved in: models/output_model_finetunning
Modify the task_2_....sh files if you want to change hyperparameters.
Then, for each task, run the corresponding scripts in numerical order:
- Run all glue_1,2_.sh files, in that order
- Run all csqa_1,2_.sh files, in that order
- Run all copa_1,2_.sh files, in that order
- Run all siqa_1,2_.sh files, in that order