PairCFR: Enhancing Model Training on Paired Counterfactually Augmented Data through Contrastive Learning
Here we provide the data, including the counterfactually augmented data (CAD) used by our method and the back-translation augmented data used by the comparison methods, for the sentiment analysis and natural language inference tasks, together with the code, covering the design of all models and the combined contrastive and cross-entropy loss. More details can be found in our paper.
We use the human-in-the-loop counterfactually augmented data provided by Kaushik et al. (2019) [1]: counterfactually-augmented-data.
Task | Domain | Classes | Original-to-counterfactual ratio
---|---|---|---
Sentiment analysis | IMDb movie reviews | 2 | 1:1
Natural language inference | SNLI dataset | 3 | 1:4
Other test data sources (a loading sketch follows these lists):

For the sentiment analysis task:
- IMDb, download from: https://huggingface.co/datasets/imdb
- Amazon, download from: https://huggingface.co/datasets/Siki-77/amazon6_5core_polarity
- Yelp, download from: https://huggingface.co/datasets/yelp_polarity
- Twitter, download from: https://huggingface.co/datasets/carblacac/twitter-sentiment-analysis
- SST-2, download from: https://huggingface.co/datasets/gpt3mix/sst2/viewer/default/test

For the natural language inference task:
- SNLI, download from: https://huggingface.co/datasets/snli
- MNLI-m, download from: https://huggingface.co/datasets/SetFit/mnli
- MNLI-mm, download from: https://huggingface.co/datasets/SetFit/mnli
- Negation, download from: https://huggingface.co/datasets/pietrolesci/stress_tests_nli
- Spelling error, download from: https://huggingface.co/datasets/pietrolesci/stress_tests_nli
- Word overlap, download from: https://huggingface.co/datasets/pietrolesci/stress_tests_nli
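A minimal sketch of fetching these evaluation sets with the Hugging Face `datasets` library (split and column names differ per dataset; check each dataset card, e.g. the NLI stress tests ship several configurations):

```python
from datasets import load_dataset

# Sentiment analysis: the IMDb test split has fields `text` and `label`.
imdb_test = load_dataset("imdb", split="test")

# NLI: the SNLI test split has fields `premise`, `hypothesis`, `label`;
# examples without a gold label are marked with label == -1 and are usually filtered out.
snli_test = load_dataset("snli", split="test")
snli_test = snli_test.filter(lambda ex: ex["label"] != -1)
```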
[1] Kaushik D., Hovy E., Lipton Z. Learning the Difference That Makes a Difference with Counterfactually-Augmented Data. International Conference on Learning Representations, 2019.
Pre-trained model + classification head. Below we list all pre-trained models used in our experiments; they can be loaded by the following model names through Hugging Face (a loading sketch follows the list):
- bert-base-uncased
- roberta-base
- t5-base
- sentence-transformers/multi-qa-distilbert-cos-v1
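A hedged sketch of wiring a classification head onto one of the backbones above with `transformers` (BERT shown; the T5 and sentence-transformers backbones need head setups that follow the repository code). `num_labels` is 2 for sentiment analysis and 3 for NLI:

```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_name = "bert-base-uncased"          # or "roberta-base", etc.
tokenizer = AutoTokenizer.from_pretrained(model_name)
# A fresh classification head is initialised on top of the pre-trained encoder.
model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=2)

batch = tokenizer(["A gripping, well-acted film."], return_tensors="pt", truncation=True)
logits = model(**batch).logits            # shape: (1, num_labels)
```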
Environment
- Python 3.8
- PyTorch 2.0.1
To run the code, install the appropriate PyTorch version and the required packages:
pip install torch==2.0.1
pip install -r requirements.txt
Run the fine-tuning code on IMDb CAD:
cd runimdb
python run_bash.py
Run the fine-tuning code on SNLI CAD:
cd runsnli
python run_bash.py