This repository accompanies the research presented in the article "Cheap Learning: Maximising Performance of Language Models for Social Data Science Using Minimal Data".
We test different NLP techniques and models for classifying text when labelled data are scarce. We train and test four approaches: a Naive Bayes model, a Weak Supervision technique, a Prompt Engineering technique, and a Transfer Learning technique.
For each of these techniques, we train and test on two different datasets: the IMDb movie reviews and the Wikipedia Detox.
For more details, please refer to the paper.
The package is written in Python (version 3.8). We recommend installing inside a virtual environment; for this, one can use `conda`, which comes bundled with Anaconda and has the advantage of letting us specify the version of Python we want to use. Python 3.8 is required.
After navigating to the local GitHub folder (`cd $PATH$`, e.g. `cd Documents/Local_Github/cheap_learning`), a new environment can be created with `conda env create -f environment.yaml`. The environment's name will be `cheap_learning`.
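Once created, the environment can be activated with `conda activate cheap_learning`.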
First introduced in Maas et al. (2011), the data set contains 50,000 movie reviews from IMDb, labelled according to whether they have a positive or a negative sentiment (0: negative sentiment (50%); 1: positive sentiment (50%)).
The entire data set is located in the subfolder `data/binary_movie_sentiment`. In it, we distinguish between:
- The clean data splits
- The raw data
- An unbalanced version of the data splits with a ratio of 12% Positive vs 88% Negative reviews
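As a pointer for exploring the data, a split can be loaded with `pandas`. Note that the file name `train.csv` and the column names `text` and `label` below are illustrative assumptions, not necessarily the names used in the repository.

```python
import pandas as pd

# NOTE: file and column names are assumptions for illustration;
# check the actual files under data/binary_movie_sentiment.
df = pd.read_csv("data/binary_movie_sentiment/train.csv")

print(df["label"].value_counts(normalize=True))  # expect roughly 50/50
print(df["text"].iloc[0][:100])                  # peek at one review
```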
Given that we cannot rule out that the IMDb movie reviews data set is part of the training data of the GPT-3.5 and GPT-4 models, we collected and tested an analogous data set of movie reviews from TMDb. This data set contains 855 movie reviews published after October 2021 (past the GPT training data cut-off), with a ratio of 73.3% positive reviews and 26.7% negative reviews.
The data set can be found in `data/tmdb`. The scraper script is found in `src/tmdb-database.py`.
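For reference, reviews can be fetched from the public TMDb API roughly as below. This is a minimal sketch assuming you have registered a TMDb API key; it is not necessarily how `src/tmdb-database.py` collects the data.

```python
import requests

API_KEY = "YOUR_TMDB_API_KEY"  # assumption: a registered TMDb API key
movie_id = 634649              # any TMDb movie id

# The /movie/{id}/reviews endpoint returns a JSON payload with a
# "results" list; each entry holds the review text under "content".
url = f"https://api.themoviedb.org/3/movie/{movie_id}/reviews"
resp = requests.get(url, params={"api_key": API_KEY})
resp.raise_for_status()
for review in resp.json()["results"]:
    print(review["author"], review["content"][:80])
```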
First introduced in Wulczyn et al. (2017), the data set contains 115,864 comments from the English-language Wikipedia, labelled according to whether they contain a personal attack or not (0: no personal attack (88.3%); 1: contains personal attack (11.7%)).
The entire data set is located in the subfolder `data/binary_abuse`. In it, we distinguish between:
- The clean data splits
- The raw data
- The annotated keywords in `misc`, used by the Weak Supervision labeling functions
Each of the techniques, with the exception of the zero-shot Prompt Engineering classification using GPT-3, GPT-3.5 and GPT-4, has a `bash` script that deploys its training.
To deploy the training with Naive Bayes, please run `bash ./src/naive_bayes_train_script.sh`. The `bash` script calls `src/naive_bayes_classifier.py`.
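For orientation, a minimal Naive Bayes text classifier of the kind `src/naive_bayes_classifier.py` trains can be sketched with `scikit-learn`; the features and hyperparameters in the repository may differ.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Toy training data; the real script reads the splits under data/.
texts = ["a wonderful, moving film", "dull and badly acted"]
labels = [1, 0]  # 1: positive sentiment, 0: negative sentiment

# Bag-of-words features feeding a multinomial Naive Bayes model.
clf = make_pipeline(CountVectorizer(), MultinomialNB())
clf.fit(texts, labels)
print(clf.predict(["an absolutely wonderful film"]))  # -> [1]
```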
To deploy the training with Weak Supervision, please run `bash ./src/weak_supervision_script.sh`. The `bash` script calls `src/weak_supervision.py` and the dictionary of labeling functions, found in `src/labeling_functions.py`.
In particular, for the binary abuse task, Weak Supervision also uses the annotated keywords in `data/binary_abuse/misc`.
We train the model defined by the authors of the Snorkel Weak Supervision framework:
- `LabelModel`
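For orientation, the workflow can be sketched with the `snorkel` library as below: labeling functions vote on each example, and `LabelModel` combines their noisy votes into training labels. The toy labeling functions here are illustrative, not the ones in `src/labeling_functions.py`.

```python
import pandas as pd
from snorkel.labeling import labeling_function, PandasLFApplier
from snorkel.labeling.model import LabelModel

ABSTAIN, NEG, POS = -1, 0, 1

# Toy keyword labeling functions; each votes POS, NEG, or abstains.
@labeling_function()
def lf_positive_word(x):
    return POS if "wonderful" in x.text.lower() else ABSTAIN

@labeling_function()
def lf_negative_word(x):
    return NEG if "awful" in x.text.lower() else ABSTAIN

df = pd.DataFrame({"text": ["a wonderful film", "an awful film", "a film"]})
L = PandasLFApplier([lf_positive_word, lf_negative_word]).apply(df)

# LabelModel learns the accuracies of the labeling functions and
# outputs denoised labels for downstream training.
label_model = LabelModel(cardinality=2, verbose=False)
label_model.fit(L, n_epochs=100, seed=123)
print(label_model.predict(L))
```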
To deploy the training with Transfer Learning, please run `bash ./src/transfer_learning_train_script.sh`. The `bash` script calls `src/transfer_learning.py`.
We train two models:
- DistilBERT
- DeBERTa-v3
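For orientation, fine-tuning one of these models with the Hugging Face `transformers` `Trainer` looks roughly like the sketch below (toy data and default hyperparameters; the repository's settings may differ).

```python
import torch
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

model_name = "distilbert-base-uncased"  # or "microsoft/deberta-v3-base"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name,
                                                           num_labels=2)

# Toy training data standing in for the splits under data/.
texts = ["a wonderful film", "an awful film"]
labels = [1, 0]
enc = tokenizer(texts, truncation=True, padding=True)

class ToyDataset(torch.utils.data.Dataset):
    def __init__(self, encodings, labels):
        self.encodings, self.labels = encodings, labels
    def __len__(self):
        return len(self.labels)
    def __getitem__(self, i):
        item = {k: torch.tensor(v[i]) for k, v in self.encodings.items()}
        item["labels"] = torch.tensor(self.labels[i])
        return item

args = TrainingArguments(output_dir="tl_out", num_train_epochs=1,
                         per_device_train_batch_size=2)
Trainer(model=model, args=args, train_dataset=ToyDataset(enc, labels)).train()
```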
To deploy the training with Prompt Engineering, please run `bash ./src/prompt_engineering_train_script.sh`. The `bash` script calls `src/prompt_engineering.py`.
We use three different prompts for each of the two datasets:
For IMDb movie review sentiment:
- "Is this text negative?"
- "Does this text contain negative sentiment?"
- "It was? Negative or not negative?"
For Wikipedia Detox:
- "Is this text abusive?",
- "Does this text contain abuse?"
- "It was? Abusive or Not Abusive"
We train two models:
- DistilBERT
- GPT-2
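For orientation, the core idea can be sketched with a masked language model: the prompt is appended to the text, and the model's scores for candidate answer words at the mask position are compared. This is a minimal illustration using the `transformers` fill-mask pipeline, not the exact training setup in `src/prompt_engineering.py` (which fine-tunes the models).

```python
from transformers import pipeline

# Score candidate answer words at the mask position of a prompted input.
fill = pipeline("fill-mask", model="distilbert-base-uncased")
text = "The film was dull and far too long."
prompt = f"{text} Is this text negative? {fill.tokenizer.mask_token}."

# Restrict predictions to the two verbaliser words and compare scores.
for pred in fill(prompt, targets=["yes", "no"]):
    print(pred["token_str"], round(pred["score"], 4))
```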
We also perform a zero-shot classification exercise (no training) with the out-of-the-box OpenAI LLMs GPT-3, GPT-3.5 and GPT-4.
The script of the zero-shot exercise can be found in the Jupyter notebook `open_ai_prompt_engineering.ipynb`.
We use three different prompts for each of the two datasets:
For IMDb movie review sentiment:
- "Using one word, classify the sentiment of the movie review using 'Positive' or 'Negative'."
- "Using one word, does the movie review contain negative sentiment, Yes or No?"
- "You are a researcher who needs to classify movie reviews as containing negative sentiment or not containing negative sentiment. Using one word, does the movie review contain negative sentiment, Yes or No?"
For Wikipedia Detox:
- "Using one word, does the internet comment contain toxic language, Yes or No?"
- "Using one word, is this internet comment using toxic language, Yes or No?"
- "You are a researcher who needs to classify comments on the internet as containing abusive language or not containing abusive language. Using one word, does the internet comment contain abusive language, Yes or No?"
A collection of `csv` files with all the results can be found in `results`. The full results are stored in `task_results_final.csv`.
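For example, the full results table can be loaded with `pandas` for further inspection (the exact columns are defined by the file itself):

```python
import pandas as pd

# Load the consolidated results table from the results folder.
results = pd.read_csv("results/task_results_final.csv")
print(results.head())
```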
Analysis is done via the Jupyter notebooks `results_analysis.ipynb` and `plot_manuscript_figures.ipynb`.
For any questions, please contact Jonathan Bright - jbright@turing.ac.uk