COMS 6998 E010 final project, rmg2203 + ka2744
Our goal was to construct a version of MT-DNN that uses XLNet as its shared encoder in place of BERT, and to compare the performance of the two architectures. To keep training lightweight and flexible, we reduced the number of task heads relative to the original MT-DNN methodology and focused on tasks with more manageable dataset sizes. Our results are mixed:
Model | CoLA (Matthews corr.) | STS-B (Pearson/Spearman corr.) | WNLI (accuracy) |
---|---|---|---|
BERT | 0.521 | 0.867 / 0.86 | 0.45 |
XLNet | 0.258 | 0.885 / 0.88 | 0.56 |
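For reference, the sketch below shows the shared-encoder / per-task-head structure described above, written against the Hugging Face transformers library (assuming a recent version whose models return output objects with `last_hidden_state`). The class name, task keys, and head sizes are illustrative assumptions, not the exact code in our notebooks.

```python
import torch.nn as nn
from transformers import XLNetModel

class MultiTaskXLNet(nn.Module):
    """Illustrative multi-task model: one shared XLNet encoder, one small head per task."""

    def __init__(self, encoder_name="xlnet-base-cased"):
        super().__init__()
        # Shared encoder used by every task.
        self.encoder = XLNetModel.from_pretrained(encoder_name)
        hidden = self.encoder.config.d_model
        # One lightweight head per task: CoLA and WNLI are 2-way classification,
        # STS-B is a single-output regression.
        self.heads = nn.ModuleDict({
            "cola": nn.Linear(hidden, 2),
            "wnli": nn.Linear(hidden, 2),
            "stsb": nn.Linear(hidden, 1),
        })

    def forward(self, task, input_ids, attention_mask=None):
        outputs = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        # Pool with the final position: XLNet's tokenizer places its <cls>
        # summary token at the end of the sequence.
        pooled = outputs.last_hidden_state[:, -1, :]
        return self.heads[task](pooled)
```

During multi-task training, batches from the different tasks are interleaved, each batch is routed to its matching head, and gradients flow back into the shared encoder.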
mtl-run.ipynb provides the code for creating and running the model. We also provide code, in seq-length.ipynb, for testing how the model's performance varies with the maximum input sequence length.
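The kind of sweep seq-length.ipynb performs can be approximated in a few lines: tokenize the same inputs at several maximum lengths and evaluate each truncated version. This is a hedged sketch, not the notebook's exact code; `evaluate_model` is a hypothetical placeholder for your evaluation routine.

```python
from transformers import XLNetTokenizer

tokenizer = XLNetTokenizer.from_pretrained("xlnet-base-cased")
sentences = ["The book was read by the students over the weekend."]

for max_len in (32, 64, 128, 256):
    batch = tokenizer(
        sentences,
        padding="max_length",
        truncation=True,
        max_length=max_len,
        return_tensors="pt",
    )
    # Each iteration yields inputs padded/truncated to a different length;
    # score the model on each to see how max_seq_length affects performance.
    print(max_len, batch["input_ids"].shape)
    # score = evaluate_model(model, batch)   # hypothetical evaluation call
```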
Although we ran the code on selected GLUE tasks, the same code can be extended to the other GLUE tasks as well. Please refer to the Hugging Face documentation on GLUE tasks (https://huggingface.co/transformers/v2.3.0/examples.html) for more details.
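As one possible way to extend this, the sketch below pulls another GLUE task (SST-2) with the Hugging Face `datasets` package and builds a matching classification head. Using `datasets` and hard-coding the hidden size of 768 (xlnet-base-cased) are assumptions on our part, not something the notebooks currently do.

```python
import torch.nn as nn
from datasets import load_dataset

# Download another GLUE task; returns train/validation/test splits.
sst2 = load_dataset("glue", "sst2")
num_labels = sst2["train"].features["label"].num_classes  # 2 for SST-2

# A new per-task head sized to the encoder's hidden dimension
# (768 for xlnet-base-cased), to be registered alongside the existing heads.
sst2_head = nn.Linear(768, num_labels)
```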
- This repository is set up to run on Python 3.8. Make sure your Python environment has a matching version.
- Clone this repository.
- In the repository directory, run `pip install -r requirements.txt`.
- Run Jupyter, with `jupyter notebook` or similar.
- Run (and experiment with) the notebooks.