📚 Transformers in NLP: RoBERTa and XLNet

🎯 Business Objective

In Part 1 of our Transformer series, "Multi-Class Text Classification with Deep Learning using BERT", we traced the evolution of NLP approaches from simple text representations such as Bag of Words (BoW) and TF-IDF to Transformer architectures such as BERT.

In Part 2, we dive into two models that improve on BERT's results mainly through changes to pretraining and optimization:

RoBERTa: A Robustly Optimized BERT Pretraining Approach
XLNet: Generalized Autoregressive Pretraining for Language Understanding

We'll analyze these models, explore their training methods, and use them to classify human emotions from text data.

📄 Data Description

We use the Emotion dataset, loaded through the Hugging Face datasets library. It consists of English Twitter messages, each labeled with one of six basic emotions: anger, fear, joy, love, sadness, and surprise.

Dataset Breakdown:

Train: 16,000 rows
Validation: 2,000 rows
Test: 2,000 rows
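
As a quick sanity check on the split sizes listed above, here is a minimal sketch of loading the data and counting rows per split. The dataset id `dair-ai/emotion` and the helper names `load_emotion` and `split_sizes` are assumptions made for illustration; the project's notebook may load the data differently.

```python
# Split sizes as stated in this README.
EXPECTED_SIZES = {"train": 16000, "validation": 2000, "test": 2000}


def load_emotion(dataset_id="dair-ai/emotion"):
    """Download the Emotion dataset (needs network access).

    The dataset id is an assumption; adjust it if your copy lives elsewhere.
    """
    # Imported lazily so the rest of this block runs without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(dataset_id)


def split_sizes(splits):
    """Return {split_name: number_of_rows} for any mapping of sized splits."""
    return {name: len(rows) for name, rows in splits.items()}
```

With the `datasets` package installed, `split_sizes(load_emotion()) == EXPECTED_SIZES` should hold if the downloaded copy matches the breakdown above.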

Labels:

0: sadness
1: joy
2: love
3: anger
4: fear
5: surprise
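
The label table above can be kept in code as a small lookup, which is handy when turning model outputs back into emotion names. This is only a sketch; the names `ID2LABEL`, `LABEL2ID`, `CLASS_NAMES`, and `decode_labels` are illustrative and not taken from the project code.

```python
# Integer id -> emotion name, copied from the label table in this README.
ID2LABEL = {
    0: "sadness",
    1: "joy",
    2: "love",
    3: "anger",
    4: "fear",
    5: "surprise",
}

# Reverse lookup: emotion name -> integer id.
LABEL2ID = {name: idx for idx, name in ID2LABEL.items()}

# Class names ordered by id, the form most training APIs expect.
CLASS_NAMES = [ID2LABEL[i] for i in range(len(ID2LABEL))]


def decode_labels(label_ids):
    """Map an iterable of integer label ids to emotion names."""
    return [ID2LABEL[int(i)] for i in label_ids]
```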

🛠️ Tech Stack

Language: Python
Libraries: datasets, numpy, pandas, matplotlib, seaborn, ktrain, transformers, tensorflow, sklearn
Environment: Jupyter Notebook, Google Colab Pro (Recommended)

🚀 Approach

Install Libraries: Ensure all necessary libraries are installed.
Load Dataset: Load and explore the Emotion dataset.
Data Preprocessing: Convert each dataset split to a pandas DataFrame and create additional features.
Data Visualization: Use histograms to visualize data distribution.
Model Training:
- RoBERTa:
  - Create and configure the model.
  - Preprocess data, compile the model, and find optimal learning rates.
  - Fine-tune the model and evaluate its performance.
  - Save and test the model.
- XLNet:
  - Follow the same steps as for RoBERTa, and cover the difference between autoregressive and autoencoding models.
Performance Evaluation: Evaluate both models on test data and compare their metrics.
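
The model training and evaluation steps above can be sketched with ktrain's Transformer wrapper. This is a hedged outline, not the project's actual code: the checkpoint names (`roberta-base`, `xlnet-base-cased`), the hyperparameter defaults, and the helpers `fine_tune` and `accuracy` are all assumptions chosen for illustration.

```python
# Hugging Face checkpoint names. Assumption: the README does not say which
# checkpoints the project uses.
CHECKPOINTS = {"roberta": "roberta-base", "xlnet": "xlnet-base-cased"}


def fine_tune(model_key, x_train, y_train, x_val, y_val, class_names,
              lr=2e-5, epochs=1, maxlen=128, batch_size=16):
    """Fine-tune one model with ktrain and return a predictor.

    x_* are lists of strings and y_* are integer label ids. All numeric
    defaults are placeholders, not tuned values from the project.
    """
    # Imported lazily so this block can be loaded without ktrain installed.
    import ktrain
    from ktrain import text

    t = text.Transformer(CHECKPOINTS[model_key], maxlen=maxlen,
                         class_names=class_names)
    trn = t.preprocess_train(x_train, y_train)   # tokenize training texts
    val = t.preprocess_test(x_val, y_val)        # tokenize validation texts
    model = t.get_classifier()                   # compiled Keras classifier
    learner = ktrain.get_learner(model, train_data=trn, val_data=val,
                                 batch_size=batch_size)
    # Optional learning-rate search before training:
    # learner.lr_find(show_plot=True, max_epochs=1)
    learner.fit_onecycle(lr, epochs)             # one-cycle fine-tuning
    learner.validate(class_names=t.get_classes())  # prints a per-class report
    return ktrain.get_predictor(learner.model, preproc=t)


def accuracy(true_names, predicted_names):
    """Fraction of predictions that match the true emotion names."""
    pairs = list(zip(true_names, predicted_names))
    return sum(t == p for t, p in pairs) / len(pairs)
```

A predictor returned by `fine_tune` can be stored with `predictor.save(path)` and queried with `predictor.predict(texts)`, which returns class names. Calling `accuracy` on the test labels and each model's predictions gives one simple number for comparing RoBERTa and XLNet.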

📂 Project Structure

Modular Code

src: Contains modularized code for the entire project.
- Engine.py: Main script to run the project.
- ML_Pipeline: Folder with functions for data processing and model training.
output: Contains trained models for easy loading and reuse.
lib: Contains Jupyter notebooks and reference materials.

📝 Project Takeaways

Understand business problems in NLP.
Explore Transformer architectures and self-attention mechanisms.
Gain insights into RoBERTa and XLNet models.
Learn data preprocessing and visualization techniques.
Develop and fine-tune Transformer models.
Compare and evaluate model performances.

📦 Setup Instructions

Prerequisites

Ensure git is installed on your machine.

Installation

Clone the repo

git clone https://github.com/Vidhi1290/Text-Classification-with-Transformers-RoBERTa-and-XLNet-Model.git

Navigate to the project directory

cd Text-Classification-with-Transformers-RoBERTa-and-XLNet-Model

Install dependencies

pip install -r modular_code/requirements.txt
