This repository contains a fine-tuned T5-small model that performs Skill Prediction: given a piece of text, such as a job description (JD) or resume, it extracts the hard technical skills mentioned in it. The model has been fine-tuned on a dataset of texts annotated with their skills.
The model uses the T5-small architecture, a Transformer-based encoder-decoder. It has been fine-tuned as a text-to-text transformation, where the input is the raw text and the output is a list of hard technical skills. A minimal inference sketch follows the list below.
- T5 (Text-to-Text Transfer Transformer) treats every NLP task as a text-to-text problem, making it versatile for tasks such as text generation and extraction.
- The small variant is computationally efficient, making it suitable for fine-tuning with limited resources.
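As a rough illustration of inference with the fine-tuned model, here is a minimal sketch using the Transformers library. The repository ID `your-username/t5-small-skill-prediction`, the example text, and the absence of a task prefix are assumptions; substitute the actual model path and any prefix used during fine-tuning.

```python
from transformers import T5ForConditionalGeneration, T5Tokenizer

# Hypothetical repository ID -- replace with this repo's actual model path.
model_name = "your-username/t5-small-skill-prediction"
tokenizer = T5Tokenizer.from_pretrained(model_name)
model = T5ForConditionalGeneration.from_pretrained(model_name)

text = (
    "We are looking for a backend engineer with experience in Python, "
    "Docker, and PostgreSQL."
)

# Tokenize the input and generate the skill list as text.
inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
outputs = model.generate(**inputs, max_length=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
# Expected output (illustrative): "Python, Docker, PostgreSQL"
```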
The training dataset consists of text samples paired with annotated technical skills.
The dataset is stored in the following format:
- `resume_text`: The input text (e.g., JD or resume).
- `skills`: The annotated technical skills.
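For concreteness, the snippet below shows what a record might look like and how the data could be loaded with the `datasets` library. The JSON Lines file name `skills_dataset.jsonl` and the field values are illustrative assumptions, not files shipped with this repository.

```python
from datasets import load_dataset

# Hypothetical JSON Lines file; each record pairs input text with its skills:
# {"resume_text": "Built REST APIs in Python with Flask, deployed on AWS.",
#  "skills": "Python, Flask, REST, AWS"}
dataset = load_dataset("json", data_files="skills_dataset.jsonl")

example = dataset["train"][0]
print(example["resume_text"])
print(example["skills"])
```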
The fine-tuning process trains the T5-small model using the Hugging Face Transformers library. It involves the following steps (a code sketch appears after the list):
- Load the Dataset: Use the provided dataset with `resume_text` and `skills`.
- Tokenization: Use the T5 tokenizer to tokenize the input text and the target text.
- Fine-tuning: Train the model using standard training arguments for text generation.
- Evaluation: Evaluate the model's performance on the test set.
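The following is a condensed sketch of these four steps, assuming the hypothetical JSON Lines file from the earlier example; the hyperparameters (learning rate, batch size, epoch count, test split size) are placeholders rather than the values used to train this model.

```python
from datasets import load_dataset
from transformers import (
    DataCollatorForSeq2Seq,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
    T5ForConditionalGeneration,
    T5Tokenizer,
)

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

# Load the dataset and hold out a test split (file name is an assumption).
raw = load_dataset("json", data_files="skills_dataset.jsonl")["train"]
splits = raw.train_test_split(test_size=0.1)

def preprocess(batch):
    # Tokenize the input text; tokenize the skill list as the target labels.
    model_inputs = tokenizer(batch["resume_text"], max_length=512, truncation=True)
    labels = tokenizer(text_target=batch["skills"], max_length=64, truncation=True)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

tokenized = splits.map(preprocess, batched=True, remove_columns=["resume_text", "skills"])

# Hyperparameters below are placeholders, not the values used for this model.
training_args = Seq2SeqTrainingArguments(
    output_dir="t5-small-skill-prediction",
    learning_rate=3e-4,
    per_device_train_batch_size=8,
    num_train_epochs=3,
    predict_with_generate=True,
)

trainer = Seq2SeqTrainer(
    model=model,
    args=training_args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["test"],
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
)

trainer.train()
print(trainer.evaluate())  # evaluates on the held-out test split
```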