Transformer-based Chat Model

Overview

This repository contains a Transformer-based chat model implemented using TensorFlow and Keras. The model is trained to predict answers based on provided background and question text.

File Structure

ChatBot_architecture_from_scratch.ipynb: The Python script containing the model and associated functions.

How to Use

Setup

Install the required Python packages: pandas, numpy, tensorflow, scikit-learn.
```
pip install pandas numpy tensorflow scikit-learn
```
Place your dataset.csv file in the same directory as ChatBot_architecture_from_scratch.ipynb.

Run the Model

Navigate to the folder containing ChatBot_architecture_from_scratch.ipynb.
Rename ChatBot_architecture_from_scratch.ipynb to ChatBot_architecture_from_scratch.py
Run the script to train the model and to use the chat function. Optionally, we can proced with that as .ipynb file, however, we recommend to use rename it to .py as it make it more straightforward taks and better seuted for this example/tutorial/process.
```
python ChatBot_architecture_from_scratch.py
```

Code Explanation

The script starts by importing necessary modules and loading the dataset.
A Tokenizer is trained on the text data.
Text sequences are tokenized and padded.
The data is split into training and validation sets.
A Transformer model is created using Keras layers.
The model is compiled and trained.
A chat function (chat()) uses the trained model to predict answers.

Customizing the Model

To Use Different Tokenization

You can replace the Tokenizer with any other tokenization method or library. Make sure to update the tokenize_and_pad() function accordingly.

To Use Different Model Architectures

We can replace the Transformer encoder with other model architectures like LSTM, GRU, or custom layers. Make sure to update the model creation and compilation steps.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.github/workflows		.github/workflows
Web UI		Web UI
ChatBot_architecture_from_scratch.ipynb		ChatBot_architecture_from_scratch.ipynb
Data_Analysis_and_Trivial_modeling.ipynb		Data_Analysis_and_Trivial_modeling.ipynb
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transformer-based Chat Model

Overview

File Structure

How to Use

Setup

Run the Model

Code Explanation

Customizing the Model

To Use Different Tokenization

To Use Different Model Architectures

About

Releases

Packages

Languages

License

ViktorVeselov/Chatbot-and-UI-Architecture

Folders and files

Latest commit

History

Repository files navigation

Transformer-based Chat Model

Overview

File Structure

How to Use

Setup

Run the Model

Code Explanation

Customizing the Model

To Use Different Tokenization

To Use Different Model Architectures

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages