Skip to content

Persian text generation using GRU model and persian wikipedia dataset

Notifications You must be signed in to change notification settings

MehrabKalantary/Persian-Text-Generation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 

Repository files navigation

Persian Sequence Prediction using GRU Model

In this notebook, we create a model to generate persian text using GRU model and persian wikipedia dataset

Dataset on kaggle

About Dataset

The dataset is huge so we train our model on a small sample.

Encoding Text

At first, we extract all vocabularies and assign a number to each of them.

  • char2index: for encoding purposes
  • index2char: for text generation purposes

Before modeling, we create sequences of data.

Modeling

We use GRU model with 1024 units and a dense layer to predict probability of each character.

Prediction

At the end, we use our model to generate text.

About

Persian text generation using GRU model and persian wikipedia dataset

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published