This code is an adaptation of Andrej Karpathy's *The Unreasonable Effectiveness of Recurrent Neural Networks*.
It uses TensorFlow with multi-layer GRUs and gradient clipping. The model can be set to run continuously (num_epochs=0), and it saves a checkpoint to file whenever it finds a better result. Restarting the model will automatically pick up from where it left off.
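To illustrate the gradient-clipping idea the model uses, here is a minimal pure-Python sketch of clipping by global norm (the same idea as TensorFlow's `clip_by_global_norm`); the function name and list-of-lists representation are illustrative only, not the actual code in this repository:

```python
import math

def clip_by_global_norm(grads, max_norm):
    """Scale all gradients by one common factor so their combined L2 norm
    does not exceed max_norm. Hypothetical sketch of the technique."""
    global_norm = math.sqrt(sum(g * g for grad in grads for g in grad))
    if global_norm <= max_norm:
        return grads, global_norm
    scale = max_norm / global_norm
    return [[g * scale for g in grad] for grad in grads], global_norm

grads = [[3.0, 4.0], [0.0, 12.0]]            # global norm is sqrt(9+16+144) = 13
clipped, norm = clip_by_global_norm(grads, 5.0)
```

Clipping the global norm (rather than each gradient independently) preserves the relative direction of the update, which matters for stable RNN training.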
One of the inputs to the model is a directory containing the input files in plain text. I have provided two sources in the repository: Shakespeare and Aesop's fables. The model should work equally well with code, LaTeX, etc.
The code is split into 3 parts:
- main.py - sets up the model parameters. Some arguments can be updated via command-line flags, e.g. gpu (which GPU to use if you have multiple), mode (train/demo), and so on; there are descriptions for each one.
- data_batcher.py - loads the data and sets up batches.
- LSTMCharacterModel.py - sets up the computational graph for the model.
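For a sense of what the batching step involves, here is a hedged sketch of a character-level batcher: inputs are fixed-length character-ID sequences and targets are the same sequences shifted one character ahead. The function and variable names are hypothetical; the actual data_batcher.py may differ in details:

```python
def make_batches(text, batch_size, seq_len):
    """Split text into (inputs, targets) batches of character IDs, where
    each target sequence is its input shifted one character ahead.
    Illustrative sketch only."""
    chars = sorted(set(text))
    char_to_id = {c: i for i, c in enumerate(chars)}
    ids = [char_to_id[c] for c in text]
    batches = []
    step = seq_len * batch_size
    # Need one extra character beyond each chunk for the shifted targets.
    for start in range(0, len(ids) - step - 1, step):
        chunk = ids[start:start + step + 1]
        inputs = [chunk[i * seq_len:(i + 1) * seq_len] for i in range(batch_size)]
        targets = [chunk[i * seq_len + 1:(i + 1) * seq_len + 1] for i in range(batch_size)]
        batches.append((inputs, targets))
    return batches, char_to_id

batches, vocab = make_batches("hello world, hello again", batch_size=2, seq_len=4)
```

Predicting the next character from each position is what makes this a character-level language model.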
Usage: python main.py, with arguments passed as --gpu=0 --mode=demo, etc.
I also added n-gram_model.py, which generates text using n-grams. It's nice to compare both speed and accuracy side by side. The code should be self-explanatory.
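As a rough sketch of how n-gram text generation works (names and details here are illustrative, not necessarily what n-gram_model.py does): map each (n-1)-character context to the characters observed after it, then repeatedly sample a follower of the current context.

```python
import random
from collections import defaultdict

def train_ngram(text, n):
    """Map each (n-1)-character context to the list of characters that
    follow it in the training text. Illustrative sketch only."""
    model = defaultdict(list)
    for i in range(len(text) - n + 1):
        context, nxt = text[i:i + n - 1], text[i + n - 1]
        model[context].append(nxt)
    return model

def generate(model, seed, length, rng):
    """Extend the seed one character at a time by sampling a follower
    of the last (n-1) characters; stop at an unseen context."""
    n_minus_1 = len(seed)
    out = seed
    for _ in range(length):
        followers = model.get(out[-n_minus_1:])
        if not followers:
            break
        out += rng.choice(followers)
    return out

model = train_ngram("the cat sat on the mat", n=3)
sample = generate(model, "th", 20, random.Random(0))
```

Compared with the GRU model, this trains in a single pass and generates instantly, but it can only reproduce contexts it has literally seen, which is exactly the trade-off worth comparing side by side.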