
Work with TensorFlow Hub for Google Summer of Code 2019. Features a prototype ULMFiT implementation, a pretrained embedding exporter, and a Bangla text classification notebook.


Tahsin-Mayeesha/gsoc2019


GSOC 2019

Part 1

  • Minor fixes to the Text Classification with Kaggle notebook. Link here.

Part 2

Demonstration notebook that classifies Bangla articles using FastText embeddings exported with the embedding exporter tool.
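As a rough illustration of the input side of such a pipeline, the sketch below parses FastText's text vector format (a header line with the word count and dimension, then one word and its values per line) and averages word vectors into a document vector. It is not the exporter tool or the notebook's code. The function names `load_vec` and `average_embedding`, the tiny in-memory sample, and the choice to skip unknown tokens are all assumptions made for this example.

```python
import io


def load_vec(stream, max_words=None):
    """Parse FastText text-format vectors into (vocab, vectors, dim).

    Hypothetical helper for illustration; not part of the exporter tool.
    """
    header = stream.readline().split()
    dim = int(header[1])  # header is "<word count> <dimension>"
    vocab, vectors = {}, []
    for line in stream:
        # Split from the right so the last `dim` fields are the values.
        parts = line.rstrip().rsplit(" ", dim)
        if len(parts) != dim + 1:
            continue  # skip malformed lines
        vocab[parts[0]] = len(vectors)
        vectors.append([float(x) for x in parts[1:]])
        if max_words is not None and len(vectors) >= max_words:
            break
    return vocab, vectors, dim


def average_embedding(tokens, vocab, vectors, dim):
    """Average the vectors of known tokens; unknown tokens are skipped (an assumption)."""
    total = [0.0] * dim
    known = 0
    for tok in tokens:
        idx = vocab.get(tok)
        if idx is None:
            continue
        known += 1
        for i, value in enumerate(vectors[idx]):
            total[i] += value
    if known == 0:
        return total  # all-zero vector when no token is in the vocabulary
    return [t / known for t in total]


# Tiny made-up sample in the same layout as a FastText .vec file.
SAMPLE = "2 3\nবাংলা 0.5 1.0 1.5\nখবর 1.5 2.0 2.5\n"
vocab, vectors, dim = load_vec(io.StringIO(SAMPLE))
doc_vec = average_embedding(["বাংলা", "খবর", "অজানা"], vocab, vectors, dim)
```

A document vector built this way could feed any downstream classifier; a real run would read a downloaded `.vec` file instead of the in-memory sample.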

Part 3

Prototype module that attempts to implement the ULMFiT technique from fast.ai's NLP research. It trains a language model on the WikiText-103 dataset, then fine-tunes it on the IMDB sentiment classification dataset to classify IMDB reviews. The implementation is still buggy. Details are in the notebooks and scripts included in this repository.
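The README does not say which ULMFiT components the prototype covers. For orientation, one ingredient described in the ULMFiT paper is the slanted triangular learning rate: a short linear warm-up followed by a long linear decay. The sketch below follows the paper's formula with its suggested defaults (`cut_frac=0.1`, `ratio=32`, `lr_max=0.01`). The function name and the guard against a zero-length warm-up are assumptions of this example, and it is not taken from this repository's scripts.

```python
import math


def slanted_triangular_lr(t, total_steps, lr_max=0.01, cut_frac=0.1, ratio=32):
    """Learning rate at step t under a slanted triangular schedule.

    Illustrative sketch of the schedule described in the ULMFiT paper.
    """
    # Step at which the rate stops increasing; at least 1 to avoid division by zero.
    cut = max(1, math.floor(total_steps * cut_frac))
    if t < cut:
        p = t / cut  # linear warm-up phase
    else:
        p = 1 - (t - cut) / (cut * (1 / cut_frac - 1))  # linear decay phase
    return lr_max * (1 + p * (ratio - 1)) / ratio


# Example: the schedule over 1000 training steps.
schedule = [slanted_triangular_lr(t, 1000) for t in range(1000)]
peak_step = schedule.index(max(schedule))
```

With these defaults the rate starts at `lr_max / ratio`, peaks at `lr_max` after the first 10% of steps, and decays back toward `lr_max / ratio`.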

Training loss of the language model: (figure omitted)
