Skip to content

Predicting a caption for a given image using Inception Net V3, LSTM and Glove

Notifications You must be signed in to change notification settings

quanpersie2001/ImageCaptioning

Repository files navigation

Image Captioning

Model using Inception Net V3, LSTM and Glove (Using SSD300 to improve feature)

image

How to run?

Install lib

pip install -r requirements.txt

Download data

python data_download.py

Note : Dataset is MS COCO 2014 and Glove <Wikipedia 2014 + Gigaword 5>. This is large dataset, long download.

Preprocess

You must run

python preprocess.py

With COCO datase this command runs for a long time you can download and coppy them to ROOT / process_data

Trainning

python train.py --batch-size 64 --output weights --epochs 30

You can download pre-train model an copy them to ROOT / weights

Predict

python predict.py --image path/to/image --weight path/to/weight --k-beam 9

Result

image You can see sumary in summary.ipynb

About

Predicting a caption for a given image using Inception Net V3, LSTM and Glove

Topics

Resources

Stars

Watchers

Forks