Image Captioning

Model using Inception Net V3, LSTM and Glove (Using SSD300 to improve feature)

How to run?

Install lib

pip install -r requirements.txt

Download data

python data_download.py

Note : Dataset is MS COCO 2014 and Glove <Wikipedia 2014 + Gigaword 5>. This is large dataset, long download.

Preprocess

You must run

python preprocess.py

With COCO datase this command runs for a long time you can download and coppy them to ROOT / process_data

Download here

Trainning

python train.py --batch-size 64 --output weights --epochs 30

You can download pre-train model an copy them to ROOT / weights

Download here

Predict

python predict.py --image path/to/image --weight path/to/weight --k-beam 9

Result

You can see sumary in summary.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
ssd300		ssd300
.gitignore		.gitignore
README.md		README.md
constants.py		constants.py
data_download.py		data_download.py
evaluate.py		evaluate.py
model.png		model.png
models.py		models.py
output.png		output.png
predict.py		predict.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
summary.ipynb		summary.ipynb
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Captioning

Model using Inception Net V3, LSTM and Glove (Using SSD300 to improve feature)

How to run?

Install lib

Download data

Preprocess

Download here

Trainning

Download here

Predict

Result

About

Languages

quanpersie2001/ImageCaptioning

Folders and files

Latest commit

History

Repository files navigation

Image Captioning

Model using Inception Net V3, LSTM and Glove (Using SSD300 to improve feature)

How to run?

Install lib

Download data

Preprocess

Download here

Trainning

Download here

Predict

Result

About

Topics

Resources

Stars

Watchers

Forks

Languages