Skip to content

A merge model architecture made up of VGG19 and LSTMs to generate a brief caption of an input image.

License

Notifications You must be signed in to change notification settings

jaykshirsagar05/captionify

Repository files navigation

Captionify

Open In Colab

Description

A webapp which can generate brief captions from images. We have used a merge model similar to "Show and tell architecture" to generate brief captions. We trained model on flickr8k dataset with the help of google colab.

We have used a python based API framework named FastAPI and for frontend we used Streamlit framework.

Visuals

Below is the demo output of our model

alt_text

Installation

Basic requirements

  • Python3
  • Docker

Deploy as a docker image and run

Steps to run in your local system:

git clone https://github.com/jaykshirsagar05/captionify.git
cd Captionify
Docker-compose build
Docker-compose up

visit to http://172.19.0.3:8501/ for streamlit app.

visit to for http://127.0.0.1:8000/docs server side(fastapi)

NOTE: You need to change the path of pre-trained model in caption.py file.

Roadmap

This project is open sourced.

Contributing

Anyone is welcomed to contribute to this project.

Authors and acknowledgement

Authors:

Jay Kshirsagar

Qiaochu Xiong

Acknowledgements:

Project Status

Our project is mostly completed, but the prediction of model is not accurate. Further improvement is welcomed!

About

A merge model architecture made up of VGG19 and LSTMs to generate a brief caption of an input image.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published