Skip to content

Caption generation is a challenging artificial intelligence problem where a textual description must be generated for a given photograph. It requires both methods from computer vision to understand the content of the image and a language model from the field of natural language processing to turn the understanding of the image into words in the …

Notifications You must be signed in to change notification settings

stgstg27/Caption-Generator-of-Image

Repository files navigation

Caption Generator of Image

A good dataset to use when getting started with image captioning is the Flickr8K dataset.

The reason is because it is realistic and relatively small so that you can download it and build models on your workstation using a CPU.

Getting Dataset

A good dataset to use when getting started with image captioning is the Flickr8K dataset.

The reason is because it is realistic and relatively small so that you can download it and build models on your workstation using a CPU.

You can use the link below to request the dataset: https://illinois.edu/fb/sec/1713398 Within a short time, you will receive an email that contains links to two files:

Flickr8k_Dataset.zip (1 Gigabyte) An archive of all photographs. Flickr8k_text.zip (2.2 Megabytes) An archive of all text descriptions for photographs.

About

Caption generation is a challenging artificial intelligence problem where a textual description must be generated for a given photograph. It requires both methods from computer vision to understand the content of the image and a language model from the field of natural language processing to turn the understanding of the image into words in the …

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages