The goal of this project is to develop a deep learning system that assists visually impaired individuals in obtaining information by describing the images they take. The system combines a CNN model and an NLP model into a single image captioning pipeline: the CNN extracts image features, and the language model generates a text sequence describing the image.
The project incorporates state-of-the-art pre-trained models (ResNet50, VGG16, and VGG19) for image feature extraction, and LSTM and Bidirectional LSTM networks for text generation. Several model combinations were evaluated; the best-performing one achieved a BLEU score of 0.61 and was deployed with Flask and pyttsx3 to provide the web interface and text-to-speech functionality in the app.
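The pipeline follows the common encoder-decoder ("merge") design for image captioning: the pre-trained CNN encodes a photo into a fixed-length feature vector, an LSTM encodes the caption generated so far, and the two streams are combined to predict the next word. The sketch below illustrates this in Keras with VGG16 and a single LSTM; the layer sizes, `vocab_size`, and `max_length` are illustrative assumptions, not the exact values used in this repository.

```python
# Minimal sketch of the CNN + LSTM "merge" captioning architecture, assuming
# Keras/TensorFlow. Layer sizes, vocab_size, and max_length are placeholders.
from tensorflow.keras.applications.vgg16 import VGG16
from tensorflow.keras.models import Model
from tensorflow.keras.layers import Input, Dense, Dropout, Embedding, LSTM, add

vocab_size = 8000   # assumed size of the caption vocabulary
max_length = 35     # assumed maximum caption length in tokens

# Feature extractor: VGG16 without its classification head (fc2 output = 4096-d).
cnn = VGG16()
cnn = Model(inputs=cnn.inputs, outputs=cnn.layers[-2].output)

# Image branch: project the 4096-d CNN feature vector into the decoder dimension.
inputs1 = Input(shape=(4096,))
fe1 = Dropout(0.5)(inputs1)
fe2 = Dense(256, activation='relu')(fe1)

# Text branch: embed the partial caption and encode it with an LSTM.
inputs2 = Input(shape=(max_length,))
se1 = Embedding(vocab_size, 256, mask_zero=True)(inputs2)
se2 = Dropout(0.5)(se1)
se3 = LSTM(256)(se2)

# Merge the two branches and predict the next word of the caption.
decoder1 = add([fe2, se3])
decoder2 = Dense(256, activation='relu')(decoder1)
outputs = Dense(vocab_size, activation='softmax')(decoder2)

model = Model(inputs=[inputs1, inputs2], outputs=outputs)
model.compile(loss='categorical_crossentropy', optimizer='adam')
```

At inference time the decoder runs word by word, feeding each predicted token back into the text branch until an end-of-sequence token is produced or `max_length` is reached.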
These instructions will get you a copy of the project up and running on your local machine.
- Clone the project repository from GitHub: `git clone https://github.com/ammarlodhi255/image-captioning-system-to-assist-the-blind.git`
- Navigate to the project directory: `cd image-captioning-system-to-assist-the-blind`
- Create a virtual environment for the project: `python3 -m venv env`
- Activate the virtual environment: `source env/bin/activate`
- Install the project dependencies (Flask, pyttsx3, and the deep learning libraries; use the repository's requirements file if one is provided): `pip install -r requirements.txt`
- Export the Flask app: `export FLASK_APP=app.py`
- Run the Flask app: `flask run` (see the sketch below for how the app can combine captioning with text-to-speech)
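Once running, the app exposes a web interface where an image can be submitted, captioned by the trained model, and read aloud. The sketch below shows one way Flask and pyttsx3 can be wired together for this; the route, the upload path, and the `generate_caption` placeholder are hypothetical stand-ins for the repository's actual code in `app.py`.

```python
# Minimal sketch of the Flask + pyttsx3 deployment. The route and the
# generate_caption() placeholder are hypothetical, not the repository's code.
from flask import Flask, request
import pyttsx3

app = Flask(__name__)

def generate_caption(image_path):
    # Placeholder: in the real app this would run the CNN + LSTM model on the image.
    return "a person riding a bicycle down a city street"

@app.route('/caption', methods=['POST'])
def caption():
    image = request.files['image']      # photo uploaded by the user
    path = '/tmp/' + image.filename
    image.save(path)
    text = generate_caption(path)       # describe the image
    engine = pyttsx3.init()             # text-to-speech: read the caption aloud
    engine.say(text)
    engine.runAndWait()
    return text
```

Note that pyttsx3 is an offline engine that speaks through the host machine's audio device, which suits a locally hosted app of this kind.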
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (`git checkout -b feature/AmazingFeature`)
- Commit your changes (`git commit -m 'Add some AmazingFeature'`)
- Push to the branch (`git push origin feature/AmazingFeature`)
- Open a Pull Request