DetectiNator - Image Detection and Captioning App

A Streamlit web application that performs object detection using YOLOv4 and generates image captions using BLIP transformer model.

Features

Upload images for object detection
Real-time object detection using YOLOv4
Image captioning using BLIP (Salesforce)
Clean and intuitive user interface
Object detection confidence scores
Automatic image resizing for optimal processing

Installation

Clone the repository:

git clone <repository-url>
cd DetectiNator

Install the required dependencies:

pip install -r requirements.txt

Required packages:

streamlit
transformers
PIL
torch
cvlib
opencv-python (cv2)
numpy

Usage

Run the Streamlit app:

streamlit run main.py

Upload an image using the file uploader
Click "Detect" to perform object detection
View the detected objects with bounding boxes
Read the automatically generated caption describing the scene

How It Works

Image Upload: Users can upload images in PNG or JPG format
Object Detection: Uses YOLOv4 model through cvlib to detect common objects
Visualization: Displays detected objects with bounding boxes
Captioning: Generates descriptive captions using BLIP transformer model
Display: Shows both the annotated image and generated caption

Technical Details

Object Detection: YOLOv4 (via cvlib)
Image Captioning: BLIP (Salesforce/blip-image-captioning-base)
Frontend: Streamlit
Image Processing: OpenCV
Deep Learning: PyTorch

Requirements

Python 3.7+
Adequate RAM for model inference
GPU recommended for faster processing
Internet connection for model downloads

Limitations

Supports only static image processing
Limited to common object detection
Requires stable internet for first-time model downloads

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
main.py		main.py
object_detection.jpg		object_detection.jpg
requirements.txt		requirements.txt
ss.png		ss.png
test.jpg		test.jpg
test2.jpg		test2.jpg
test3.jpg		test3.jpg
test4.jpg		test4.jpg
test5.jpg		test5.jpg
test6.jpg		test6.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DetectiNator - Image Detection and Captioning App

Features

Installation

Usage

How It Works

Technical Details

Requirements

Limitations

License

About

Releases

Packages

Languages

Rishav-Raj-Sinha/Image-detection-and-Caption-generation

Folders and files

Latest commit

History

Repository files navigation

DetectiNator - Image Detection and Captioning App

Features

Installation

Usage

How It Works

Technical Details

Requirements

Limitations

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages