Skip to content

SachinPrasanth777/VisualSense

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

VisualSense

VisualSense

Welcome to the VisualSense repository! The problem being addressed by this project is the significant barrier faced by visually impaired individuals in comprehending visual content in their surroundings. Whether encountering images in real-time through a camera or uploading pictures for analysis, visually impaired individuals often struggle to understand the contents of these images without sighted assistance. This project seeks to bridge this gap by developing a web application that utilizes Visual Question Answering (VQA) technology. Through VQA, users can interactively ask questions about images, enabling them to gain a better understanding of the visual content independently. By providing this tool, the project aims to enhance the accessibility of visual information for visually impaired individuals and promote their autonomy and inclusion.

VisualSense

VisualSense

Features:

  • Real-time Image Analysis: Instantly analyze images from the device's camera.

  • Interactive Questioning: Users ask questions about image content using natural language.

  • AI-driven Answering: Spoken answers generated based on image analysis and user queries.

  • Flexible Image Upload: Ability to upload images for analysis from device storage.

  • Accessibility Features: User-friendly interface with compatibility for screen readers.

  • Promotes Independence: Empowers visually impaired individuals to access visual information independently.

  • Inclusion: Reduces reliance on sighted assistance, fostering greater inclusion.

🛠 Installation Steps:

1. Front-End

cd frontend
npm i
npm run dev

2. Back-End

cd backend
uvicorn index:app --reload

💻 Built with

Technologies used in the project:

  • PyTorch
  • FastAPI
  • NextJS
  • Hugging Faces Transformers
  • Gemini API

If you encounter any issues or have suggestions for improvements, please create an issue or pull request on GitHub.

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.3%
  • Other 0.7%