Skip to content

Summarize and perform RAG on PPTx/PPT file formats

Notifications You must be signed in to change notification settings

connectaman/RAGAlchamy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RAGAlchamy

RAGAlchamy is a groundbreaking Python package designed to revolutionize the way you interact with PowerPoint presentations. Our package empowers you to effortlessly extract and manipulate a wide range of content within PPT files, including text, charts, and even perform OCR (Optical Character Recognition) on images embedded in your presentations.

Installation

  1. Install / Check python verion -> python>=3.10
  2. Install all required packages
pip install -r requirements.txt
  1. Install tesseract
    • Windows
      pytesseract.pytesseract.tesseract_cmd = r'PATH TO tesseract.exe'
      
    • Linux
      sudo apt update
      sudo apt-get install tesseract-ocr
      
  2. Run the sample notebook