This project is a local Retrieval-Augmented Generation (RAG) pipeline built from scratch, without frameworks such as LangChain. The pipeline is connected to a local LLM and is deployed as a chatbot via Gradio. The source material is "Human Nutrition: 2020 Edition".
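The retrieval step at the core of such a pipeline can be sketched with plain NumPy: embed the query, score it against precomputed chunk embeddings by cosine similarity, and keep the top matches. The function name and toy vectors below are illustrative, not the project's actual code:

```python
import numpy as np

def retrieve(query_emb: np.ndarray, chunk_embs: np.ndarray, k: int = 3) -> np.ndarray:
    """Return indices of the k chunks most similar to the query (cosine similarity)."""
    # Normalise both sides so the dot product equals cosine similarity.
    q = query_emb / np.linalg.norm(query_emb)
    c = chunk_embs / np.linalg.norm(chunk_embs, axis=1, keepdims=True)
    scores = c @ q
    # Highest-scoring chunk indices first.
    return np.argsort(scores)[::-1][:k]

# Toy example: 4 chunk embeddings in 2 dimensions.
chunks = np.array([[1.0, 0.0], [0.0, 1.0], [0.7, 0.7], [-1.0, 0.0]])
query = np.array([1.0, 0.1])
print(retrieve(query, chunks, k=2))  # → [0 2] (chunk 0 most similar, then chunk 2)
```

In the real pipeline the embeddings would come from all-mpnet-base-v2 (768 dimensions) rather than these toy vectors.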
- Embedding Model: all-mpnet-base-v2
- LLM Model: Gemma instruction-tuned (the specific variant is selected automatically based on hardware capabilities)
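Hardware-based variant selection can be sketched as a simple decision on available GPU memory, choosing between the 2B and 7B instruction-tuned checkpoints and whether to 4-bit quantise (via bitsandbytes). The thresholds below are illustrative assumptions, not necessarily the project's exact cut-offs:

```python
def pick_gemma_variant(gpu_memory_gb: float) -> tuple[str, bool]:
    """Return (model id, use 4-bit quantisation) for the available GPU memory.

    Thresholds are illustrative assumptions, not the project's exact cut-offs.
    """
    if gpu_memory_gb >= 19:
        return "google/gemma-7b-it", False   # 7B in full/half precision
    if gpu_memory_gb >= 8.1:
        return "google/gemma-7b-it", True    # 7B, 4-bit quantised
    if gpu_memory_gb >= 5.1:
        return "google/gemma-2b-it", False   # 2B in full/half precision
    return "google/gemma-2b-it", True        # 2B, 4-bit quantised

print(pick_gemma_variant(12.0))  # → ('google/gemma-7b-it', True)
```

In practice the available memory would be read from `torch.cuda` before loading the model with transformers/accelerate.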
- PyMuPDF==1.23.26
- matplotlib==3.8.3
- numpy==1.26.4
- pandas==2.2.1
- Requests==2.31.0
- sentence_transformers==2.5.1
- spacy
- tqdm==4.66.2
- transformers==4.38.2
- accelerate
- bitsandbytes
- jupyter
- wheel
- gradio
- huggingface-hub
Type these commands in a terminal/cmd/conda prompt:
conda env create -f environment.yml
pip install -r requirements.txt
python main.py
or python app.py
Here app.py contains the Gradio deployment for this project, and main.py runs the project through user input in the terminal.
- Upload run_on_colab.ipynb to Google Colab
- Clone this repo to Google Drive
- Open run_on_colab.ipynb
- Adjust the path in Google Colab accordingly
- Run the cell blocks
This project requires a CUDA-compatible GPU to run
- Enable the chatbot to also use the query history when responding
- Improve text preprocessing to get better RAG performance
- Integrate a re-ranker model to get better RAG results
- Improve the prompt
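A baseline prompt-assembly step, of the kind the last item above aims to improve, can be sketched as below; the template wording is a generic assumption, not the project's actual prompt:

```python
def build_prompt(query: str, context_chunks: list[str]) -> str:
    """Assemble a RAG prompt: retrieved passages followed by the user query.

    The template is a generic sketch, not the project's exact wording.
    """
    context = "\n".join(f"- {chunk}" for chunk in context_chunks)
    return (
        "Answer the query using only the following context from the textbook.\n"
        f"Context:\n{context}\n"
        f"Query: {query}\n"
        "Answer:"
    )

print(build_prompt("What are macronutrients?",
                   ["Carbohydrates, proteins and fats provide energy."]))
```

Improvements here could include few-shot examples, Gemma's chat turn markers, and instructions for handling questions the context cannot answer.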
- Many thanks to Daniel Bourke for the video guidance on this project
- Many thanks to the University of Hawai‘i at Mānoa Food Science and Human Nutrition Program for the open-source textbook "Human Nutrition: 2020 Edition", which was used as the source material for this project