Chat with Lex RAG

This project implements a chatbot that uses Retrieval-Augmented Generation (RAG) to answer questions about specific episodes of the Lex Fridman podcast published on YouTube. The chatbot has a user-friendly Streamlit interface and can also be run locally, free of charge, by using Ollama for LLM generation.

The full dataset for the Lex Fridman podcast is available in the folder 'JSON_storage'. Alternatively, you can build the dataset yourself by using the project YouTube Channel Video Tracker and then following the steps in the notebooks create_video_db.ipynb and create_transcript_db.ipynb.
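
As a rough illustration of working with the local dataset, the sketch below loads every JSON file found in a folder. It assumes that 'JSON_storage' contains plain `.json` files. The function name `load_json_folder` is hypothetical, and the internal schema of the files is not described here, so the parsed objects are returned as-is.

```python
import json
from pathlib import Path


def load_json_folder(folder: str = "JSON_storage") -> dict[str, object]:
    """Load every *.json file in `folder`, keyed by file name without extension.

    Hypothetical helper: the real layout of JSON_storage is an assumption.
    """
    data = {}
    for path in sorted(Path(folder).glob("*.json")):
        with path.open(encoding="utf-8") as handle:
            data[path.stem] = json.load(handle)
    return data


if __name__ == "__main__":
    if Path("JSON_storage").is_dir():
        loaded = load_json_folder()
        print(f"Loaded {len(loaded)} JSON file(s)")
    else:
        print("JSON_storage not found in the current directory")
```

Inspecting the returned dictionary is a quick way to see which fields the stored files actually carry before relying on any of them.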

Features

Local database with more than 400 Lex Fridman podcast episodes
Interactive chat interface to explore podcast episodes via Streamlit
RAG-based question answering system
Support for MongoDB and local JSON storage
Step-by-step guide on how to prepare the dataset
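
To make the RAG idea concrete, here is a minimal, self-contained sketch of the retrieve-then-prompt flow: split a transcript into overlapping chunks, rank the chunks against the question, and build a prompt from the best matches. It is not the project's implementation. The ranking uses simple word-count cosine similarity as a stand-in for the embedding search a real RAG system would typically use, and all function names and default values are assumptions made for illustration.

```python
import math
import re
from collections import Counter


def chunk_text(text: str, size: int = 40, overlap: int = 10) -> list[str]:
    """Split text into windows of `size` words that share `overlap` words."""
    if size <= overlap:
        raise ValueError("size must be larger than overlap")
    words = text.split()
    if not words:
        return []
    step = size - overlap
    last_start = max(len(words) - overlap, 1)
    return [" ".join(words[i:i + size]) for i in range(0, last_start, step)]


def vectorize(text: str) -> Counter:
    """Bag-of-words term counts (stand-in for a neural embedding)."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(question: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the question."""
    q = vectorize(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, vectorize(c)), reverse=True)
    return ranked[:k]


def build_prompt(question: str, context: list[str]) -> str:
    """Assemble the text that would be sent to the LLM."""
    joined = "\n---\n".join(context)
    return (
        "Answer using only this transcript context:\n"
        f"{joined}\n\nQuestion: {question}"
    )


if __name__ == "__main__":
    transcript = (
        "We talked about robots and artificial intelligence research. "
        "Later the conversation moved to cooking pasta with tomato sauce."
    )
    chunks = chunk_text(transcript, size=8, overlap=2)
    question = "What was said about artificial intelligence?"
    print(build_prompt(question, retrieve(question, chunks, k=1)))
```

The generated prompt would then be passed to whichever LLM backend is configured (Ollama, OpenAI, or Anthropic).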

Prerequisites

One of the following:

Ollama installed locally with at least one model pulled (suggested models are gemma2:2b and llama3.1:8b)
OpenAI API key set up
Anthropic API key set up
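
A quick way to see which of these options is usable on a machine is sketched below. It assumes that Ollama is detectable as an `ollama` executable on the PATH and that the API keys are exposed through the `OPENAI_API_KEY` and `ANTHROPIC_API_KEY` environment variables. The function `available_backends` is a hypothetical helper, not part of the project.

```python
import os
import shutil


def available_backends(env=None, ollama_installed=None) -> list[str]:
    """List the LLM backends that appear to be configured.

    env: mapping of environment variables (defaults to os.environ).
    ollama_installed: override for the PATH check, useful in tests.
    """
    env = os.environ if env is None else env
    if ollama_installed is None:
        # Assumption: a local Ollama install exposes an `ollama` executable.
        ollama_installed = shutil.which("ollama") is not None
    backends = []
    if ollama_installed:
        backends.append("ollama")
    if env.get("OPENAI_API_KEY"):
        backends.append("openai")
    if env.get("ANTHROPIC_API_KEY"):
        backends.append("anthropic")
    return backends


if __name__ == "__main__":
    found = available_backends()
    print("Usable backends:", ", ".join(found) if found else "none")
```

Note that this only checks that Ollama is installed, not that a model such as gemma2:2b has already been pulled.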

Getting Started

Clone the repository:

git clone https://github.com/CharlieNestor/chat_with_Lex_RAG.git

Create your own virtual environment and install the dependencies:

python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Set up environment variables:

Create a .env file in the project root
Add the necessary environment variables, for example:

 MONGO_URI=mongodb://admin:password@localhost:27017/
 OPENAI_API_KEY=your_openai_api_key
 ANTHROPIC_API_KEY=your_anthropic_api_key
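
To check that the .env file is well formed, the following standard-library sketch parses `KEY=value` lines into a dictionary. How the project itself loads the file is not shown here (a library such as python-dotenv is a common choice), so `read_env_file` is a hypothetical helper covering only the simple format used above.

```python
from pathlib import Path


def read_env_file(path: str = ".env") -> dict[str, str]:
    """Parse simple KEY=value lines, skipping blanks and # comments."""
    values = {}
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Split on the first "=" only, so values may contain "=" themselves.
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip("'\"")
    return values


if __name__ == "__main__":
    if Path(".env").is_file():
        # Print only the variable names, never the secret values.
        print("Variables defined:", sorted(read_env_file()))
    else:
        print("No .env file found in the current directory")
```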

Usage

Run the Streamlit app:
```
streamlit run rag_video_streamlit.py
```
Select a podcast episode from the sidebar
Click "Load Interview" to prepare the RAG system
Start chatting and asking questions about the selected episode

Notes

This is still a work-in-progress project.
