A simple Streamlit app that does RAG.

## Frontend setup

1. Set up your environment secrets; refer to `.streamlit/secrets.toml.example`. (A sketch of how the app can read these secrets appears after these steps.)

2. Install the requirements:

   ```shell
   cd src
   pip install -r requirements.txt
   ```

3. Run the app:

   ```shell
   streamlit run Chat.py
   ```
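As a quick illustration of how step 1's secrets can be read at runtime, here is a minimal sketch using Streamlit's `st.secrets` API; the key names are hypothetical examples, not the repo's actual schema (check `.streamlit/secrets.toml.example` for the real keys):

```python
import streamlit as st

# Values defined in .streamlit/secrets.toml are exposed via st.secrets.
# These key names are illustrative assumptions, not the repo's actual schema.
mongodb_uri = st.secrets["MONGODB_URI"]          # e.g. a MongoDB Atlas connection string
ollama_endpoint = st.secrets["OLLAMA_ENDPOINT"]  # e.g. the deployed backend URL
```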
## Backend setup

Disclaimer: this backend server is strictly a proof of concept for running LLMs on Modal Labs via Ollama. You are encouraged to support Modal Labs via their paid services for a better experience, support, and system scalability.
1. Set up an account on Modal Labs (see the Modal Labs sign-up page).

2. Clone this repo and change to the project directory using your terminal / command-line interface:

   ```shell
   git clone https://github.com/ProtoFaze/chatbot.git
   cd chatbot
   ```
3. Install Modal's Python library using pip:

   ```shell
   pip install modal
   ```

4. Authenticate your computer against your Modal account, following the instructions in your browser:

   ```shell
   modal setup
   ```
5. Change directory again, into the backend codebase, and deploy the server (a minimal sketch of such a Modal app appears after these steps):

   ```shell
   cd ollama_backend
   modal deploy ollama_backend.py
   ```

   Your terminal / command-line interface should show a URL for accessing the server.
6. In the frontend, navigate to the settings page, fill the URL from the previous step into the Ollama endpoint field, and submit.

7. Try chatting against your own server in the chat page.
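For orientation, here is a minimal sketch of how an ASGI web endpoint can be deployed on Modal, in the spirit of step 5; the app name, image, and route are assumptions for illustration, not the actual contents of `ollama_backend.py`:

```python
import modal

# Container image with FastAPI installed; the real backend likely also needs
# Ollama and model weights baked into the image.
image = modal.Image.debian_slim().pip_install("fastapi[standard]")
app = modal.App("ollama-backend-sketch", image=image)

@app.function()
@modal.asgi_app()  # `modal deploy` prints the public URL for this endpoint
def serve():
    from fastapi import FastAPI

    web_app = FastAPI()

    @web_app.get("/health")
    def health() -> dict:
        # Hypothetical route; useful for verifying the deployment is reachable.
        return {"status": "ok"}

    return web_app
```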
## About

This project is a step-by-step development of an insurance salesman chatbot. The chatbot should be able to access the corpus of data related to the product it is advertising. The bot POC is set up using open-source and free-tier options; no free trials from providers are required.
## Tech stack (POLM)

- Python 3.12
- Ollama for edge-device language model hosting
- LlamaIndex for DAQ and indexing
- Modal Labs for hosting ASGI web endpoints and LLM inference
- MongoDB Atlas for data storage
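To show how these pieces can fit together, here is a hedged sketch of wiring LlamaIndex to an Ollama backend and a MongoDB Atlas vector store; the model names, database names, and endpoint URL are placeholders, not the repo's actual configuration:

```python
import pymongo
from llama_index.core import Settings, SimpleDirectoryReader, StorageContext, VectorStoreIndex
from llama_index.embeddings.ollama import OllamaEmbedding
from llama_index.llms.ollama import Ollama
from llama_index.vector_stores.mongodb import MongoDBAtlasVectorSearch

# Point LlamaIndex at an Ollama server (e.g. the Modal-hosted endpoint).
Settings.llm = Ollama(model="llama3.2", base_url="https://YOUR-MODAL-ENDPOINT.modal.run")
Settings.embed_model = OllamaEmbedding(model_name="nomic-embed-text")

# Use a MongoDB Atlas collection as the vector store.
client = pymongo.MongoClient("YOUR_MONGODB_ATLAS_URI")
vector_store = MongoDBAtlasVectorSearch(client, db_name="chatbot", collection_name="docs")
storage_context = StorageContext.from_defaults(vector_store=vector_store)

# Index local documents, then answer a question with retrieval-augmented generation.
documents = SimpleDirectoryReader("data").load_data()
index = VectorStoreIndex.from_documents(documents, storage_context=storage_context)
print(index.as_query_engine().query("What does this product cover?"))
```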
## Features

- RAG
- Structured LLM output (see the sketch after this list)
- PDF processing pipeline (via notebook)
- Structured data corpus fetch
- Basic chatlog analysis
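As an illustration of the structured-output feature, here is a minimal sketch using Ollama's structured outputs with a Pydantic schema; the host URL, model name, and schema are assumptions for illustration, not the repo's actual code:

```python
from ollama import Client
from pydantic import BaseModel

class ProductSummary(BaseModel):
    # Hypothetical schema for an insurance-product answer.
    name: str
    coverage: list[str]
    monthly_premium: float

# Point the client at the Ollama endpoint configured in the settings page.
client = Client(host="https://YOUR-MODAL-ENDPOINT.modal.run")
response = client.chat(
    model="llama3.2",
    messages=[{"role": "user", "content": "Summarise the advertised product."}],
    format=ProductSummary.model_json_schema(),  # constrain output to the schema
)
summary = ProductSummary.model_validate_json(response.message.content)
print(summary)
```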
## Acknowledgements

Thanks for sharing these demos and blogs:

- Using Streamlit with Ollama for prototyping: demo of Streamlit with Ollama
- Using LlamaParse with Ollama: the repo, the blog article
- Using Ollama as a freemium backend service: run Ollama with Modal
- Setting up response streaming via FastAPI (compatible with Modal Labs): FastAPI Streaming Response: Error: Did not receive done or success response in stream
- Using structured outputs on Ollama: Structured outputs
- Sharing how to control page visibility in Streamlit: Hide/show pages in multipage app based on conditions