- min/maxing chunk size
- larger chunks -> more context per chunk BUT longer LLM processing times & risk of including irrelevant info
- smaller chunks -> more precise matches & shorter processing times BUT less context per chunk
- high-level tasks like summarization require bigger chunks; low-level tasks like coding require smaller ones (sketch below)
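A minimal chunking sketch (plain Python, character-based with overlap; real splitters usually work on tokens or sentences, and the sizes here are illustrative):

```python
# Minimal sketch: fixed-size character chunking with overlap.
# chunk_size & overlap are the knobs to tune per task.
def chunk_text(text: str, chunk_size: int = 1000, overlap: int = 100) -> list[str]:
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

doc = open("notes.txt").read()                     # hypothetical source document
summary_chunks = chunk_text(doc, chunk_size=2000)  # high-level task: bigger chunks
code_chunks = chunk_text(doc, chunk_size=300)      # low-level task: smaller chunks
```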
- pre-processing
- before data is fed into the database, it must be stripped of 'stop' words & special characters
- html tags, 'the', 'a', general rubbish
- improve quality of indexed data
- replace pronouns with names where possible (improves semantic search recall; see sketch below)
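A rough pre-processing sketch (the stop-word list is illustrative; pronoun replacement needs a coreference model and is out of scope here):

```python
import re

STOP_WORDS = {"the", "a", "an", "and", "of", "to"}  # illustrative; trim to taste

def preprocess(raw_html: str) -> str:
    text = re.sub(r"<[^>]+>", " ", raw_html)  # strip HTML tags
    text = re.sub(r"[^\w\s]", " ", text)      # strip special characters
    words = [w for w in text.lower().split() if w not in STOP_WORDS]
    return " ".join(words)

print(preprocess("<p>The cat sat on a mat!</p>"))  # -> "cat sat on mat"
```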
- metadata
- tagging each chunk (e.g. with timestamp & data source) allows retrieval-time optimizations later, like filtering by recency or source (sketch below)
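A sketch of attaching metadata at ingest time; `embed()` and `collection` are hypothetical stand-ins for your embedding model and vector DB handle:

```python
from datetime import datetime, timezone

# Attach metadata when the chunk goes in, so retrieval can filter on it later
# (e.g. "only chunks from source X after date Y").
record = {
    "text": "cat sat on mat",
    "embedding": embed("cat sat on mat"),  # embed() = whatever model you use (assumed)
    "metadata": {
        "source": "notes.txt",
        "ingested_at": datetime.now(timezone.utc).isoformat(),
        "topic": "pets",
    },
}
collection.add(record)  # collection = your vector DB handle (assumed)
```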
- buckets
- store different topics in different DBs, aka 'buckets', & route each query to the right one (sketch below)
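A routing sketch; the per-topic DB handles (`cooking_db` etc.) are hypothetical, and the naive keyword classifier would in practice be an LLM or trained classifier:

```python
# Route a query to a topic-specific DB ("bucket").
BUCKETS = {
    "cooking": cooking_db,  # hypothetical per-topic vector DB handles
    "coding": coding_db,
    "general": general_db,
}

def route(query: str):
    q = query.lower()
    if "recipe" in q:
        return BUCKETS["cooking"]
    if "python" in q or "bug" in q:
        return BUCKETS["coding"]
    return BUCKETS["general"]

query = "how do I fix this Python bug?"
results = route(query).search(query, k=5)  # .search() assumed on the DB handle
```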
- "Sentence-Window Retrival"
- Query Rewriting
- use an LLM to rephrase a user's layered, multi-part question into n standalone queries (sketch below)
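A sketch of the rewrite step; `llm()` is a hypothetical completion call, swap in whatever client you use:

```python
# Have the LLM split a layered question into standalone sub-queries.
REWRITE_PROMPT = """Split the following question into standalone search queries,
one per line:

{question}"""

def rewrite(question: str) -> list[str]:
    raw = llm(REWRITE_PROMPT.format(question=question))  # llm() is hypothetical
    return [line.strip() for line in raw.splitlines() if line.strip()]

queries = rewrite("Compare Redis and Postgres for caching, and which is cheaper?")
```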
- Multi-Query Retrieval
- run each rewritten query against the DB & take the de-duplicated union of the results (sketch below)
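A sketch of the union step; `search()` is a hypothetical vector search returning `(id, doc)` pairs:

```python
# Run every rewritten query against the DB and union the hits,
# de-duplicating by document id.
def multi_query_retrieve(queries: list[str], k: int = 5) -> list[str]:
    seen, docs = set(), []
    for q in queries:
        for doc_id, doc in search(q, k=k):  # search() is hypothetical
            if doc_id not in seen:
                seen.add(doc_id)
                docs.append(doc)
    return docs
```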
- Step-Back Prompting
- have the LLM first ask a more general "step-back" version of the question, retrieve on that, then answer the original (sketch below)
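A sketch of the flow; `llm()` and `search()` are the same hypothetical calls as in the sketches above:

```python
# Ask a broader "step-back" question first, retrieve on that,
# then answer the original question with the broader context.
STEP_BACK_PROMPT = "What is the more general question behind: {question}"

def step_back_answer(question: str) -> str:
    general_q = llm(STEP_BACK_PROMPT.format(question=question))
    context = search(general_q, k=5)  # hypothetical retrieval call
    return llm(f"Context: {context}\n\nQuestion: {question}")
```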
- Hybrid Search Exploration
- include alternate methods of searching
- keyword search, semantic search, vector search, etc.
- use a "sparse retriever" like BM25 or TF-IDF with a dense retriever (embedding)
- Re-Rank & Filter Documents before sending to LLM
- a high similarity score in the DB doesn't guarantee a good match for the query
- rerank with something like Cohere or a HuggingFace cross-encoder & filter out the documents you don't need (sketch below)
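A reranking sketch using a HuggingFace cross-encoder via sentence-transformers (Cohere's rerank API fills the same role); the model name and `keep` cutoff are illustrative:

```python
from sentence_transformers import CrossEncoder

# Cross-encoder scores each (query, doc) pair jointly: slower than a
# vector lookup but much better at judging actual relevance.
reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")

def rerank(query: str, docs: list[str], keep: int = 3) -> list[str]:
    scores = reranker.predict([(query, d) for d in docs])
    ranked = sorted(zip(scores, docs), reverse=True)
    return [d for _, d in ranked[:keep]]  # filter: drop the low scorers
```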
- Document Compressors
- compress retrieved documents down to just the query-relevant parts before sending them to the LLM (sketch below)
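A crude compressor sketch that keeps only sentences sharing keywords with the query; library compressors (e.g. LangChain's) typically use an LLM or embeddings for the same idea:

```python
# Keep only the sentences of a retrieved doc that overlap the query's words,
# shrinking what gets stuffed into the LLM's context window.
def compress(query: str, doc: str) -> str:
    q_words = set(query.lower().split())
    kept = [s for s in doc.split(".") if q_words & set(s.lower().split())]
    return ". ".join(s.strip() for s in kept)
```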