How can we create RAG dynamically? #429

Closed
SP4595 opened this issue Jul 8, 2024 · 9 comments

@SP4595

SP4595 commented Jul 8, 2024

I am currently facing a challenge with adding new text to RAG. My current approach involves reinitializing the entire RAG, which is slow.

Please provide guidance or suggestions on efficiently adding new text to RAG without requiring complete reinitialization.

@hemangjoshi37a

Hi. Are you using it with OpenAI? What is the indexing cost for this?

@hemangjoshi37a

> I am currently facing a challenge with adding new text to RAG. My current approach involves reinitializing the entire RAG, which is slow.
>
> Please provide guidance or suggestions on efficiently adding new text to RAG without requiring complete reinitialization.

I have the same requirement. I want to use this for coding, where every time I make a small change I have to reindex the whole codebase, which is very slow. If you find anything, please let me know.

@SP4595
Author

SP4595 commented Jul 8, 2024

> Hi. Are you using it with OpenAI? What is the indexing cost for this?

Sorry, I use Ollama with llama3-8b. But some YouTubers have said the OpenAI API will cost a lot.

@kdawgwilk

If you just add a new file to the input folder, it will index incrementally.

@hemangjoshi37a

If anyone has it running on Ollama, please let me know how to set it up.

@Janice0721

If you know how to dynamically add text without generating a full index again, please let me know. Thank you.

@natoverse
Collaborator

If you add new content to the input and do not change your indexing parameters, you should get the benefit of the cache on many of the requests. Changing parameters changes the cache key though, causing a complete reindex.

There are some steps in the pipeline that will re-run due to changed inputs any time you add content.
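
The caching behavior described above can be illustrated with a minimal sketch. This is not the project's actual implementation: `cache_key`, `expensive_extract`, `index`, and the parameter names are hypothetical, and the cache here is a plain in-memory dictionary. It only shows the general idea that a key derived from both the parameters and the input text lets unchanged inputs hit the cache, while any parameter change misses on every entry.

```python
import hashlib
import json


def cache_key(params: dict, text: str) -> str:
    """Hypothetical key: a hash over the indexing parameters plus the input text."""
    payload = json.dumps({"params": params, "text": text}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def expensive_extract(text: str, params: dict) -> str:
    """Stand-in for a costly per-chunk step such as an LLM request."""
    return text.upper()


def index(chunks: list, params: dict, cache: dict) -> int:
    """Process every chunk and return how many needed a fresh (uncached) call."""
    misses = 0
    for chunk in chunks:
        key = cache_key(params, chunk)
        if key not in cache:
            cache[key] = expensive_extract(chunk, params)
            misses += 1
    return misses


cache = {}
params = {"chunk_size": 300, "model": "llama3"}  # assumed parameter names
docs = ["first document", "second document"]

first_run = index(docs, params, cache)                 # empty cache: every chunk misses
after_adding = index(docs + ["new document"], params, cache)  # only the new chunk misses
changed = {**params, "chunk_size": 600}
after_param_change = index(docs + ["new document"], changed, cache)  # every chunk misses

print(first_run, after_adding, after_param_change)  # prints: 2 1 3
```

A per-chunk cache like this cannot help with steps that aggregate over the whole input set, which is consistent with the note that some pipeline steps re-run whenever content is added.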

@hemangjoshi37a

Has anyone achieved this, or can anyone suggest an approach? We want to continuously index the updated codebase in the HippoRAG index, query the updated index, make code changes, and repeat. We want to do this offline with Ollama-type models only, without using OpenAI or Claude.

Can anyone suggest how I can do this?

@natoverse
Collaborator

Consolidating index update requests with #741

natoverse closed this as not planned on Jul 26, 2024