Replies: 3 comments 7 replies
-
I think it's a cool idea. |
Beta Was this translation helpful? Give feedback.
-
Just came across this interesting project: |
Beta Was this translation helpful? Give feedback.
-
I'm thinking through approaches for building "websearch". Perhaps as another tool (like the calculator)? Where there's a swapable search engine adapter (Bing API, Brave, searxng, etc). And where you can specify via confg how many results it should fetch in parallel. But it's unclear how to best handle the next step - semantically chunk the html and vectorize it in memory via Nx? Or dump into Postgres pgvector? Then there's some other things that might need to be done for better reliability like this video describes (regarding the Grader and Hallucination checker): Reliable, fully local RAG agents with LLaMA3 There's also some interesting ideas here: https://docs.tavily.com/blog/building-openai-assistant Thoughts? |
Beta Was this translation helpful? Give feedback.
-
Maybe out of scope but you know what would be cool - doing this sort of thing (Perplexity clone ie., "online" live LLM):
-https://github.com/searxng/searxng (open source search aggregator)
Basically a RAG approach that knows when to run a live search through the searxng aggregator. Perhaps combined with pgvector and the bumblebee harness GPU for a true open source setup. And maybe hooked up to Crawly to get the page content in parallel.
Could make for an interesting open source fly setup (as all the services could be run there).
Or a simpler version using the Bing API: #48
Beta Was this translation helpful? Give feedback.
All reactions