tinyburger1

joined 10 months ago
tinyburger1@alien.top 1 point 10 months ago (1 child)

But won't it increase the inference time quite a bit? Or are there any GitHub projects to get started with this?


I'm trying to build an application using RAG. I know how RAG helps ground the responses, but how do I handle generic queries from users that have nothing to do with what's stored in my vector database? For example, queries such as "How many gold medals did China win during the Tokyo Olympics?" vs. "Paraphrase this email for me: ...". I would assume an LLM without RAG would do a much better job answering the second question.

How do people usually handle these scenarios? Are there any tools that I can look at? Any help would be greatly appreciated. Thank you.
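To make the question concrete, here is a rough sketch of one possible approach: route each query based on how well it matches anything in the store, and skip retrieval when the best match is weak. Everything in it is an assumption made for illustration: `embed` is a toy bag-of-words stand-in for a real embedding model, `route_query` and the `threshold` value of 0.3 are hypothetical, and the sample documents are made up. A real setup would use the similarity score the vector database already returns and a threshold tuned on actual queries.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def route_query(query, documents, threshold=0.3):
    """Return ("rag", best_document) if some document is similar enough
    to the query, otherwise ("direct", None) to call the LLM without context."""
    q = embed(query)
    scores = [cosine(q, embed(d)) for d in documents]
    best = max(scores, default=0.0)
    if best >= threshold:
        return "rag", documents[scores.index(best)]
    return "direct", None


if __name__ == "__main__":
    # Hypothetical contents of the vector store.
    docs = [
        "Our refund policy allows returns within 30 days of purchase.",
        "Shipping takes 3 to 5 business days within the country.",
    ]
    for q in ["What is the refund policy for returns?",
              "Paraphrase this email for me"]:
        print(q, "->", route_query(q, docs)[0])
```

In this sketch the routing decision reuses the similarity scores that the retrieval step computes anyway, so the check itself is only a comparison against the threshold. Whether that holds for a given stack depends on how retrieval is implemented.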