this post was submitted on 30 Oct 2023
1 points (100.0% liked)
LocalLLaMA
3 readers
1 users here now
Community to discuss about Llama, the family of large language models created by Meta AI.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
But won't it increase the inference time quite a bit? Or are there any GitHub projects to get started with this?
THAT is the cost side and that is a NASTY one. It is not only the financial - but it is, as you point out - the response time. And it is NOT just inference, you also have all the lookup that must happen.
But yes. This is where the price is paid that shows that we are still a factor if 10 or 20 away from fast interactive complex data AI.
But - do not worry, we get there ;)
No github I am aware of - people are very happy with their naive little innovation and never see the real problems in their simplistic tests. It is an 80/20 or higher order problem - MOST things work simple, SOME - ah - well ;) YOu also get into the "smalltalk" - you do not want to run a full research cycle when the user input is "Thank you, that was helpful" ;)
That said, really, if AI gets 10x faster (and it looks like hard+software is on the way for more than that) it is easily doable from the time side.