this post was submitted on 13 Nov 2023

LocalLLaMA


Community to discuss about Llama, the family of large language models created by Meta AI.


Obviously, building a big, high-dimensional language model from scratch is hard, yes, okay.

But once we have one, can't we just jiggle the weights and run tests? Why can't I just download a program to "evolve" my language model?

Or am I just stupid, and this is too trivially easy to be worth a program?
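For concreteness, the "jiggle weights and run tests" loop is essentially random hill climbing (a crude form of evolution strategies). A toy sketch, with a stand-in loss function instead of a real LLM benchmark:

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for a real model: a tiny weight vector and a loss to minimize.
# (A real LLM has billions of weights, not 1000.)
weights = rng.normal(size=1000)

def loss(w):
    # Hypothetical scoring function; a real "test" would be a full benchmark run.
    return float(np.sum(w ** 2))

initial = loss(weights)
best = initial
for step in range(200):
    candidate = weights + rng.normal(scale=0.01, size=weights.shape)  # "jiggle"
    score = loss(candidate)        # "run tests" -- the expensive part
    if score < best:               # keep the jiggle only if it helped
        weights, best = candidate, score
```

The catch is scale: each "test" means evaluating the whole model on a meaningful benchmark, and with billions of weights a random perturbation almost never helps, so the loop needs an astronomical number of evaluations before anything improves.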

peace

[–] LuluViBritannia@alien.top 1 points 1 year ago (5 children)

I have been generating art with AI, and there is an extension meant for exactly that: you literally tell the AI "good" or "bad" for each result, and it adjusts the model's weights accordingly.

Sadly, it's nearly impossible to run. Reinforcement learning isn't just "pick a random weight and change it"; it updates the entire model to take your feedback into account. And it does that while running the model, which by itself already eats most of your compute resources.

You need a shitton of VRAM and a very powerful GPU to run reinforcement learning for images. It's even worse for LLMs, which are much more power-hungry.

Who knows, maybe there will be optimizations in the coming years, but as of right now, reinforcement learning is just too demanding.
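A rough rule-of-thumb for why gradient-based fine-tuning needs so much more VRAM than inference (my own back-of-the-envelope, not from this thread; exact byte counts depend on precision, optimizer, and activations):

```python
def training_vram_gb(n_params_billions, bytes_weights=2, bytes_grads=2,
                     bytes_optimizer=8):
    """Rough lower bound: fp16 weights + fp16 gradients + fp32 Adam
    moments (m and v), ignoring activations and overhead."""
    per_param = bytes_weights + bytes_grads + bytes_optimizer
    # 1e9 params * N bytes is roughly N gigabytes.
    return n_params_billions * per_param

# A 7B-parameter model:
inference_gb = 7 * 2             # ~14 GB just to hold fp16 weights
training_gb = training_vram_gb(7)  # ~84 GB before activations
```

So even before counting activations, full fine-tuning of a 7B model wants several times the memory that merely running it does, which is why consumer GPUs can often do inference but not training.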

[–] Bod9001@alien.top 1 points 1 year ago (1 children)

what's the name of the extension?

[–] UnignorableAnomaly@alien.top 1 points 1 year ago

Don't know if it's the same one, but I've played with this: https://github.com/dvruette/sd-webui-fabric. It doesn't use much VRAM at all and works decently once you give it enough likes and dislikes. However, as you add more likes/dislikes, generation slows down considerably.
