RogueStargun

joined 10 months ago

What is Q* and how do we use it? in c/localllama@poweruser.forum

[–] RogueStargun@alien.top 1 points 10 months ago (2 children)

Q* is just a reinforcement learning technique.

Perhaps they scaled it up and combined it with LLMs

Given their recently published paper, they probably figured out a way to get GPT to learn their own reward function somehow.

Perhaps some chicken little board members believe this would be the philosophical trigger towards machine intelligence deciding upon its own alignment.

permalink
fedilink
source

Down to memory lane, 2022 - "Google's LaMDA Ai is sentient, I swear" in c/localllama@poweruser.forum

[–] RogueStargun@alien.top 1 points 10 months ago

Damn that was only a year ago? It feel like EONS ago

permalink
fedilink
source