this post was submitted on 27 Nov 2023
1 points (100.0% liked)

LocalLLaMA

3 readers
1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 1 year ago
MODERATORS
 

https://preview.redd.it/3krgd1sg2z2c1.png?width=800&format=png&auto=webp&s=b76c5fb9fa22938c74ec3095f63adaec8ff2219d

I came across this new finetuned model based on Openchat 3.5 which is apparently trained used Reinforcement Learning from AI Feedback (RLAIF).

https://huggingface.co/berkeley-nest/Starling-LM-7B-alpha

Check out this tweet: https://twitter.com/bindureddy/status/1729253715549602071

you are viewing a single comment's thread
view the rest of the comments
[–] bot-333@alien.top 1 points 11 months ago (6 children)

"New RLAIF Finetuned 7b Model" Interesting. "beats Openchat 3.5" Nice! "and comes close to GPT-4" Bruh.

[–] Evening_Ad6637@alien.top 1 points 11 months ago (5 children)

heheh i can't read that any more.. i really have become very prejudiced when comes to that.. to be honest, when it comes to any comparison with GPT-4.

People have really to understand that even GPT-4 has been aligned, lobotomized and it has been massively downgraded in terms of its perfomance – due to security reasons (what is understandable for me), but anyway this thing still is an absolute beast. if we consider all the restrictions GPT-4 has to undergo, all the smartness at openAI, all the ressources at microsoft and so on, we have to realize that currently nothing is really comparable to GPT-4. Especially not 7B models.

[–] noeda@alien.top 1 points 11 months ago (4 children)

I've seen the "... beats GPT-4" enough times that now whenever I see a title that suggests a tiny model can compete with GPT-4 I see it as a negative signal; that the authors are bullshitting through some benchmarks or some other shenanigans.

It's annoying because the models might be legitimately good models for being open and within their weight class but now you've put my brain in BS detecting mode and I can't trust you've done good faith measurement anymore.

[–] bot-333@alien.top 1 points 11 months ago

There are SO many models "bullshitting through some benchmarks or some other shenanigans" that I'm cooking my own benchmark system LOL.

load more comments (3 replies)
load more comments (3 replies)
load more comments (3 replies)