LocalLLaMA

4 readers

4 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 2 years ago

MODERATORS

communick@poweruser.forum

What is the best multi-purpose model available right now? (alien.top)

submitted 2 years ago by A0sanitycomp@alien.top to c/localllama@poweruser.forum

12 comments fedilink hide all child comments

Set aside benchmarks, if you had to choose one to use instead of ChatGPT for the next 6 months, which one would you pick? Recently, I've been experiencing some extreme slow down and poor answers on GPT so I'm going to run a local backup for the time being to assist when GPT4 is down. I'm leaning towards Mistral. I can be convinced to test some others, though.

top 12 comments

sorted by: hot top controversial new old

[–] Linkology@alien.top 1 points 2 years ago (1 children)

Mistral has been quite good at multiple tasks I throw at it given its small size. But for specific tasks some models can work better

[–] toothpastespiders@alien.top 1 points 2 years ago

I'm still shocked at how good mistral is. I wrote it off as a meme model for far too long just because of how overstated the praise seemed to be. But the thing really is amazing for the size.

[–] Dravodin@alien.top 1 points 2 years ago

There is an upcoming NeuralHermes-2.5-Mistral-70B, chances are it will also have vision version as well. Looking at really impressive performance of 7B version. I think 70B will set new benchmarks in OSS AI world. But, there are plenty of other models as well. You should choose according to your use case.

[–] toothpastespiders@alien.top 1 points 2 years ago (1 children)

Just one, and assuming no extra training? I think I'd go with Capybara Tess Yi 34b. In part because of how well it seems to follow instructions. But also because it has the broadest scope of knowledge that I've seen in any of the models so far. A lot of the models tap out on a lot of things past what you'd get from the first paragraph of Wikipedia. I get that feeling far less often with capy so far.

[–] Sweet_Protection_163@alien.top 1 points 2 years ago

This!

[–] Feeling-Currency-360@alien.top 1 points 2 years ago

OpenHermes-2.5-Mistral-7B-16k imo

[–] kitkatmafia@alien.top 1 points 2 years ago

Wizard-lm 13b

[–] RiotNrrd2001@alien.top 1 points 2 years ago

I keep trying new models, and I keep going back to Dolphin-Mistral-2.2.1. There is something about the quality of the interactions that is different from the other models, and is, I don't know, unexplainably better. I cannot identify why this remains in my mind the best model of all models I've tested, clear up to 33b (the largest my pitiful machine will load), but I continue to think this. Now, I haven't tested every model, so my opinion is completely anecdotal. Dolphin just kicks it, though. It just does such a good job at almost everything I throw at it. I won't say it doesn't foul up here and there, but it still blows the other small models out of the water as far as I'm concerned.

[–] Dazzling_Ad1507@alien.top 1 points 2 years ago (1 children)

The Best? Id think a goliath 120B finetune like Tess XL

[–] A0sanitycomp@alien.top 1 points 2 years ago (1 children)

Lol 😂 I’d need an upgrade

[–] Dazzling_Ad1507@alien.top 1 points 2 years ago (1 children)

You can always test it out using runpod for around ~2 dollars an hour

[–] fimbulvntr@alien.top 1 points 2 years ago

OpenRouter has it, you can test it for free using their chat interface, or for $5 if you want API access. No need to install or download anything.