this post was submitted on 29 Nov 2023
1 points (100.0% liked)

LocalLLaMA

3 readers
1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 1 year ago
MODERATORS
 

Set aside benchmarks, if you had to choose one to use instead of ChatGPT for the next 6 months, which one would you pick? Recently, I've been experiencing some extreme slow down and poor answers on GPT so I'm going to run a local backup for the time being to assist when GPT4 is down. I'm leaning towards Mistral. I can be convinced to test some others, though.

top 12 comments
sorted by: hot top controversial new old
[–] Linkology@alien.top 1 points 11 months ago (1 children)

Mistral has been quite good at multiple tasks I throw at it given its small size. But for specific tasks some models can work better

[–] toothpastespiders@alien.top 1 points 11 months ago

I'm still shocked at how good mistral is. I wrote it off as a meme model for far too long just because of how overstated the praise seemed to be. But the thing really is amazing for the size.

[–] Dravodin@alien.top 1 points 11 months ago

There is an upcoming NeuralHermes-2.5-Mistral-70B, chances are it will also have vision version as well. Looking at really impressive performance of 7B version. I think 70B will set new benchmarks in OSS AI world. But, there are plenty of other models as well. You should choose according to your use case.

[–] toothpastespiders@alien.top 1 points 11 months ago (1 children)

Just one, and assuming no extra training? I think I'd go with Capybara Tess Yi 34b. In part because of how well it seems to follow instructions. But also because it has the broadest scope of knowledge that I've seen in any of the models so far. A lot of the models tap out on a lot of things past what you'd get from the first paragraph of Wikipedia. I get that feeling far less often with capy so far.

[–] Feeling-Currency-360@alien.top 1 points 11 months ago

OpenHermes-2.5-Mistral-7B-16k imo

[–] kitkatmafia@alien.top 1 points 11 months ago

Wizard-lm 13b

[–] RiotNrrd2001@alien.top 1 points 11 months ago

I keep trying new models, and I keep going back to Dolphin-Mistral-2.2.1. There is something about the quality of the interactions that is different from the other models, and is, I don't know, unexplainably better. I cannot identify why this remains in my mind the best model of all models I've tested, clear up to 33b (the largest my pitiful machine will load), but I continue to think this. Now, I haven't tested every model, so my opinion is completely anecdotal. Dolphin just kicks it, though. It just does such a good job at almost everything I throw at it. I won't say it doesn't foul up here and there, but it still blows the other small models out of the water as far as I'm concerned.

[–] Dazzling_Ad1507@alien.top 1 points 11 months ago (1 children)

The Best? Id think a goliath 120B finetune like Tess XL

[–] A0sanitycomp@alien.top 1 points 11 months ago (1 children)

Lol 😂 I’d need an upgrade

[–] Dazzling_Ad1507@alien.top 1 points 11 months ago (1 children)

You can always test it out using runpod for around ~2 dollars an hour

[–] fimbulvntr@alien.top 1 points 11 months ago

OpenRouter has it, you can test it for free using their chat interface, or for $5 if you want API access. No need to install or download anything.