LocalLLaMA

14 readers

1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 2 years ago

MODERATORS

communick@poweruser.forum

dolphin-2.2-yi-34b released (alien.top)

submitted 2 years ago by Amgadoz@alien.top to c/localllama@poweruser.forum

34 comments fedilink hide all child comments

Eric Hartford, the author of dolphin models, released dolphin-2.2-yi-34b.

This is one of the earliest community finetunes of the yi-34B.

yi-34B was developed by a Chinese company and they claim sota performance that are on par with gpt-3.5

HF: https://huggingface.co/ehartford/dolphin-2_2-yi-34b

Announcement: https://x.com/erhartford/status/1723940171991663088?s=20

you are viewing a single comment's thread
view the rest of the comments

[–] denru01@alien.top 1 points 2 years ago (1 children)

Which is the best 70B on your list?

[–] WolframRavenwolf@alien.top 1 points 2 years ago (1 children)

I'm still working on the updated 70B comparisons/tests, but right now, the top three models are still the same as in the first part of my Huge LLM Comparison/Test: 39 models tested (7B-70B + ChatGPT/GPT-4): lzlv_70B, SynthIA-70B-v1.5, chronos007-70B. Followed by dolphin-2_2-yi-34b.

[–] Healthy_Cry_4861@alien.top 1 points 2 years ago (1 children)

SynthIA-70B-v1.5 seems to have the same context length of 2k as SynthIA-70B-v1.2, not the same 4k context length as SynthIA-70B-v1.2b

[–] WolframRavenwolf@alien.top 1 points 2 years ago

You're right with your observation, when I load the GGUF, KoboldCpp says "n_ctx_train: 2048". Could that be an erroneous display? Because I've always used v1.5 with 4K context, did all my tests with that, and it's done so well. If it's true, it might even be better with native context! Still, 2K just doesn't cut it anymore, though.