LocalLLaMA

14 readers

1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 2 years ago

MODERATORS

communick@poweruser.forum

Qwen-72B released (huggingface.co)

submitted 2 years ago by PookaMacPhellimen@alien.top to c/localllama@poweruser.forum

39 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] PookaMacPhellimen@alien.top 1 points 2 years ago (3 children)

https://preview.redd.it/sdofti9odg3c1.jpeg?width=1792&format=pjpg&auto=webp&s=d6f56d56c3596924ea61e1e5429018c0222907d2

Amazing capabilities on some benchmarks if true.

[–] Secret_Joke_2262@alien.top 1 points 2 years ago (1 children)

What do these tests mean for LLM? There are many values, and I see that in most cases qwen is better than gpt4. In others it is worse or much worse

[–] rileyphone@alien.top 1 points 2 years ago

All the cases it is better than GPT-4 are benchmarks involving Chinese language. OpenAI is going to have a hard time getting access to extensive Chinese language datasets so it's not surprising a 72B model can beat GPT-4, though it's still impressive in it's own right.

[–] Disastrous_Elk_6375@alien.top 1 points 2 years ago

big if true

[–] a_slay_nub@alien.top 1 points 2 years ago (1 children)

Bit disappointed by the coding performance but it is a general use case model. It's insane how good gpt 3.5 is for how fast it is.

[–] ambient_temp_xeno@alien.top 1 points 2 years ago

Apparently the chat version has about 64 for humaneval.