LocalLLaMA

14 readers

1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 2 years ago

MODERATORS

communick@poweruser.forum

100B, 220B, and 600B models on huggingface! (alien.top)

submitted 2 years ago by Illustrious_Sand6784@alien.top to c/localllama@poweruser.forum

44 comments fedilink hide all child comments

https://huggingface.co/deepnight-research

I'm not affiliated with this group at all, I was just randomly looking for any new big merges and found these.

100B model: https://huggingface.co/deepnight-research/saily_100B

220B model: https://huggingface.co/deepnight-research/Saily_220B

600B model: https://huggingface.co/deepnight-research/ai1

They have some big claims about the capabilities of their models, but the two best ones are unavailable to download. Maybe we can help convince them to release them publicly?

you are viewing a single comment's thread
view the rest of the comments

[–] noeda@alien.top 1 points 2 years ago (2 children)

Some quotes I found on the pages:

"No! The model is not going to be available publically. APOLOGIES. The model like this can be misused very easily. The model is only going to be provided to already selected organisations."

"[SOMETHING SPECIAL]: AIN'T DISCLOSING!🧟"

"Hallucinations: Reduced Hallucinations 8x compared to ChatGPT 🥳"

My guess: it's just another merge like Goliath. At best it's marginally better than a good 70B.

I can also "successfully build 220B model" easily with mergekit. Would it be good? Probably not.

The lab should write on their model card why should I not think it's just bullshit. Not exactly the first mystery lab making big claims.

[–] VertexMachine@alien.top 1 points 2 years ago

I doubt there's any model there.

[–] PookaMacPhellimen@alien.top 1 points 2 years ago

Wonder if GPT4 is just a series of merges