LocalLLaMA

14 readers

1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 2 years ago

MODERATORS

communick@poweruser.forum

100B, 220B, and 600B models on huggingface! (alien.top)

submitted 2 years ago by Illustrious_Sand6784@alien.top to c/localllama@poweruser.forum

44 comments fedilink hide all child comments

https://huggingface.co/deepnight-research

I'm not affiliated with this group at all, I was just randomly looking for any new big merges and found these.

100B model: https://huggingface.co/deepnight-research/saily_100B

220B model: https://huggingface.co/deepnight-research/Saily_220B

600B model: https://huggingface.co/deepnight-research/ai1

They have some big claims about the capabilities of their models, but the two best ones are unavailable to download. Maybe we can help convince them to release them publicly?

you are viewing a single comment's thread
view the rest of the comments

[–] opi098514@alien.top 1 points 2 years ago (2 children)

It’s the best out there…. But no you can’t try it because it’s to dangerous.

[–] VertexMachine@alien.top 1 points 2 years ago (1 children)

I doubt there is any model really... follow the trail, you'll end up at a company founded by single person from India (who is founder of another company with a single app for collaborative drawing)... that at least doesn't have any employees on LinkedIn...

And the founder looks like a relatively young person that most likely wouldn't be even able to gather the required funding to have enough GPU compute for making model that's better than gpt4 (or know how). I think that's just a front for him trying to get some hype or funding.

[–] opi098514@alien.top 1 points 2 years ago (2 children)

Uuummmm no. It’s for sure real. And the best one out there. No questions asked. It’s better that CHATGPT 4 and OpenAI has been trying to hack this new company to get the 600b model because they are scared that it will end OpenAI for good.

Obligatory /s

[–] aurumvexillum@alien.top 1 points 2 years ago (1 children)

You forgot to mention that your uncle is the CEO of OpenAI! 😉

[–] opi098514@alien.top 1 points 2 years ago

Well that’s because he’s not. Sam is actually my dad.

[–] LetsGoBrandon4256@alien.top 1 points 2 years ago (2 children)

https://in.linkedin.com/company/deepnight

View 1 employee

Work experience: Google Startup Alumni

lmao

[–] opi098514@alien.top 1 points 2 years ago

Everything on that page is hype for something that doesn’t exist.

[–] ananthasharma@alien.top 1 points 2 years ago

A cursory look at the website makes me think these guys don’t know what they are doing

[–] SomeOddCodeGuy@alien.top 1 points 2 years ago (1 children)

Right. This part right here is very suspicious to me, and I'm taking their claims with a grain of salt.

No! The model is not going to be available publically. APOLOGIES. The model like this can be misused very easily. The model is only going to be provided to already selected organisations.

[–] bot-333@alien.top 1 points 2 years ago (1 children)

I think they changed it to it’s still an experiment and they are finishing evaluations to better understand the model.

[–] Illustrious_Sand6784@alien.top 1 points 2 years ago (1 children)

No they haven't, on the 220B model it's always been that message above, while on the 600B model it's a message similar to the one you stated.

[–] bot-333@alien.top 1 points 2 years ago

I guess they might open source the 600B one? They have different names, so maybe different training approaches.