this post was submitted on 28 Nov 2023

LocalLLaMA

https://huggingface.co/deepnight-research

I'm not affiliated with this group at all; I was just randomly looking for any new big merges and found these.

100B model: https://huggingface.co/deepnight-research/saily_100B

220B model: https://huggingface.co/deepnight-research/Saily_220B

600B model: https://huggingface.co/deepnight-research/ai1

They have some big claims about the capabilities of their models, but the two best ones are unavailable to download. Maybe we can help convince them to release them publicly?

[–] opi098514@alien.top 1 points 11 months ago (2 children)

It’s the best out there... But no, you can’t try it because it’s too dangerous.

[–] SomeOddCodeGuy@alien.top 1 points 11 months ago (1 children)

Right. This part right here is very suspicious to me, and I'm taking their claims with a grain of salt.

"No! The model is not going to be available publically. APOLOGIES. The model like this can be misused very easily. The model is only going to be provided to already selected organisations."

[–] bot-333@alien.top 1 points 11 months ago (1 children)

I think they changed it to say that it’s still an experiment and that they are finishing evaluations to better understand the model.

[–] Illustrious_Sand6784@alien.top 1 points 11 months ago (1 children)

No, they haven't. On the 220B model it's always been the message above, while on the 600B model it's a message similar to the one you described.

[–] bot-333@alien.top 1 points 11 months ago

I guess they might open source the 600B one? They have different names, so maybe different training approaches.

[–] VertexMachine@alien.top 1 points 11 months ago (1 children)

I doubt there is any model really... follow the trail and you'll end up at a company founded by a single person from India (who is also the founder of another company with a single app for collaborative drawing)... a company that, at the least, doesn't have any employees on LinkedIn...

And the founder looks like a relatively young person who most likely wouldn't even be able to gather the funding required for enough GPU compute to make a model that's better than GPT-4 (or have the know-how). I think that's just a front for him trying to get some hype or funding.

[–] opi098514@alien.top 1 points 11 months ago (2 children)

Uuummmm no. It’s for sure real. And the best one out there. No questions asked. It’s better than CHATGPT 4, and OpenAI has been trying to hack this new company to get the 600B model because they are scared that it will end OpenAI for good.

Obligatory /s

[–] LetsGoBrandon4256@alien.top 1 points 11 months ago (2 children)

https://in.linkedin.com/company/deepnight

View 1 employee

Work experience: Google Startup Alumni

lmao

[–] opi098514@alien.top 1 points 11 months ago

Everything on that page is hype for something that doesn’t exist.

[–] ananthasharma@alien.top 1 points 11 months ago

A cursory look at the website makes me think these guys don’t know what they are doing

[–] aurumvexillum@alien.top 1 points 11 months ago (1 children)

You forgot to mention that your uncle is the CEO of OpenAI! 😉

[–] opi098514@alien.top 1 points 11 months ago

Well that’s because he’s not. Sam is actually my dad.