this post was submitted on 21 Nov 2023

LocalLLaMA


Community to discuss Llama, the family of large language models created by Meta AI.

[–] TheCrazyAcademic@alien.top 1 points 11 months ago

It'd be interesting to see how an MoE framework of multiple Orca 2s would fare, with each expert trained on a different subset of data and a router sending your prompt to the relevant Orca 2 expert. I feel like that could come extraordinarily close to GPT-4 on performance metrics, but it would take decent computing power to test the hypothesis. If each Orca 2 expert is 10 billion parameters and you wanted to run a 100-billion-parameter sparse Orca 2 MoE, that's going to require at least 500+ GB of VRAM.
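
As a rough illustration of the routing idea described above, here's a toy PyTorch sketch of top-k routing across several experts. The `ToyExpert` modules, the gating network, the dimensions, and the class names are all hypothetical stand-ins (a real setup would load full Orca 2 checkpoints as the experts), not anything Orca 2 or Meta actually ships.

```python
# Toy sketch: a learned gate routes each prompt embedding to its top-k experts,
# where each expert stands in for an Orca 2 model trained on a different data subset.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyExpert(nn.Module):
    """Stand-in for one Orca 2 expert; a real setup would load a full LLM here."""
    def __init__(self, dim):
        super().__init__()
        self.ff = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))

    def forward(self, x):
        return self.ff(x)

class SparseMoE(nn.Module):
    """Route each prompt embedding to its top-k experts and mix their outputs."""
    def __init__(self, dim, num_experts=10, top_k=2):
        super().__init__()
        self.gate = nn.Linear(dim, num_experts)  # learned router over experts
        self.experts = nn.ModuleList([ToyExpert(dim) for _ in range(num_experts)])
        self.top_k = top_k

    def forward(self, x):                                  # x: (batch, dim)
        logits = self.gate(x)                              # (batch, num_experts)
        weights, idx = torch.topk(logits, self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)               # renormalize over the chosen experts
        out = torch.zeros_like(x)
        for slot in range(self.top_k):                     # only the top-k experts run per prompt
            for b in range(x.size(0)):
                e = idx[b, slot].item()
                out[b] += weights[b, slot] * self.experts[e](x[b:b+1]).squeeze(0)
        return out

if __name__ == "__main__":
    moe = SparseMoE(dim=64, num_experts=10, top_k=2)
    prompts = torch.randn(4, 64)                           # 4 toy "prompt embeddings"
    print(moe(prompts).shape)                              # torch.Size([4, 64])
    # Rough weight-memory arithmetic for the scenario in the comment:
    # 10 experts x 10B params = 100B weights -> ~200 GB at fp16, ~400 GB at fp32,
    # before KV cache and activation overhead, which is how you land in the 500 GB+ range.
```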