LocalLLaMA

11 readers

4 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 2 years ago

MODERATORS

communick@poweruser.forum

Models Megathread #2 - What models are you currently using? (alien.top)

submitted 2 years ago by Technical_Leather949@alien.top to c/localllama@poweruser.forum

56 comments fedilink hide all child comments

As requested, this is the subreddit's second megathread for model discussion. This thread will now be hosted at least once a month to keep the discussion updated and help reduce identical posts.

I also saw that we hit 80,000 members recently! Thanks to every member for joining and making this happen.

Welcome to the r/LocalLLaMA Models Megathread

What models are you currently using and why? Do you use 7B, 13B, 33B, 34B, or 70B? Share any and all recommendations you have!

Examples of popular categories:

Assistant chatting
Chatting
Coding
Language-specific
Misc. professional use
Role-playing
Storytelling
Visual instruction

Have feedback or suggestions for other discussion topics? All suggestions are appreciated and can be sent to modmail.

^(P.S. LocalLLaMA is looking for someone who can manage Discord. If you have experience modding Discord servers, your help would be welcome. Send a message if interested.)

Previous Thread | New Models

you are viewing a single comment's thread
view the rest of the comments

[–] FutureIsMine@alien.top 1 points 2 years ago (1 children)

Llama2-70B for generating the plan and than using CodeLlama-34B for coding, or LLama-13B for executing the instructions from LLama-2-70B

Currently in the process of exploring what other models to add once LLama2-70B generates the plan for what needs to get done

[–] Mr_Finious@alien.top 1 points 2 years ago (1 children)

What do you mean by generating the plan? Can you describe your workflow ?

[–] FutureIsMine@alien.top 1 points 2 years ago (1 children)

Lets say you've got a task like write a blog post. Instead of issuing a single command, have a GPT model plan it out. Something akin to

system: You are a planning AI, you will come up with a plan that will assist the user in any task they need help with as best you can. You will layout a clear and well followed plan.   
User: Hello Planner AI, I need your help with coming up with a plan for the following   task : {user_prompt}

So the now LLama2-70B generates a plan that has steps in it that are numbered. Next, you can regex on the numbers and than pass that along to the worker model that will execute the task. As LLMS write more than humans and add in additional details that LLMS can follow, the subsequent LLMs will do a better job in executing the task rather than if you asked a smaller model write me a blog post about 3D printing D&D minis. Now go replace the task of writing a blog post with whatever it is you're doing and you'll be getting results

[–] Mr_Finious@alien.top 1 points 2 years ago

Wow. Thank you so much for this explanation !!! ❤️