Llama2-70B for generating the plan and than using CodeLlama-34B for coding, or LLama-13B for executing the instructions from LLama-2-70B
Currently in the process of exploring what other models to add once LLama2-70B generates the plan for what needs to get done
Lets say you've got a task like
write a blog post
. Instead of issuing a single command, have a GPT model plan it out. Something akin toSo the now LLama2-70B generates a plan that has steps in it that are numbered. Next, you can regex on the numbers and than pass that along to the worker model that will execute the task. As LLMS write more than humans and add in additional details that LLMS can follow, the subsequent LLMs will do a better job in executing the task rather than if you asked a smaller model
write me a blog post about 3D printing D&D minis
. Now go replace the task of writing a blog post with whatever it is you're doing and you'll be getting results