this post was submitted on 28 Nov 2023
1 points (100.0% liked)

LocalLLaMA

3 readers
1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[โ€“] AutomataManifold@alien.top 1 points 11 months ago (1 children)

They provide the source code for generating your own dialog datasets. Interesting.

https://github.com/skywalker023/sodaverse

[โ€“] AutomataManifold@alien.top 1 points 11 months ago

It's fairly easy to get it to talk to the continuation endpoint in the server for text-generation-webui or llama.cpp instead of OpenAI; actually the painful part was reformatting it to use an instruction format. Just plugging it in to the chat endpoint might work better.

Just prefixing the prompt with some random facts about a fictional world is enough to steer the generation in a way that makes the conversations mention enough stuff about your world to generate a few hundred thousand high-quality conversations with a 13B Llama model. They look like they're pretty diverse, but obviously I haven't had time to train anything on the generated data.

That's probably enough for most applications. Next level is probably generating a world-specific symbolic knowledge distillation so it include elves and dragons in the source. That looks like it requires more accuracy, but they got good enough results with GPT-3 so it's probably feasible. A lot of applications will probably be fine with just generating custom Sodaverse data.