this post was submitted on 26 Nov 2023

LocalLLaMA


I'm confused by all these prefixes that appear in the finetunes of base models. Is there a glossary of all these words and similar ones?

top 9 comments
[–] swagonflyyyy@alien.top 1 points 11 months ago

Hermes is the messenger of the gods. It is a metaphor. I'm sure the rest of them have their own meaning as well.

[–] ihexx@alien.top 1 points 11 months ago

They are just made-up names. People name their projects whatever they like. Sometimes the name is related to the prior work it's based on (like the underlying model or dataset), but otherwise it's arbitrary.

[–] supremeevilution@alien.top 1 points 11 months ago

Remember when Android named their updates after desserts? Kinda like that.

[–] Feztopia@alien.top 1 points 11 months ago

You need to name the models somehow

[–] BlueIdoru@alien.top 1 points 11 months ago

People make up names.

[–] vicks9880@alien.top 1 points 11 months ago (1 children)

These are just names. LLaMA originally meant "Large Language Model Meta AI", but it's also the name of a South American animal, so creative people on the internet who downloaded those weights, fine-tuned them, and published the results named them after the other animals of same family like alpaca, vicuna, dalai (llama) etc.

The more important information in these model names is in the suffixes: parameter counts like 13B, file formats like GGUF/GGML, fine-tuning techniques like LoRA, and quantization levels like Q6_K.
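As a rough illustration of how those suffixes can be read, here is a minimal Python sketch that pulls the parameter count, quantization level, and file format out of a model file name. The function name `parse_model_name`, the regular expressions, and the example file name are all assumptions made up for this example. Model names follow no enforced standard, so treat this as a heuristic, not a real parser.

```python
import re

# Illustrative patterns only: model names follow no enforced standard,
# so these regexes are assumptions and will miss many real-world names.
PARAM_RE = re.compile(r"(?<![A-Za-z0-9])(\d+(?:\.\d+)?)[bB](?![A-Za-z0-9])")
QUANT_RE = re.compile(r"(?<![A-Za-z0-9])([qQ]\d(?:_[A-Za-z0-9]+)*)(?![A-Za-z0-9])")
FORMAT_RE = re.compile(r"\.(gguf|ggml)$", re.IGNORECASE)


def parse_model_name(name):
    """Guess the meaning of common suffixes in a model file name."""
    params = PARAM_RE.search(name)
    quant = QUANT_RE.search(name)
    fmt = FORMAT_RE.search(name)
    return {
        # Parameter count in billions, e.g. "13B"
        "parameters": params.group(1) + "B" if params else None,
        # Quantization level, e.g. "Q4_K_M"
        "quantization": quant.group(1).upper() if quant else None,
        # File format taken from the extension, e.g. "GGUF"
        "file_format": fmt.group(1).upper() if fmt else None,
        # Crude check for a LoRA marker anywhere in the name
        "lora": "lora" in name.lower(),
    }


# Hypothetical file name used purely for illustration
print(parse_model_name("llama-2-13b-chat.Q4_K_M.gguf"))
```

Everything before the suffixes (the "Hermes" or "Vicuna" part) is left alone on purpose, since that part is just a name someone chose.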

[–] prtt@alien.top 1 points 11 months ago

the other animals of same family like alpaca, vicuna, dalai (llama)

One of these is definitely not like the others.

[–] irregardless@alien.top 1 points 11 months ago

I'll go out on a limb and say that no one has compiled a glossary or encyclopedia of the various fine-tunes that seem to get published every day (if I'm wrong, I'm sure someone will correct me). If you're not connected to "the scene", or working with these models academically/professionally, it can be hard to become and stay initiated into the "secret" jargon that's developed around local LLMs. You can pick up a lot just by hanging out here, but you'll still run into quite a few things that make you ask "wtf does that mean?".

[–] localhost80@alien.top 1 points 11 months ago

I'm confused by all these usernames that appear on Reddit. Is there a glossary of all these usernames and similar ones?

What do these names mean? Nderstand2grow, ...