This post was submitted on 18 Nov 2023

LocalLLaMA


Community to discuss Llama, the family of large language models created by Meta AI.

1 comment
Thistleknot@alien.top · 10 months ago

I was going to try knowledge distillation, but they modified their tokenizer.
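
For anyone curious why the tokenizer matters: here's a minimal sketch of what logit-level distillation usually looks like in PyTorch (my own illustration, not anything from the repo). It assumes teacher and student share a tokenizer/vocab so the logits line up position for position, which is exactly the assumption their tokenizer change breaks:

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    # Soften both distributions; a higher temperature exposes the teacher's
    # relative preferences among non-top tokens.
    t = temperature
    log_student = F.log_softmax(student_logits / t, dim=-1)
    soft_teacher = F.softmax(teacher_logits / t, dim=-1)
    # KL(teacher || student), scaled by t^2 so gradient magnitudes stay
    # comparable across temperatures.
    return F.kl_div(log_student, soft_teacher, reduction="batchmean") * t**2

# Both models must score the SAME token ids for this to make sense:
# logits = model(input_ids).logits  ->  shape (batch, seq, vocab)
```

Once the vocabularies diverge, the vocab dimensions no longer match and this loss can't be computed directly.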

Either way, GPT-Neo has a 125M model, so a 248M model is roughly 2x that. I imagine this could be useful for shorter-context tasks, or for continued training on very narrow use cases.

I came across it while looking for tiny Mistral config JSONs to replicate.
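
In case it helps anyone else hunting for tiny configs: you can build one from scratch with Hugging Face's `MistralConfig`. The shapes below are my guesses at something that lands near 248M parameters, not the actual model's config.json:

```python
from transformers import MistralConfig, MistralForCausalLM

# Hypothetical tiny-Mistral shapes; ~248M params by rough count.
config = MistralConfig(
    vocab_size=32000,
    hidden_size=1024,
    intermediate_size=4096,
    num_hidden_layers=12,
    num_attention_heads=16,
    num_key_value_heads=4,         # grouped-query attention, Mistral-style
    max_position_embeddings=2048,  # short context to match the use case
    sliding_window=1024,
)
model = MistralForCausalLM(config)
print(f"{model.num_parameters() / 1e6:.0f}M parameters")
```

`config.save_pretrained(".")` then writes out a config.json you can diff against whatever the real model ships with.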

https://preview.redd.it/l9l7a39u3a1c1.jpeg?width=720&format=pjpg&auto=webp&s=80589cb6fbb2268b0d8af65b4ec27647185b4780