this post was submitted on 25 Nov 2023
1 points (100.0% liked)
LocalLLaMA
3 readers
1 users here now
Community to discuss about Llama, the family of large language models created by Meta AI.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
SIGNIFICATNLY less - it is not a transformer that goes totally quadratic.
It is not a transformer?
Nope, RNN without attention, with some tricks for enabling parallel training.
Its basically... 0?
From github: