this post was submitted on 22 Nov 2023
LocalLLaMA
Community for discussing Llama, the family of large language models created by Meta AI.
Predicting the loss is very different from predicting real-world abilities. They are able to do the former, not the latter.
Predicting the future loss once you're already 10% into training is fairly trivial. Predicting the actual abilities, though, is not.
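
To make the "fairly trivial" part concrete, here's a rough toy sketch of extrapolating a loss curve from the first 10% of training. Everything in it is an assumption for illustration: the loss curve is synthetic and noise-free, the power-law-plus-constant form `a * step**(-b) + c` is just a common choice for this kind of fit, and `fit_power_law` is a made-up helper, not anyone's actual method.

```python
import math

def fit_power_law(steps, losses, n_grid=200):
    """Fit loss ~= a * step**(-b) + c (hypothetical helper).

    Grid-searches the constant c, and for each candidate does ordinary
    least squares on log(loss - c) vs log(step) to get a and b.
    """
    best = None
    min_loss = min(losses)
    xs = [math.log(s) for s in steps]
    n = len(xs)
    mx = sum(xs) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    for i in range(n_grid):
        # Candidate floor, always strictly below the smallest observed loss
        c = min_loss * i / n_grid
        ys = [math.log(l - c) for l in losses]
        my = sum(ys) / n
        sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
        slope = sxy / sxx
        a, b = math.exp(my - slope * mx), -slope
        # Score each candidate by squared error in the original loss space
        err = sum((a * s ** (-b) + c - l) ** 2 for s, l in zip(steps, losses))
        if best is None or err < best[0]:
            best = (err, a, b, c)
    return best[1], best[2], best[3]

# Synthetic "true" curve (assumed values, not real training data)
TRUE_A, TRUE_B, TRUE_C = 10.0, 0.5, 1.7

def true_loss(step):
    return TRUE_A * step ** (-TRUE_B) + TRUE_C

total_steps = 100_000
# Only observe the first 10% of training
steps = list(range(100, total_steps // 10 + 1, 100))
losses = [true_loss(s) for s in steps]

a, b, c = fit_power_law(steps, losses)
predicted = a * total_steps ** (-b) + c
true_final = true_loss(total_steps)
print(f"predicted final loss: {predicted:.4f}, actual: {true_final:.4f}")
```

Real curves are noisier and shift with learning-rate schedules, so treat this as a sketch only. Note that nothing here tells you what the model can actually do at that loss, which is the hard part.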