this post was submitted on 15 Nov 2023

NVIDIA released a new 8B base model (and a few fine-tunes), albeit under a restrictive license.

https://huggingface.co/nvidia/nemotron-3-8b-base-4k

Happily, they did specify enough details about their training regimen for the model to be a useful data point.

They also note that they trained on all the training sets for all the popular benchmarks, which...at least they're honest about.

ntn8888@alien.top 1 points 10 months ago

An 8B model? Surely releasing larger ones would be good for their own game :/