this post was submitted on 25 Nov 2023

LocalLLaMA


So RWKV 7B v5 is 60% trained now. I saw that its multilingual performance is now better than Mistral's, and its English capabilities are close to Mistral's, except on HellaSwag and ARC, where it's a little behind. All the benchmarks are on the RWKV Discord, and you can google the pros and cons of RWKV, though most of what you'll find is about v4.

Thoughts?

ambient_temp_xeno@alien.top 1 point 11 months ago (8 children)
satireplusplus@alien.top 1 point 11 months ago

The models are Apache 2.0 afaik, and there aren't that many base models that can be used commercially without restrictions.
