- Deepseek 67B still beats XVERSE-65B in the benchmarking scores.
- The benchmarks indicate strong math and coding performance for these two model series.
- Yuan has a unique optional attention mechanism that enhances output quality
this post was submitted on 29 Nov 2023
1 points (100.0% liked)
LocalLLaMA
11 readers
4 users here now
Community to discuss about Llama, the family of large language models created by Meta AI.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments