this post was submitted on 28 Nov 2023
LocalLLaMA
Community to discuss about Llama, the family of large language models created by Meta AI.
It's not that CPUs are slow; it's that the RAM a CPU is typically connected to is slow.
That's why unified memory is fast: it's simply faster memory that happens to be connected to the CPU.
UMA has a lot more to do with the speed than physical distance does, and a GPU has a very different architecture and memory access pattern than a CPU.
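To put rough numbers on the bandwidth point, here's a back-of-envelope sketch. It assumes token generation is memory-bound, meaning every weight is read from memory once per generated token, so bandwidth divided by model size gives a ceiling on tokens per second. The function names, the 7B model at about 4 bits per weight, and the 400 GB/s unified-memory figure are all illustrative assumptions, not measurements; real throughput will be lower.

```python
def peak_bandwidth_gb_s(mega_transfers_per_sec, bus_width_bits, channels=1):
    """Theoretical peak memory bandwidth in GB/s (hypothetical helper)."""
    bytes_per_transfer = bus_width_bits / 8
    return mega_transfers_per_sec * 1e6 * bytes_per_transfer * channels / 1e9


def tokens_per_sec_ceiling(bandwidth_gb_s, model_size_gb):
    """Upper bound on tokens/s if each token requires one full pass over the weights."""
    return bandwidth_gb_s / model_size_gb


# Dual-channel DDR4-3200: 3200 MT/s on a 64-bit bus, two channels.
ddr4_gb_s = peak_bandwidth_gb_s(3200, 64, channels=2)

# Assumed figure for a high-bandwidth unified-memory system.
unified_gb_s = 400.0

# Assumed model: 7B parameters at about 0.5 bytes per weight (4-bit quantization),
# ignoring the KV cache and any other overhead.
model_gb = 7e9 * 0.5 / 1e9

for name, bw in [("dual-channel DDR4-3200", ddr4_gb_s), ("unified memory (assumed)", unified_gb_s)]:
    ceiling = tokens_per_sec_ceiling(bw, model_gb)
    print(f"{name}: {bw:.1f} GB/s -> at most ~{ceiling:.0f} tokens/s")
```

Same CPU cores in both cases; the only thing that changed is how fast the weights can be streamed out of memory.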