1: What is your budget?
2: Do you have access to the data center and are you able to put in a GPU?
3: Does the server have SXM sockets or just PCIE?
Community to discuss about Llama, the family of large language models created by Meta AI.
1: What is your budget?
2: Do you have access to the data center and are you able to put in a GPU?
3: Does the server have SXM sockets or just PCIE?
Well, the budget is $0 at the experimental stage -- we wanted to see what we could achieve by throwing a lot of RAM at the problem.
If we have to add a GPU, Yeah, we have access to the server. Current box would accommodate a Gen3 PCIe, FHHL slot. Frankly, if we have to invest in a GPU, we'll also upgrade the whole server to a later-gen/more-powerful CPU, NVME storage, etc.
you could get a gpu like a p100 16gb for simple ai work or a v100/A4 for slightly more heavy duty work
p100s only cost around 170$, so it's cheap to upgrade the gpu