this post was submitted on 29 Nov 2023

LocalLLaMA


Community to discuss about Llama, the family of large language models created by Meta AI.

founded 1 year ago

Have got some server hardware: one I use for games, 18 cores / 36 threads @ 3.2 GHz with 128 GB RAM (GTX 970, so GPU processing is a no-go, I assume); the other is similar but will have 256 GB. What's best for these?

I'm only starting out and don't understand the terms and measurements yet, but I'm in the process of preparing the software to try. I'd like to focus on the best options available to me.

Thanks

[–] andromedians@alien.top 1 points 11 months ago (1 children)

Thanks, was preparing a bit already. I have some variations of the software, plus Wizard-Vicuna13B-Uncensored.Q5_K_M, which stood out to me for some reason; also lzlv (Q5_K_M), a 70B model from a review in this sub (reportedly the best). I can alternate depending on need. Also all the ones from gpt4all, and the 7B (+unfiltered) Q4 LoRAs, which for some reason are the only ones hosted on torrent sites.

[–] candre23@alien.top 1 points 11 months ago (1 children)

70B models will be extremely slow on pure CPU, but you're welcome to try. There's no point in looking on "torrent sites" for LLMs; practically everything is hosted on Hugging Face.
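To see why pure-CPU 70B inference is so slow, here's a rough back-of-envelope sketch. Token generation is largely memory-bandwidth bound: each new token requires streaming roughly the full set of model weights from RAM once. The model size and bandwidth figures below are illustrative assumptions, not measurements of the poster's hardware.

```python
# Rough, memory-bandwidth-bound estimate of CPU token-generation speed.
# Assumption: generating one token streams all model weights from RAM once,
# so tokens/sec is approximately RAM bandwidth divided by model size.

def tokens_per_sec(model_size_gb: float, bandwidth_gb_s: float) -> float:
    """Upper-bound estimate of tokens generated per second on CPU."""
    return bandwidth_gb_s / model_size_gb

# Hypothetical numbers: a 70B model at Q5_K_M is on the order of 45-50 GB,
# and a multi-channel DDR4 server might sustain ~40 GB/s in practice.
print(round(tokens_per_sec(48.0, 40.0), 2))   # → 0.83 (under 1 token/sec)
print(round(tokens_per_sec(9.0, 40.0), 2))    # → 4.44 (a 13B Q5 model, ~9 GB)
```

Even as an optimistic upper bound, a 70B quant lands well under one token per second on typical server RAM, while a 13B quant stays in usable territory; that's the practical reason smaller models are recommended for CPU-only setups.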

[–] andromedians@alien.top 1 points 11 months ago

I know that now. What I was looking for was whether some disallowed, more powerful versions were hosted there; they aren't. Time to start exploring soon.