This post was submitted on 13 Nov 2023

LocalLLaMA


Community to discuss Llama, the family of large language models created by Meta AI.


I'm currently using a GTX 1650 4GB, an AMD Ryzen 5 5600, and 32GB of RAM.

I have some spare cash to spend on learning more about local LLMs.

Should I get:
A. 64GB of RAM (2x32GB)
B. An RTX 3060 12GB
C. An Intel Arc A770 16GB

I'm running OpenHermes 2.5 Mistral 7B (Q5_K_M GGUF), which gives OK-ish performance in SillyTavern with koboldcpp, but once the context goes above 3k tokens it crawls.
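(For reference, here is a minimal sketch of what partial GPU offload looks like via llama-cpp-python, the library behind koboldcpp's backend; the model filename and layer count are assumptions to tune for a 4GB card. koboldcpp exposes the same knobs as --gpulayers and --contextsize.)

```python
# Minimal sketch using llama-cpp-python (pip install llama-cpp-python).
# The model filename and layer count are assumptions: tune n_gpu_layers
# to whatever fits in VRAM (a 4GB card holds only part of a 7B Q5_K_M model).
from llama_cpp import Llama

llm = Llama(
    model_path="openhermes-2.5-mistral-7b.Q5_K_M.gguf",  # assumed local file
    n_ctx=4096,       # context window; prompt processing slows as this fills
    n_gpu_layers=12,  # layers offloaded to the GPU; 0 = pure CPU
)

out = llm("Say hello in one sentence.", max_tokens=32)
print(out["choices"][0]["text"])
```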

Please advise which option you think I should take first. Thanks a bunch.

tech92yc@alien.top 1 point 1 year ago

Get a used Nvidia GPU; CUDA acceleration changes everything (20-50x performance).

Don't waste your time on CPU inference, and the Intel A770 doesn't have the software support.
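If you want to put a rough number on that gap on your own hardware, here is a hedged timing sketch with llama-cpp-python (a CUDA-enabled build is assumed, and the model filename is a placeholder):

```python
# Rough throughput comparison: CPU-only vs. full GPU offload.
# Assumes a CUDA build of llama-cpp-python and a local GGUF file.
import time
from llama_cpp import Llama

MODEL = "openhermes-2.5-mistral-7b.Q5_K_M.gguf"  # placeholder filename
PROMPT = "Explain quantization in one paragraph."

def tokens_per_second(n_gpu_layers: int) -> float:
    # n_gpu_layers=0 keeps everything on the CPU; -1 offloads every layer.
    llm = Llama(model_path=MODEL, n_ctx=2048,
                n_gpu_layers=n_gpu_layers, verbose=False)
    start = time.perf_counter()
    out = llm(PROMPT, max_tokens=128)
    elapsed = time.perf_counter() - start
    return out["usage"]["completion_tokens"] / elapsed

print(f"CPU only : {tokens_per_second(0):.1f} tok/s")
print(f"Full GPU : {tokens_per_second(-1):.1f} tok/s")
```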