this post was submitted on 27 Nov 2023
LocalLLaMA
A community for discussing Llama, the family of large language models created by Meta AI.
I'm willing to wait for quality so that's no problem!
Where can I go to find these models? And how do I set them up and get them running?
If you're on Windows, I'd download KoboldCPP and TheBloke's GGUF models from HuggingFace.
Then you just launch KoboldCPP, select the .gguf file, select your GPU, enter the number of layers to offload, set the context size (4096 for those models), and hit launch.
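If you'd rather skip the launcher window, the same settings can be passed on the command line. Here's a rough sketch in Python that just builds the command. The flag names (`--model`, `--gpulayers`, `--contextsize`, `--usecublas`) are what I believe KoboldCPP accepts, so check `koboldcpp.exe --help` on your version. The file name `model.gguf` and the layer count are placeholders, not recommendations.

```python
import subprocess


def build_koboldcpp_command(model_path, gpu_layers, context_size=4096,
                            use_cublas=False, exe="koboldcpp.exe"):
    """Build the argument list for launching KoboldCPP.

    Flag names are assumptions based on KoboldCPP's CLI; verify with --help.
    """
    cmd = [
        exe,
        "--model", model_path,               # path to the .gguf file
        "--gpulayers", str(gpu_layers),      # number of layers to offload to the GPU
        "--contextsize", str(context_size),  # 4096 matches the GUI setting above
    ]
    if use_cublas:
        # Assumed flag for NVIDIA GPUs; other backends use different flags.
        cmd.append("--usecublas")
    return cmd


if __name__ == "__main__":
    # Placeholder values: swap in your own file and a layer count that fits your VRAM.
    command = build_koboldcpp_command("model.gguf", gpu_layers=35, use_cublas=True)
    print(" ".join(command))
    # Uncomment to actually start the server (requires koboldcpp.exe in this folder):
    # subprocess.run(command, check=True)
```

If the model doesn't fit in VRAM, lower the `gpu_layers` value and try again.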
Then you're good to start messing around. You can use the Kobold interface that pops up, or connect to it through the API with something like SillyTavern.
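For the API route, here's a minimal sketch of what a frontend does under the hood. It assumes KoboldCPP is running locally on its usual port (5001) and serving the KoboldAI-style `/api/v1/generate` endpoint. The URL, the `max_length` value, and the helper names are my assumptions, so adjust them to your setup.

```python
import json
import urllib.request


def build_generate_request(prompt, max_length=120, max_context_length=4096):
    """Build the JSON payload for a generate call (field names assumed from the KoboldAI API)."""
    return {
        "prompt": prompt,
        "max_length": max_length,                  # tokens to generate
        "max_context_length": max_context_length,  # should match the context size you launched with
    }


def extract_text(response_body):
    """Pull the generated text out of a response shaped like {"results": [{"text": ...}]}."""
    data = json.loads(response_body)
    return data["results"][0]["text"]


def generate(prompt, base_url="http://localhost:5001"):
    """Send a prompt to a locally running KoboldCPP instance and return the completion."""
    payload = json.dumps(build_generate_request(prompt)).encode("utf-8")
    request = urllib.request.Request(
        base_url + "/api/v1/generate",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    # Generation can be slow on big models, so allow a generous timeout.
    with urllib.request.urlopen(request, timeout=300) as response:
        return extract_text(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Requires KoboldCPP to already be running with a model loaded.
    print(generate("Once upon a time"))
```

SillyTavern handles all of this for you once you point it at the same address, so this is only worth doing if you want to script things yourself.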