this post was submitted on 13 Nov 2023
1 points (100.0% liked)

LocalLLaMA

3 readers
1 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 1 year ago
MODERATORS
 

Using and losing lots of money on gpt-4 ATM, it works great but for the amount of code I'm generating I'd rather have a self hosted model. What should I look into?

you are viewing a single comment's thread
view the rest of the comments
[โ€“] leepenkman@alien.top 1 points 1 year ago (1 children)

i would grab a server like vllm or text-generator.io (open source too)
Then get a model like others have suggested like deepseek or something to put in the server (both those servers are OpenAI compatible so makes switching easy)

[โ€“] Charuru@alien.top 1 points 1 year ago

I've not heard of text-generator.io, is it as performant as vllm on multibatch or is it a wrapper around it?