this post was submitted on 30 Nov 2023

LocalLLaMA


Community to discuss Llama, the family of large language models created by Meta AI.

founded 10 months ago

I don't have the budget for hosting models on a dedicated GPU. What are the alternative options or platforms that let me use open-source models like Mistral, Llama, etc. on a pay-per-API-call basis?

top 7 comments
[–] pictoria_dev@alien.top 1 points 9 months ago

I'm currently exploring different models too, in particular for coding. I tried deepseek-coder on their official website and it was good. Unfortunately they collect chat data. Does anyone know of a pay-as-you-go service that offers this model?

[–] theodormarcu@alien.top 1 points 9 months ago

What's the use case? Chatting with them, or for your own apps?

Check out OpenRouter too.

[–] teddybear082@alien.top 1 points 9 months ago

OpenRouter; some of the models hosted there are free.

Google Colab, but it depends on how long Google will let you use it for free (you can also pay monthly).
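For anyone wondering what "pay per API call" looks like in practice: OpenRouter exposes an OpenAI-compatible chat completions endpoint, so a plain HTTP POST is enough. A minimal sketch below, assuming you have an OpenRouter key in an `OPENROUTER_API_KEY` environment variable; the model slug is illustrative.

```python
# Minimal sketch: building a pay-per-call request to OpenRouter's
# OpenAI-compatible chat completions endpoint (stdlib only).
import json
import os
import urllib.request

API_URL = "https://openrouter.ai/api/v1/chat/completions"

def build_request(model: str, prompt: str, api_key: str) -> urllib.request.Request:
    """Build the HTTP request; you are billed per token, not per GPU-hour."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )

req = build_request(
    "mistralai/mistral-7b-instruct",          # illustrative model slug
    "Hello!",
    os.environ.get("OPENROUTER_API_KEY", "sk-demo"),
)
# Actually sending it requires a valid key and funds on the account:
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
print(req.full_url)
```

The same request shape works against any OpenAI-compatible host, so switching providers is mostly a matter of changing the base URL and key.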

[–] TradingDreams@alien.top 1 points 9 months ago

It may be out of your range, but you can pick up a Dell Precision 7720 with a 16 GB Quadro P5000 GPU for about $500 on eBay. The Quadro P5000 also appears in a few other workstation laptops from that era. Note: these laptops shipped with other graphics options, so only go for the P5000 models.

[–] DarthNebo@alien.top 1 points 9 months ago

Hugging Face has Inference Endpoints, which can be private or public as needed, with sleep (scale-to-zero) built in.

[–] sbashe@alien.top 1 points 9 months ago

https://www.anyscale.com/endpoints#hosted Good service. I use it all the time. It also has fine-tuning options if you need them.

[–] ThisGonBHard@alien.top 1 points 9 months ago

One I used before is runpod.io, but it is a pay-per-time platform (you rent GPU time), not a pay-per-API-call one.