LocalLLaMA

11 readers

4 users here now

Community to discuss about Llama, the family of large language models created by Meta AI.

founded 2 years ago

MODERATORS

communick@poweruser.forum

Is LocalLLaMA on RunPod cheaper than Chat GPT4 for text prompts? (alien.top)

submitted 2 years ago by allun11@alien.top to c/localllama@poweruser.forum

4 comments fedilink hide all child comments

I have a query which costs around 300 tokens, and as 1000 tokens cost 0,06 USD that translates to roughly 0,02 USD for that request.

Let say I would deploy a LocalLLaMA on RunPod, on one of the cheaper machines, would that request be cheaper than running it on GPT4?

you are viewing a single comment's thread
view the rest of the comments

[–] tenmileswide@alien.top 1 points 2 years ago

Depends entirely on what model you want. The llama-2 13b serverless endpoint would only cost $0.001 for that request on Runpod.

If you rent a cloud pod it's going to cost the same per hour no matter how much or little you send to it so it's based entirely on the number of requests you can get sent to it.