noobgolang

joined 11 months ago
[–] noobgolang@alien.top 1 points 9 months ago (1 children)

Are you balanced yourself? Or is that just your own false idea of balance?

 

I have been using this as my daily driver for a few days, and it's very good. I never thought a 7B model could reach this level of coding + chat:
https://huggingface.co/TheBloke/OpenHermes-2.5-neural-chat-7B-v3-1-7B-GGUF

[–] noobgolang@alien.top 1 points 9 months ago

For the CUDA version on Linux you can use this link: https://github.com/janhq/nitro/releases/download/v0.1.17/nitro-0.1.17-linux-amd64-cuda.tar.gz. You need to make sure the system has the CUDA toolkit installed. I recommend following the exact steps in the quickstart docs here to make sure it works: https://nitro.jan.ai/quickstart
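If you want to sanity-check the CUDA toolkit requirement from code before launching the binary, something like this works (a minimal sketch; it only checks that `nvcc` is on PATH, which is just one rough proxy for a toolkit install, and the quickstart docs are the authoritative reference):

```go
package main

import (
	"fmt"
	"os/exec"
)

func main() {
	// Rough check: the CUDA build of nitro needs the CUDA toolkit on the system.
	// Looking for nvcc on PATH is one simple proxy for that; see the quickstart
	// docs for the exact requirements.
	path, err := exec.LookPath("nvcc")
	if err != nil {
		fmt.Println("nvcc not found on PATH; install the CUDA toolkit before running the CUDA build")
		return
	}

	// Print the toolkit version reported by nvcc.
	out, err := exec.Command(path, "--version").CombinedOutput()
	if err != nil {
		fmt.Println("failed to run nvcc --version:", err)
		return
	}
	fmt.Printf("found nvcc at %s:\n%s", path, out)
}
```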

[–] noobgolang@alien.top 1 points 9 months ago (1 children)

Apple's M1 models, and on the main page it mentions M2 models as well?

Yeah, the arm64 Mac build should run on all Macs, M1 and M2 included. We also have a CUDA version in the release.

[–] noobgolang@alien.top 1 points 9 months ago (3 children)

Also, the build is 100% done in public from the source code on the page; you can check the Actions tab to see it. There is nothing hidden here.

[–] noobgolang@alien.top 1 points 9 months ago (7 children)

You can try https://nitro.jan.ai/; it's built for this purpose.

[–] noobgolang@alien.top 0 points 9 months ago

A simple file system will do that?

[–] noobgolang@alien.top 1 points 10 months ago

Sort of, but it's lighter and smaller, suitable to run as a subprocess inside your app.
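As a rough illustration of the subprocess idea (a minimal sketch; the `./nitro` binary path and port 3928 are assumptions on my side, check the docs for the actual defaults and arguments):

```go
package main

import (
	"fmt"
	"log"
	"net"
	"os/exec"
	"time"
)

func main() {
	// Start the server binary as a child process of your app.
	// The binary path here is an assumption; point it at wherever you unpacked the release.
	cmd := exec.Command("./nitro")
	if err := cmd.Start(); err != nil {
		log.Fatalf("failed to start nitro: %v", err)
	}
	// Make sure the subprocess is cleaned up when the app exits.
	defer cmd.Process.Kill()

	// Wait until the server accepts connections.
	// Port 3928 is an assumption; use whatever port your setup listens on.
	addr := "127.0.0.1:3928"
	for i := 0; i < 50; i++ {
		conn, err := net.DialTimeout("tcp", addr, 200*time.Millisecond)
		if err == nil {
			conn.Close()
			fmt.Println("nitro is up at", addr)
			return
		}
		time.Sleep(200 * time.Millisecond)
	}
	log.Fatal("nitro did not come up in time")
}
```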

[–] noobgolang@alien.top 1 points 10 months ago

I self-host it on my homelab; it works very well.

[–] noobgolang@alien.top 1 points 10 months ago (1 children)

Disclosure: I'm the maintainer of the Nitro project.

We have a simple llama server that is just a single binary you can download and try right away: https://github.com/janhq/nitro. It is a viable option if you want to set up an OpenAI-compatible endpoint to test out a new model.
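For example, once the server is running you can hit it like any other OpenAI-compatible endpoint (a minimal sketch; the localhost:3928 base URL and the model name are assumptions, substitute whatever your setup uses):

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"io"
	"log"
	"net/http"
)

func main() {
	// Standard OpenAI-style chat completion request.
	// The base URL and model name below are placeholders; use the host/port
	// your instance listens on and the model you loaded.
	body, _ := json.Marshal(map[string]any{
		"model": "my-local-model",
		"messages": []map[string]string{
			{"role": "user", "content": "Write a haiku about local LLMs."},
		},
	})

	resp, err := http.Post(
		"http://127.0.0.1:3928/v1/chat/completions",
		"application/json",
		bytes.NewReader(body),
	)
	if err != nil {
		log.Fatal(err)
	}
	defer resp.Body.Close()

	out, _ := io.ReadAll(resp.Body)
	fmt.Println(string(out))
}
```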
