this post was submitted on 24 Jun 2024

-16 points (38.2% liked)

Selfhosted

49492 readers

886 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.
No spam posting.
Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.
Don't duplicate the full text of your blog or github here. Just post the link for folks to click.
Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
No trolling.

Resources:

selfh.st Newsletter and index of selfhosted software and apps
awesome-selfhosted software
awesome-sysadmin resources
Self-Hosted Podcast from Jupiter Broadcasting

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago

MODERATORS

ruud@lemmy.world

Loki@lemmy.world

CannaVet@lemmy.world

HybridSarcasm@lemmy.world

devve@lemmy.world

HybridSarcasm@lemmy.hybridsarcasm.xyz

-16

Is it possible to run a LLM on a mini-pc like the GMKtec K8 and K9? (lemmy.world)

submitted 1 year ago* (last edited 1 year ago) by TheBigBrother@lemmy.world to c/selfhosted@lemmy.world

40 comments fedilink hide all child comments

I have experience in running servers, but I would like to know if it's possible to do it, I just need a GPT 3.5 like private LLM running.

all 41 comments

sorted by: hot top controversial new old

[–] StrawberryPigtails@lemmy.sdf.org 10 points 1 year ago (2 children)

It's doable. Stick to the 7b models and it should work for the most part, but don't expect anything remotely approaching what might be called reasonable performance. It's going to be slow. But it can work.

To get a somewhat usable experience you kinda need an Nvidia graphics card or an AI accelerator.

[–] 1rre@discuss.tchncs.de 4 points 1 year ago (1 children)

Intel Arc also works surprisingly fine and consistently for ML if you use llama.cpp for LLMs or Automatic for stable diffusion, it's definitely much closer to Nvidia in terms of usability than it is to AMD

[+] TheBigBrother@lemmy.world -27 points 1 year ago* (last edited 1 year ago) (3 children)

I need it to make academic works pass the anti-AI systems, what do you recommend for that work? It's for business so I need a reasonable good performance but nothing extravagant..

I believe commercial LLMs have some kind of watermark when you apply AI for grammar and fixing in general, so I just need an AI to make these works undetectable with a private LLM.

[–] entropicdrift@lemmy.sdf.org 12 points 1 year ago (2 children)

I believe commercial LLMs have some kind of watermark when you apply AI for grammar and fixing in general, so I just need an AI to make these works undetectable with a private LLM.

That's not how it works, sorry.

[–] TheBigBrother@lemmy.world 0 points 1 year ago* (last edited 1 year ago) (2 children)

I was talking about that with a friend some days ago, and they made an experiment, they just made the AI correct punctuation errors of a text document, no words at all which you can easily add manually, and the anti-AI system target 99% AI made, I don't know how to explain that, maybe the text was AI generated also IDK or there is a watermark in some place, a pattern or something.

Edit: you point will be that there is no way to fool the anti-AI systems running a private LLM?

[–] entropicdrift@lemmy.sdf.org 7 points 1 year ago* (last edited 1 year ago) (2 children)

Just that they're no easier to use to fool an anti-AI system than using ChatGPT, Gemini, Bing, or Claude. Those AI detectors also give false positives on works made by humans. They're unreliable in the first place.

Basically, they're "boring text detectors" more than anything else.

[–] TheBigBrother@lemmy.world 0 points 1 year ago

I have a friend who is running a business of doing homework on demand, he is using AI to do the work, he got back a work because AI generated content was detected on it, he used to employ real people to do the work but anyway real people used AI too sometimes, so he knows I'm a "hacker" LMAO and asked me if I knew any way to fool the anti-AI systems, I thought about running a private LLM and training it with real human generated content like ebooks depending on the subject of the work, do you think it could be possible to fool these things with this method?

[+] TheBigBrother@lemmy.world -17 points 1 year ago* (last edited 1 year ago) (2 children)

I have a friend who is running a business of doing homework on demand, he is using AI to do the work, he got a work returned because AI generated content was detected on it, he used to employ real people to do the work but anyway real people used AI too sometimes, so he knows I'm a "hacker" LMAO and asked me if I knew any way to fool the anti-AI systems, I thought about running a private LLM and training it with real human generated content like ebooks depending on the subject of the work, do you think it could be possible to fool these things with this method?

[–] entropicdrift@lemmy.sdf.org 15 points 1 year ago (1 children)

So first of all, you shouldn't involve yourself in your friend's business. Fraud is generally frowned upon.

But secondly, you know that ChatGPT was trained on the entire internet, right? Like, every book. I don't think "more books" is gonna help.

I hope you take your computer skills and make something of yourself. Try not to get any more involved in this scheme, seriously. You don't need this crap marring your reputation.

Besides, there are better reasons/ways to fight the system than helping other people avoid learning.

[+] TheBigBrother@lemmy.world -21 points 1 year ago* (last edited 1 year ago) (4 children)

TBH I'm going down the rabbit hole hard, it's the way I am, if I get an idea I am not happy until it start making money, as I see it is not completely bad, education it's a fucking shitty mess, just a way to get money away of people(making them paying a loan for 30 years) and perpetuating the fake idea of social status, If we get some of these bucks in the way I didn't see what's wrong about it, anyway these dumb people will do their things one way or another.

[–] LunarLoony@lemmy.sdf.org 9 points 1 year ago

if I get an idea I am not happy until it start making money

That sounds extremely unsustainable

[–] hperrin@lemmy.world 4 points 1 year ago (1 children)

You are not a good person if this is how you want to get through life.

[–] LengAwaits@lemmy.world 3 points 1 year ago (1 children)

This is some top tier mental gymnastics. Holy shit, I hope you're a troll. You're literally on the internet discussing your plans to commit fraud. Mensa-level shit, here.

People are going to buy CP one way or another... that means you should make it and sell it to them, right?

Grow the fuck up, and maybe train a LLM on ethics, you're going to need some education on the subject if you hope to stay out of prison.

[–] hperrin@lemmy.world 3 points 1 year ago

Your “friend's” business is very unethical. Maybe your friend should think about what they’re doing with their life, and quit doing this.

[–] al4s@feddit.de 3 points 1 year ago (2 children)

LLMs work by always predicting the next most likely token and LLM detection works by checking how often the next most likely token was chosen. You can tell the LLM to choose less likely tokens more often (turn up the heat parameter) but you will only get gibberish out if you do. So no, there is not.

[–] TheBigBrother@lemmy.world 0 points 1 year ago

What about if you train the AI with human generated content? For example e-books?

[–] hperrin@lemmy.world 10 points 1 year ago (1 children)

Maybe just write the academic works yourself, then they should pass.

[–] MangoPenguin@lemmy.blahaj.zone 1 points 1 year ago

Something with a GPU that's good for LLMs would be best.

[–] Tempo@lemmy.ml 8 points 1 year ago (3 children)

They're Ryzen processors with "AI" accelerators, so an LLM can definitely run on hardware on one of those. Other options are available, like lower powered ARM chipsets (RK3588-based boards) with accelerators that might have half the performance but are far cheaper to run, should be enough for a basic LLM.

[–] exu@feditown.com 3 points 1 year ago

I don't know of any project that already supports that AI processor. You'd still be using the CPU and GPU at the moment.

[–] TheBigBrother@lemmy.world 1 points 1 year ago* (last edited 1 year ago)

The K8 it's Ryzen, the K9 Intel, money isn't a problem and it's not a spending it's a investment I need it for business, which of these two models would you recommend for a reasonable good LLM and Stable Diffusion?

I'm looking for the most cost-effective solution.

[–] MasterNerd@lemm.ee 5 points 1 year ago (2 children)

Look into ollama. It shouldn't be an issue if you stick to 7b parameter models

[–] TheBigBrother@lemmy.world 1 points 1 year ago (1 children)

Yeah, I did see something related to what you mentioned and I was quite interested. What about quantized models?

[–] MasterNerd@lemm.ee 2 points 1 year ago (2 children)

I don't have any experience with them honestly so I can't help you there

[–] TheBigBrother@lemmy.world 1 points 1 year ago

Appreciate you 👍👍

[–] TheBigBrother@lemmy.world -5 points 1 year ago

Appreciate you 👍👍

[–] TheBigBrother@lemmy.world -5 points 1 year ago (1 children)

Yeah, I did see something related to what you mentioned and I was quite interested. What about quantized models?

[–] entropicdrift@lemmy.sdf.org 3 points 1 year ago (2 children)

Quantized with more parameters is generally better than floating point with fewer parameters. If you can squeeze a 14b parameter model down to a 4-bit int quantization it'll still generally outperform a 16-bit Floating Point 7b parameter equivalent.

[–] TheBigBrother@lemmy.world 1 points 1 year ago

Interesting information mate, I'm documenting myself into the subject, thx for the help 👍👍