this post was submitted on 12 May 2026

In addition to making people stupid, I wonder what effect LLMs like Claude will have on programmers. How will new programmers learn if companies start using Claude?

[–] StarryPhoenix97@lemmy.world 4 points 1 week ago* (last edited 1 week ago) (1 children)

That's on my to-do list. I'm currently reworking my entire build because I realized I had enough last-generation parts to build a media server. Once I have Windows set up to run only in a VM and get my stuff moved and backed up, I'm going to install an LLM.

[–] Franconian_Nomad@feddit.org 2 points 1 week ago (1 children)

I recommend Qwen3.6, either the 27B dense or the 35B MoE model. Both are outstanding for local models.

[–] dontbelievethis@sh.itjust.works 2 points 1 week ago (1 children)

What hardware are you using?

I am using qwen3.5 9b. And it is barely working.

[–] Franconian_Nomad@feddit.org 1 points 1 week ago* (last edited 1 week ago)

I have a Radeon RX 7800 XT.

Qwen 3.5-9b is blazingly fast on it. However, while it’s impressive for its size, it has its limitations: complex tasks with several steps are too much for it.

So now I run the 3.6-35B model with llama.cpp. It’s too big for my VRAM, so I had to split it: everything that doesn’t fit on the graphics card runs in normal RAM. That slows everything down, but with the right flags I get a bit over 20 tokens/s.
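
For anyone wanting to try the same kind of split, here is a rough sketch of the idea, not my exact setup. It guesses how many layers fit in VRAM and builds a llama.cpp command with `--n-gpu-layers`, which sets how many layers go to the GPU (the rest stay in system RAM). The model file name, the 20 GiB size, the 40-layer count and the 2 GiB reserve are made-up placeholders, and treating all layers as equally sized is a simplification, so take the result as a starting point and tune from there.

```python
import shlex


def layers_that_fit(model_gib: float, n_layers: int, vram_gib: float,
                    reserve_gib: float = 2.0) -> int:
    """Estimate how many layers fit in VRAM.

    Assumption: every layer is the same size (model size / layer count).
    reserve_gib is headroom kept free for context and buffers (a guess).
    """
    if n_layers <= 0 or model_gib <= 0:
        raise ValueError("model_gib and n_layers must be positive")
    per_layer_gib = model_gib / n_layers
    usable_gib = max(vram_gib - reserve_gib, 0.0)
    return min(n_layers, int(usable_gib // per_layer_gib))


def llama_server_args(model_path: str, gpu_layers: int) -> list[str]:
    """Build a llama-server command line that offloads gpu_layers to the GPU."""
    return ["llama-server", "-m", model_path, "--n-gpu-layers", str(gpu_layers)]


if __name__ == "__main__":
    # Hypothetical numbers: a 20 GiB model file with 40 layers on a 16 GiB card.
    n = layers_that_fit(model_gib=20.0, n_layers=40, vram_gib=16.0)
    print(shlex.join(llama_server_args("model.gguf", n)))
```

With those placeholder numbers the estimate comes out to 28 layers on the GPU. If it runs out of memory, lower the number; if there is VRAM to spare, raise it.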

If you have problems with speed and you’re using ollama, I would replace it with something faster like llama.cpp.